Home
last modified time | relevance | path

Searched defs:numelPerWarp (Results 1 – 1 of 1) sorted by relevance

/external/pytorch/torch/csrc/distributed/c10d/
Dintra_node_comm.cu470 const auto numelPerWarp = numelPerThread * kWarpSize; in getLaunchConfig() local
520 const size_t numelPerWarp = in oneShotAllReduce() local
584 size_t numelPerWarp = kBytesPerThread / input.element_size() * kWarpSize; in twoShotAllReduce() local
639 size_t numelPerWarp = kBytesPerThread / input.element_size() * kWarpSize; in hybridCubeMeshAllReduce() local