Searched defs:thr_mma (Results 1 – 1 of 1) sorted by relevance

/external/pytorch/aten/src/ATen/native/transformers/cuda/flash_attn/
flash_fwd_kernel.h
162 auto thr_mma = tiled_mma.get_thread_slice(tidx); in compute_attn_1rowblock() local
608 auto thr_mma = tiled_mma.get_thread_slice(tidx); in compute_attn_1rowblock_splitkv() local