Searched defs:dq_ (Results 1 – 1 of 1) sorted by relevance
813 std::optional<at::Tensor> &dq_, // batch_size x seqlen_q x num_heads x head_size in mha_bwd()1024 …std::optional<at::Tensor> &dq_, // total_q x num_heads x head_size, total_q := \sum_{i=0}^{b} s_i in mha_varlen_bwd()