Searched defs:gradient_aggregation_group (Results 1 – 1 of 1) sorted by relevance
145 optimizer_shard=False, gradient_aggregation_group=4, vocab_emb_dp=True): argument171 def gradient_aggregation_group(self): member in TransformerOpParallelConfig175 def gradient_aggregation_group(self, value): member in TransformerOpParallelConfig