1AlbertForMaskedLM,4 2AlbertForQuestionAnswering,4 3AllenaiLongformerBase,4 4BartForCausalLM,4 5BartForConditionalGeneration,2 6BertForMaskedLM,16 7BertForQuestionAnswering,16 8BigBird,32 9BlenderbotForCausalLM,32 10BlenderbotSmallForCausalLM,64 11BlenderbotSmallForConditionalGeneration,64 12CamemBert,16 13DebertaForMaskedLM,32 14DebertaForQuestionAnswering,8 15DebertaV2ForMaskedLM,16 16DebertaV2ForQuestionAnswering,2 17DistilBertForMaskedLM,128 18DistilBertForQuestionAnswering,256 19DistillGPT2,16 20ElectraForCausalLM,8 21ElectraForQuestionAnswering,8 22GoogleFnet,16 23GPT2ForSequenceClassification,4 24LayoutLMForMaskedLM,16 25LayoutLMForSequenceClassification,16 26M2M100ForConditionalGeneration,16 27MBartForCausalLM,4 28MBartForConditionalGeneration,2 29MegatronBertForCausalLM,4 30MegatronBertForQuestionAnswering,8 31MobileBertForMaskedLM,64 32MobileBertForQuestionAnswering,64 33MT5ForConditionalGeneration,16 34OPTForCausalLM,2 35PegasusForCausalLM,32 36PegasusForConditionalGeneration,32 37PLBartForCausalLM,8 38PLBartForConditionalGeneration,4 39RobertaForCausalLM,16 40RobertaForQuestionAnswering,16 41Speech2Text2ForCausalLM,32 42T5ForConditionalGeneration,4 43T5Small,1 44TrOCRForCausalLM,32 45XGLMForCausalLM,8 46XLNetLMHeadModel,8 47YituTechConvBert,16 48