
You're right. Encoder-only models like BERT aren't auto-regressive; they're trained with the MLM (masked language modeling) objective. Decoder-only models (GPT) and encoder-decoder models (T5) generate auto-regressively and are trained with the CLM (causal language modeling) objective, and sometimes with PrefixLM.
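The practical difference between these objectives mostly comes down to the attention mask. Here's a minimal numpy sketch (function names are mine, not from any library): MLM uses a fully bidirectional mask, CLM uses a causal (lower-triangular) mask, and PrefixLM is a hybrid where the prefix attends bidirectionally and the rest is causal.

```python
import numpy as np

def bidirectional_mask(n):
    # MLM (BERT-style): every token attends to every token.
    return np.ones((n, n), dtype=bool)

def causal_mask(n):
    # CLM (GPT-style): token i attends only to positions <= i.
    return np.tril(np.ones((n, n), dtype=bool))

def prefix_lm_mask(n, prefix_len):
    # PrefixLM: full attention within (and onto) the prefix,
    # causal attention everywhere else.
    mask = np.tril(np.ones((n, n), dtype=bool))
    mask[:, :prefix_len] = True
    return mask
```

So for `prefix_lm_mask(4, 2)`, tokens 0 and 1 see each other in both directions, while tokens 2 and 3 can see the whole prefix but only their own past.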
