Leveraging Pre-trained Language Model Checkpoints for Encoder-Decoder Models
Hugging Face releases a method to reuse pre-trained language model checkpoints in encoder-decoder architectures. The technique improves training efficiency and reduces resources needed to build performant seq2seq models.