November 2020

3 articles

Leveraging Pre-trained Language Model Checkpoints for Encoder-Decoder Models

Hugging Face releases a method to reuse pre-trained language model checkpoints in encoder-decoder architectures. The technique improves training efficiency and reduces resources needed to build performant seq2seq models.

Fine-tuning Code generation Tools

SIG

HYP

Hugging Face Blog·Nov 3

Porting fairseq wmt19 translation system to transformers

Hugging Face documents porting the WMT19 translation system from fairseq to the transformers library. Technical migration of a neural machine translation architecture to the transformers ecosystem, with reproduction of WMT19 benchmark results.

Benchmarks Code generation Tools

SIG

HYP

Hugging Face Blog·Nov 2

Hyperparameter Search with Transformers and Ray Tune

Hugging Face integrates Ray Tune for automated hyperparameter optimization in the Transformers library. This integration enables researchers and practitioners to efficiently tune training parameters (learning rate, batch size, etc.) using distributed search algorithms.

Tools Infrastructure Fine-tuning

SIG

HYP