It was actually just for learning purposes, but since it was trained for many hours on multiple GPUs, I thought it would also be useful to others if I put it in Hugging Face's model zoo, provided I am able to convert it.

Explanation: Gensim is a high-end, industry-level software package for topic modeling of a specific piece of text. Fairseq is a sequence modeling toolkit for machine translation, text summarization, language modeling, text generation, and other tasks. The Hugging Face Transformers library makes state-of-the-art NLP models like BERT and training techniques like mixed precision and gradient checkpointing easy to use. Fairseq also features multi-GPU training on one machine or across multiple machines, and fast beam search generation on both CPU and GPU. Fairseq includes Facebook's implementations of translation and language models, along with scripts for custom training.

Create a mask from the two sequences passed to be used in a sequence-pair classification task.

Hi @sshleifer, as mentioned above, I fine-tuned mbart.cc25 for machine translation (en-de) with Fairseq.
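For anyone unsure what the sequence-pair mask mentioned above looks like, here is a minimal sketch in plain Python. It assumes the BERT-style layout `[CLS] A [SEP] B [SEP]`, where the first segment (including its special tokens) is marked 0 and the second segment 1. The function name `create_token_type_ids` and the example token IDs are hypothetical and only for illustration. This is not the library's own code, and some models do not use token type IDs at all.

```python
from typing import List, Optional


def create_token_type_ids(
    ids_a: List[int], ids_b: Optional[List[int]] = None
) -> List[int]:
    """Build a segment mask for one sequence or a sequence pair.

    Assumed layout (BERT-style): [CLS] A [SEP] B [SEP]
    """
    # [CLS] + A + [SEP] all belong to segment 0.
    first_segment = [0] * (len(ids_a) + 2)
    if ids_b is None:
        return first_segment
    # B + trailing [SEP] belong to segment 1.
    return first_segment + [1] * (len(ids_b) + 1)


# Hypothetical token IDs, without special tokens.
mask = create_token_type_ids([7, 8, 9], [4, 5])
print(mask)  # [0, 0, 0, 0, 0, 1, 1, 1]
```

The mask has one entry per token in the final encoded input, so its length is the sum of both sequence lengths plus the three special tokens.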