Introduction

Fairseq(-py) is a sequence modeling toolkit that allows researchers and developers to train custom models for translation, summarization, language modeling and other text generation tasks.

What's New:

August 2019: WMT'19 models released
July 2019: fairseq relicensed under MIT license
July 2019: RoBERTa models and code released
June 2019: wav2vec models and code released

Features:

Fairseq provides reference implementations of various sequence-to-sequence models, including:

Convolutional Neural Networks (CNN)
LightConv and DynamicConv models
- Pay Less Attention with Lightweight and Dynamic Convolutions (Wu et al., 2019)
Long Short-Term Memory (LSTM) networks
- Effective Approaches to Attention-based Neural Machine Translation (Luong et al., 2015)
Transformer (self-attention) networks

Additionally:

multi-GPU (distributed) training on one machine or across multiple machines
fast generation on both CPU and GPU with multiple search algorithms implemented:
- beam search
- Diverse Beam Search (Vijayakumar et al., 2016)
- sampling (unconstrained, top-k and top-p/nucleus)
large mini-batch training even on a single GPU via delayed updates
mixed precision training (trains faster with less GPU memory on NVIDIA tensor cores)
extensible: easily register new models, criterions, tasks, optimizers and learning rate schedulers

We also provide pre-trained models for several benchmark translation and language modeling datasets.

Requirements and Installation

PyTorch version >= 1.1.0
Python version >= 3.5
For training new models, you'll also need an NVIDIA GPU and NCCL
For faster training install NVIDIA's apex library with the --cuda_ext option

To install fairseq:

pip install fairseq

On MacOS:

CFLAGS="-stdlib=libc++" pip install fairseq

If you use Docker make sure to increase the shared memory size either with --ipc=host or --shm-size as command line options to nvidia-docker run.

Installing from source

To install fairseq from source and develop locally:

git clone https://github.com/pytorch/fairseq
cd fairseq
pip install --editable .

Getting Started

The full documentation contains instructions for getting started, training new models and extending fairseq with new model types and tasks.

Pre-trained models and examples

We provide pre-trained models and pre-processed, binarized test sets for several tasks listed below, as well as example training and evaluation commands.

Translation: convolutional and transformer models are available
Language Modeling: convolutional and transformer models are available

We also have more detailed READMEs to reproduce results from specific papers:

Join the fairseq community

Facebook page: https://www.facebook.com/groups/fairseq.users
Google group: https://groups.google.com/forum/#!forum/fairseq-users

License

fairseq(-py) is MIT-licensed. The license applies to the pre-trained models as well.

Citation

Please cite as:

@inproceedings{ott2019fairseq,
  title = {fairseq: A Fast, Extensible Toolkit for Sequence Modeling},
  author = {Myle Ott and Sergey Edunov and Alexei Baevski and Angela Fan and Sam Gross and Nathan Ng and David Grangier and Michael Auli},
  booktitle = {Proceedings of NAACL-HLT 2019: Demonstrations},
  year = {2019},
}

Code for the project carried out fulfilling the course requirements for Fall 2021 NLP at NYU

Related tags

Overview

Introduction

What's New:

Features:

Requirements and Installation

Getting Started

Pre-trained models and examples

Join the fairseq community

License

Citation

Owner

Sai Himal Allu

Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://casl-project.ai/

Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow

PyTorch Implementation of "Non-Autoregressive Neural Machine Translation"

Stanford CoreNLP provides a set of natural language analysis tools written in Java

A minimal Conformer ASR implementation adapted from ESPnet.

Awesome Treasure of Transformers Models Collection

A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis

Create a semantic search engine with a neural network (i.e. BERT) whose knowledge base can be updated

🏖 Easy training and deployment of seq2seq models.

✔👉A Centralized WebApp to Ensure Road Safety by checking on with the activities of the driver and activating label generator using NLP.

Translators - is a library which aims to bring free, multiple, enjoyable translation to individuals and students in Python

auto_code_complete is a auto word-completetion program which allows you to customize it on your need

Graph Coloring - Weighted Vertex Coloring Problem

Code for Emergent Translation in Multi-Agent Communication

Large-scale open domain KNOwledge grounded conVERsation system based on PaddlePaddle

Text-to-Speech for Belarusian language

Data loaders and abstractions for text and NLP

We have built a Voice based Personal Assistant for people to access files hands free in their device using natural language processing.

Code for "Semantic Role Labeling as Dependency Parsing: Exploring Latent Tree Structures Inside Arguments".

Implementation of N-Grammer, augmenting Transformers with latent n-grams, in Pytorch