Non-Autoregressive Predictive Coding

This repository contains the implementation of Non-Autoregressive Predictive Coding (NPC), as described in the preprint submitted to ICASSP 2021.

A quick example for training NPC

python main.py --config config/self_supervised/npc_example.yml \
               --task self-learning

For more complete examples, including downstream tasks, please see the example script.
For data preparation, please see preprocess.
For detailed hyperparameter settings and descriptions, please check out the example config file of NPC.
For all run-time options, use the -h flag.
Implementations of Autoregressive Predictive Coding (APC, Chung et al., 2019) and Vector-Quantized APC (VQ-APC, Chung et al., 2020) are also available; they use similar training/downstream execution, with example config files here.

Some notes

We found that the unmasked feature produced by the last ConvBlock layer is a better representation. In phone classification, switching to the unmasked feature (PER 25.6%) gave a 1.6-point absolute improvement over the masked feature (PER 27.2%). This result is not in the current preprint and will be added to the paper in a future revision. Please refer to the downstream examples to activate this option.
APC/VQ-APC are implemented with the following modifications for improvement (for the unmodified versions, please visit the official implementations of APC / VQAPC):
- Multi-group VQ is available for VQ-APC, but with VQ on the last layer only
- Using utterance-wise CMVN surface features (just as NPC does)
- Using Gumbel Softmax from the official PyTorch API
See the package requirements for the toolkits used. TensorBoard can be used to view the log files in --logdir.
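
As a rough illustration of the utterance-wise CMVN noted above, the sketch below normalizes each feature dimension using the mean and variance of a single utterance. It is a minimal example and not the repository's preprocessing code: the function name `utterance_cmvn`, the `eps` parameter, and the (num_frames, feature_dim) layout are assumptions made for illustration.

```python
import numpy as np


def utterance_cmvn(features: np.ndarray, eps: float = 1e-8) -> np.ndarray:
    """Apply cepstral mean and variance normalization to one utterance.

    `features` is assumed to have shape (num_frames, feature_dim).
    Statistics come from this utterance only, not from the whole corpus.
    """
    mean = features.mean(axis=0, keepdims=True)  # per-dimension mean over frames
    std = features.std(axis=0, keepdims=True)    # per-dimension std over frames
    return (features - mean) / (std + eps)       # eps guards against zero variance


# Hypothetical input: 200 frames of 80-dimensional surface features.
rng = np.random.default_rng(0)
utterance = rng.normal(loc=3.0, scale=2.0, size=(200, 80))
normalized = utterance_cmvn(utterance)
```

After normalization, each feature dimension has approximately zero mean and unit variance within the utterance, regardless of recording level or channel offset.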

Contact

Feel free to contact me with questions or feedback; my email can be found in the paper or on my personal page.

Citation

If you find our work and/or this repository helpful, please consider citing us:

@article{liu2020nonautoregressive,
  title   = {Non-Autoregressive Predictive Coding for Learning Speech Representations from Local Dependencies},
  author  = {Liu, Alexander and Chung, Yu-An and Glass, James},
  journal = {arXiv preprint arXiv:2011.00406},
  year    = {2020}
}


Owner

Alexander H. Liu
