Meta Language-Specific Layers in Multilingual Language Models

This repo contains the source codes for our paper

On Negative Interference in Multilingual Models: Findings and A Meta-Learning Treatment

Zirui Wang, Zachary C. Lipton, Yulia Tsvetkov

EMNLP 2020

Introduction

This repo contains code to train multilingual language models (XLM) that (1) contain language-specific layers, and (2) meta-learn these layers through gradient of gradient.

Language-specific layers are served as meta parameters, optimized using an iterative procedure. The goal is to remedy negative transfer in multilingual models through a meta training objective. Please see our paper for details.

Dependencies

Python 3
XLM
NumPy
PyTorch

Usage

The code is based on the official implementation of XLM. This repo only contains files that we modified from the original codebase. To train a model, please merge code with the source code of XLM, and then follow the standard preprocessing and training instructions there.

Meta Language-Specific Layers in Multilingual Language Models

Related tags

Overview

Meta Language-Specific Layers in Multilingual Language Models

Introduction

Dependencies

Usage

Owner

Zirui Wang

Scalable training for dense retrieval models.

An Api for Emotion recognition.

A crossplatform menu bar application using mpv as DLNA Media Renderer.

Pytorch and Torch testing code of CartoonGAN

Using VapourSynth with super resolution models and speeding them up with TensorRT.

implementation for paper "ShelfNet for fast semantic segmentation"

N-RPG - Novel role playing game da turfu

A Small and Easy approach to the BraTS2020 dataset (2D Segmentation)

Official PyTorch implementation of RIO

Public repository of the 3DV 2021 paper "Generative Zero-Shot Learning for Semantic Segmentation of 3D Point Clouds"

Efficiently computes derivatives of numpy code.

Repo for the Tutorials of Day1-Day3 of the Nordic Probabilistic AI School 2021 (https://probabilistic.ai/)

Video Autoencoder: self-supervised disentanglement of 3D structure and motion

Location-Sensitive Visual Recognition with Cross-IOU Loss

Multi-agent reinforcement learning algorithm and environment

Solving SMPL/MANO parameters from keypoint coordinates.

Data and analysis code for an MS on SK VOC genomes phenotyping/neutralisation assays

Out-of-Domain Human Mesh Reconstruction via Dynamic Bilevel Online Adaptation

Official Pytorch implementation of the paper: "Locally Shifted Attention With Early Global Integration"

Official Keras Implementation for UNet++ in IEEE Transactions on Medical Imaging and DLMIA 2018