JASS: Japanese-specific Sequence to Sequence Pre-training for Neural Machine Translation

Last update: Oct 26, 2022

Related tags

Deep Learning JASS

Overview

JASS: Japanese-specific Sequence to Sequence Pre-training for Neural Machine Translation

This the repository for this paper.

Find extensions of this work and new pre-trained models here: code, paper

Requirements

Install OpenNMT-py (1.0) and subword-nmt.

pip install OpenNMT-py
pip install subword-nmt

Pre-trained JASS models

We release JASS models on 2 language pairs: ja+en, ja+ru. For Japanese seq2seq pretraining, we use our proposed JASS methods while MASS is utilized for English and Russian.

Model	Vocabulary	BPE codes
JASS-jaen	ja-en	ja-en.bpe.codes
JASS-jaru	ja-ru	ja-ru.bpe.codes

Usage

Run the bpe precrocessing for the dataset to be finetuned. After setting up the downloaded vocabulary for src and tgt sentences during the preprocessing phase by preprocess.py of OpenNMT, use train_from argument of train.py in OpenNMT to implement the finetuning for the pretrained model.

Others

We will update the current Japanese--English pre-trained model and release pretrained models on Japanese--Chinese and Japanese--Korean. We released new models here: code

Reference

[1] Zhuoyuan Mao, Fabien Cromieres, Raj Dabre, Haiyue Song, Sadao Kurohashi, JASS: Japanese-specific Sequence to Sequence Pre-training for Neural Machine Translation

@inproceedings{mao-etal-2020-jass,
    title = "{JASS}: {J}apanese-specific Sequence to Sequence Pre-training for Neural Machine Translation",
    author = "Mao, Zhuoyuan  and
      Cromieres, Fabien  and
      Dabre, Raj  and
      Song, Haiyue  and
      Kurohashi, Sadao",
    booktitle = "Proceedings of The 12th Language Resources and Evaluation Conference",
    month = may,
    year = "2020",
    address = "Marseille, France",
    publisher = "European Language Resources Association",
    url = "https://www.aclweb.org/anthology/2020.lrec-1.454",
    pages = "3683--3691",
    language = "English",
    ISBN = "979-10-95546-34-4",
}

JASS: Japanese-specific Sequence to Sequence Pre-training for Neural Machine Translation

Related tags

Overview

JASS: Japanese-specific Sequence to Sequence Pre-training for Neural Machine Translation

Requirements

Pre-trained JASS models

Usage

Others

Reference

Owner

Zhuoyuan Mao

Combinatorial model of ligand-receptor binding

Parallel and High-Fidelity Text-to-Lip Generation; AAAI 2022 ; Official code

Study of human inductive biases in CNNs and Transformers.

An example of Scatterbrain implementation (combining local attention and Performer)

Code release for paper: The Boombox: Visual Reconstruction from Acoustic Vibrations

Diverse Image Captioning with Context-Object Split Latent Spaces (NeurIPS 2020)

Flaxformer: transformer architectures in JAX/Flax

clDice - a Novel Topology-Preserving Loss Function for Tubular Structure Segmentation

Implementation of the bachelor's thesis "Real-time stock predictions with deep learning and news scraping".

Indoor Panorama Planar 3D Reconstruction via Divide and Conquer

Implementation of Kaneko et al.'s MaskCycleGAN-VC model for non-parallel voice conversion.

Tutorials, assignments, and competitions for MIT Deep Learning related courses.

An Unsupervised Graph-based Toolbox for Fraud Detection

A lightweight python AUTOmatic-arRAY library.

DNA sequence classification by Deep Neural Network

This repository provides code for "On Interaction Between Augmentations and Corruptions in Natural Corruption Robustness".

DRLib：A concise deep reinforcement learning library, integrating HER and PER for almost off policy RL algos.

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

MAU: A Motion-Aware Unit for Video Prediction and Beyond, NeurIPS2021

Implementation of average- and worst-case robust flatness measures for adversarial training.