Code for the paper "Balancing Training for Multilingual Neural Machine Translation, ACL 2020"

Last update: May 18, 2022

Balancing Training for Multilingual Neural Machine Translation

Implementation of the paper

Balancing Training for Multilingual Neural Machine Translation

Xinyi Wang, Yulia Tsvetkov, Graham Neubig

Data:

The preprocessed and binarized data for fairseq can be downloaded here

To process the data from scratch, see the script

util_scripts/prepare_multilingual_data.sh

Training Scripts:

The training scripts for many-to-one translation of the related language group (Related M2O) are under the directory job_scripts/related_ted8_m2o/.

Our methods:

MultiDDS-S:

job_scripts/related_ted8_m2o/multidds_s.sh

MultiDDS:

job_scripts/related_ted8_m2o/multidds.sh

Baselines:

Proportional:

job_scripts/related_ted8_m2o/proportional.sh

Temperature:

job_scripts/related_ted8_m2o/temperature.sh

The scripts for Related O2M are under the directory job_scripts/related_ted8_o2m/

The scripts for Diverse M2O are under the directory job_scripts/diverse_ted8_m2o/

The scripts for Diverse O2M are under the directory job_scripts/diverse_ted8_o2m/
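The directory layout above can be sketched as a small path helper. This is only an illustration: the function name script_path is hypothetical, and it assumes the other three setting directories use the same script names as job_scripts/related_ted8_m2o/, which is not confirmed here. The loop only reports which files are present in the current checkout; it does not start any training.

```shell
#!/usr/bin/env bash
# Hypothetical helper: build the path of a training script from a setting and a method name.
# Assumption: every setting directory uses the same script names as related_ted8_m2o.
script_path() {
  local setting="$1"   # e.g. related_ted8_m2o, diverse_ted8_o2m
  local method="$2"    # e.g. multidds_s, multidds, proportional, temperature
  printf 'job_scripts/%s/%s.sh\n' "$setting" "$method"
}

# Report which of the expected scripts exist, relative to the repository root.
for setting in related_ted8_m2o related_ted8_o2m diverse_ted8_m2o diverse_ted8_o2m; do
  for method in multidds_s multidds proportional temperature; do
    path="$(script_path "$setting" "$method")"
    if [ -f "$path" ]; then
      echo "found:   $path"
    else
      echo "missing: $path"
    fi
  done
done
```

Run it from the repository root so that the relative paths resolve.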

Inference Scripts:

Each experiment script directory contains a trans.sh file for translating the test set. To translate the test set for the Related M2O MultiDDS-S experiment, run:

job_scripts/related_ted8_m2o/trans.sh checkpoints/related_ted8_m2o/multidds_s/

To translate the test set for another experiment, replace the argument with that experiment's checkpoint directory.
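As a rough sketch of that substitution, the helper below prints the translation command for a given setting and method without executing it. The name trans_command is hypothetical, and the checkpoints/<setting>/<method>/ layout is an assumption extrapolated from the single Related M2O MultiDDS-S example; adjust it if your checkpoints are stored elsewhere.

```shell
#!/usr/bin/env bash
# Hypothetical helper: print (not run) the translation command for one experiment.
# Assumption: checkpoints are stored as checkpoints/<setting>/<method>/,
# mirroring the Related M2O MultiDDS-S example.
trans_command() {
  local setting="$1"   # e.g. related_ted8_m2o
  local method="$2"    # e.g. multidds_s
  printf 'job_scripts/%s/trans.sh checkpoints/%s/%s/\n' "$setting" "$setting" "$method"
}

# Print the commands for two experiments so they can be inspected before running.
trans_command related_ted8_m2o multidds_s
trans_command diverse_ted8_o2m multidds
```

Printing the command first makes it easy to confirm that the checkpoint directory exists before launching the translation.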

Citation

Please cite as:

@inproceedings{wang2020multiDDS,
  title = {Balancing Training for Multilingual Neural Machine Translation},
  author = {Xinyi Wang and Yulia Tsvetkov and Graham Neubig},
  booktitle = {ACL},
  year = {2020},
}

Owner

Xinyi Wang
