EMNLP 2021 paper Models and Datasets for Cross-Lingual Summarisation.

Last update: Oct 28, 2022

Overview

This repository contains data and code for our EMNLP 2021 paper Models and Datasets for Cross-Lingual Summarisation. Please contact me at [email protected] with any questions.

Please cite this paper if you use our code or data.

@InProceedings{clads-emnlp,
  author =      "Laura Perez-Beltrachini and Mirella Lapata",
  title =       "Models and Datasets for Cross-Lingual Summarisation",
  booktitle =   "Proceedings of The 2021 Conference on Empirical Methods in Natural Language Processing",
  year =        "2021",
  address =     "Punta Cana, Dominican Republic",
}

The XWikis Corpus

You can create the corpus using the instructions below. The original XWikis corpus is available at XWikis.

Instructions to re-create our corpus and extract other languages are available here.

Cross-lingual Summarisation Code

Our code is based on Fairseq and mBART/mBART50. Our clone of Fairseq, together with the code extensions that implement our models, is available here; instructions to pre-process the data and to train and evaluate our models are available here.
