Code Repo for the ACL21 paper "Common Sense Beyond English: Evaluating and Improving Multilingual LMs for Commonsense Reasoning"

Last update: Nov 30, 2022

Overview

Common Sense Beyond English: Evaluating and Improving Multilingual LMs for Commonsense Reasoning

This is the GitHub repository for our paper, "Common Sense Beyond English: Evaluating and Improving Multilingual Language Models for Commonsense Reasoning" (in Proc. of ACL 2021), which introduces MickeyProbe and X-CSR. Detailed information and links to download our data are available on the project website: https://inklab.usc.edu/XCSR/.

Code

This repository contains the code and scripts for running the MickeyProbe experiments (mickey_probe), the X-CSR experiments (xcsr_experiments), and the proposed multilingual contrastive pre-training method (mcp_generation). There are instructions under each folder; please refer to our paper for more details.

Paper Abstract

Commonsense reasoning research has so far been mainly limited to English. We aim to evaluate and improve popular multilingual language models (ML-LMs) to help advance commonsense reasoning (CSR) beyond English. We collect the Mickey Corpus, consisting of 561k sentences in 11 different languages, which can be used for analyzing and improving ML-LMs. We propose Mickey Probe, a language-agnostic probing task for fairly evaluating the common sense of popular ML-LMs across different languages. In addition, we also create two new datasets, X-CSQA and X-CODAH, by translating their English versions to 15 other languages, so that we can evaluate popular ML-LMs for cross-lingual commonsense reasoning. To improve the performance beyond English, we propose a simple yet effective method --- multilingual contrastive pre-training (MCP). It significantly enhances sentence representations, yielding a large performance gain on both benchmarks.

Resources

We provide our resources and method for studying cross-lingual commonsense reasoning.

A multilingual corpus for the MickeyProbe task, intended for analyzing and pre-training ML-LMs.
Two X-CSR datasets (i.e., X-CSQA and X-CODAH) for evaluation.
The multilingual contrastive pre-training (MCP) method for improving ML-LMs' performance.

We also built the X-CSR leaderboard so that people can compare their cross-lingual/multilingual models with each other under a unified evaluation protocol.
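As a rough illustration of what comparing models under one protocol can look like, the sketch below computes per-language accuracy for multiple-choice predictions. It is not the leaderboard's actual scoring code: the function name accuracy_by_language, the field names "id", "lang", and "answer", and the toy examples are all hypothetical and chosen only for this example. See the project website for the real submission format.

```python
from collections import defaultdict


def accuracy_by_language(examples, predictions):
    """Return {language: accuracy} for multiple-choice predictions.

    examples:    list of dicts with the (hypothetical) keys
                 "id", "lang", and "answer" (the gold choice label).
    predictions: dict mapping example id -> predicted choice label.
    A missing prediction is counted as incorrect.
    """
    correct = defaultdict(int)
    total = defaultdict(int)
    for ex in examples:
        total[ex["lang"]] += 1
        if predictions.get(ex["id"]) == ex["answer"]:
            correct[ex["lang"]] += 1
    return {lang: correct[lang] / total[lang] for lang in total}


# Toy data for illustration only; not taken from X-CSQA or X-CODAH.
examples = [
    {"id": "q1-en", "lang": "en", "answer": "A"},
    {"id": "q2-en", "lang": "en", "answer": "C"},
    {"id": "q1-de", "lang": "de", "answer": "A"},
    {"id": "q2-de", "lang": "de", "answer": "C"},
]
predictions = {"q1-en": "A", "q2-en": "C", "q1-de": "A", "q2-de": "B"}

scores = accuracy_by_language(examples, predictions)
print(scores)
```

Reporting one score per language, rather than a single pooled number, makes it easier to see where a model's performance drops outside English.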

Citation

@inproceedings{lin-etal-2021-xcsr,
    title = "Common Sense Beyond English: Evaluating and Improving Multilingual Language Models for Commonsense Reasoning",
    author = "Lin, Bill Yuchen and Lee, Seyeon and Qiao, Xiaoyang and Ren, Xiang",
    booktitle = "Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics (ACL-IJCNLP 2021)",
    year = "2021",
    note={to appear}
}

Contact

This repo is now under active development, and there may be issues caused by refactoring code. Please email [email protected] if you have any questions.

Owner

INK Lab @ USC
