Code for the paper "BERT Loses Patience: Fast and Robust Inference with Early Exit".

Last update: Jan 04, 2023

Related tags

Text Data & NLP PABEE

Overview

Patience-based Early Exit

Code for the paper "BERT Loses Patience: Fast and Robust Inference with Early Exit".

NEWS: We now have a better and tidier implementation integrated into Hugging Face transformers!

Citation

If you use this code in your research, please cite our paper:

@inproceedings{zhou2020bert,
 author = {Zhou, Wangchunshu and Xu, Canwen and Ge, Tao and McAuley, Julian and Xu, Ke and Wei, Furu},
 booktitle = {Advances in Neural Information Processing Systems},
 pages = {18330--18341},
 publisher = {Curran Associates, Inc.},
 title = {BERT Loses Patience: Fast and Robust Inference with Early Exit},
 url = {https://proceedings.neurips.cc/paper/2020/file/d4dd111a4fd973394238aca5c05bebe3-Paper.pdf},
 volume = {33},
 year = {2020}
}

Requirement

Our code is built on huggingface/transformers. To use our code, you must clone and install huggingface/transformers.

Training

You can fine-tune a pretrained language model and train the internal classifiers by configuring and running finetune_bert.sh and finetune_albert.sh .

Inference

You can inference with different patience settings by configuring and running patience_infer_albert.sh and patience_infer_bert.sh.

Bug Report and Contribution

If you'd like to contribute and add more tasks (only GLUE is available at this moment), please submit a pull request and contact me. Also, if you find any problem or bug, please report with an issue. Thanks!

Code for the paper "BERT Loses Patience: Fast and Robust Inference with Early Exit".

Related tags

Overview

Patience-based Early Exit

Citation

Requirement

Training

Inference

Bug Report and Contribution

Owner

Kevin Canwen Xu

Legal text retrieval for python

PyTorch implementation of NATSpeech: A Non-Autoregressive Text-to-Speech Framework

Ceaser-Cipher - The Caesar Cipher technique is one of the earliest and simplest method of encryption technique

History Aware Multimodal Transformer for Vision-and-Language Navigation

Dual languaged (rus+eng) tool for packing and unpacking archives of Silky Engine.

Segmenter - Transformer for Semantic Segmentation

Official code of our work, Unified Pre-training for Program Understanding and Generation [NAACL 2021].

An Explainable Leaderboard for NLP

This is the Alpha of Nutte language, she is not complete yet / Essa é a Alpha da Nutte language, não está completa ainda

PyTorch implementation of the paper: Text is no more Enough! A Benchmark for Profile-based Spoken Language Understanding

Open-source offline translation library written in Python. Uses OpenNMT for translations

A toolkit for document-level event extraction, containing some SOTA model implementations

Kestrel Threat Hunting Language

MRC approach for Aspect-based Sentiment Analysis (ABSA)

GPT-2 Model for Leetcode Questions in python

Non-Autoregressive Predictive Coding

Mysticbbs-rjam - rJAM splitscreen message reader for MysticBBS A46+

InferSent sentence embeddings

Hierarchical unsupervised and semi-supervised topic models for sparse count data with CorEx

Korea Spell Checker