当前位置:网站首页>A survey on model compression for natural language processing (NLP model compression overview)
A survey on model compression for natural language processing (NLP model compression overview)
2022-06-24 16:32:00 【Zhiyuan community】
author :Canwen Xu, Julian McAuley
brief introduction : With Transformer And pre training technology , natural language processing (NLP) Great progress has been made in the application of . However ,Transformer High energy consumption and long reasoning delay hinder NLP Into a broader scene , Including edge and mobile computing . Effective NLP The purpose of the study is to comprehensively consider the calculation , The whole life cycle of time and carbon emissions NLP, Including data preparation , Model training and reasoning . In this review , The author focuses on the reasoning stage , And review NLP Current situation of model compression , Including benchmark 、 Indicators and methods , The last author also The current obstacles and future research directions are summarized .



Paper download :https://arxiv.org/pdf/2202.07105
边栏推荐
- Go deep into the implementation principle of go language defer
- Some experiences of project K several operations in the global template
- Global and Chinese markets of natural insect repellents 2022-2028: Research Report on technology, participants, trends, market size and share
- Global and Chinese market of music synthesizer 2022-2028: Research Report on technology, participants, trends, market size and share
- [tke] modify the cluster corendns service address
- Bitwise Operators
- April 26, 2021: the length of the integer array arr is n (3 < = n < = 10^4), and each number is
- Ps\ai and other design software pondering notes
- Global and Chinese market of computer protective film 2022-2028: Research Report on technology, participants, trends, market size and share
- [idea] dynamic planning (DP)
猜你喜欢
MySQL Advanced Series: locks - locks in InnoDB

My network relationship with "apifox"

ZOJ——4104 Sequence in the Pocket(思维问题)
MySQL Advanced Series: Locks - Locks in InnoDB

Cognition and difference of service number, subscription number, applet and enterprise number (enterprise wechat)

C. Three displays(动态规划)Codeforces Round #485 (Div. 2)

Ps\ai and other design software pondering notes

Applet - use of template

There are potential safety hazards Land Rover recalls some hybrid vehicles

C. K-th not divisible by n (Mathematics + thinking) codeforces round 640 (Div. 4)
随机推荐
Abnormal dockgeddon causes CPU 100%
How FEA and FEM work together
Object store signature generation
MySQL Advanced Series: locks - locks in InnoDB
2021-05-01: given an ordered array arr, it represents the points located on the X axis. Given a positive number k
What is the difference between get and post? After reading it, you won't be confused and forced, and you won't have to fight with your friends anymore
SQL multi table updating data is very slow
Global and Chinese markets of natural insect repellents 2022-2028: Research Report on technology, participants, trends, market size and share
@There is a free copyright protection service for enterprises in Dawan District
It may be a good idea to use simulation software in the cloud for simulation
Serial of H3CNE experiment column - spanning tree STP configuration experiment
Nature publishes significant progress in quantum computing: the first quantum integrated circuit implementation in history
How to pop up an alarm through the national standard gb28181 protocol video platform easygbs for mobile detection / perimeter intrusion detection video recording
对深度可分离卷积、分组卷积、扩张卷积、转置卷积(反卷积)的理解
Detailed explanation of transpose convolution in pytorch
Development trend of CAE simulation analysis software
Snowflake algorithm implemented in go language
2021-04-28: force buckle 546, remove the box. Give some boxes of different colors
Global and Chinese market of insect proof clothing 2022-2028: Research Report on technology, participants, trends, market size and share
Istio FAQ: sidecar startup sequence