A survey on dynamic neural networks for natural language processing, University of California
2022-06-24 16:32:00 【Zhiyuan community】
Authors: Canwen Xu, Julian McAuley
Summary: Scaling up large Transformer models effectively has been a main driver of recent advances in natural language processing. Dynamic neural networks are an emerging research direction: by adjusting their computational path according to the input, they allow a network to be scaled up with only sub-linear increases in computation and time. They may therefore be a promising answer to the growing parameter counts of pretrained language models, enabling both pretraining with trillions of parameters and faster inference on mobile devices. In this survey, the authors summarize progress on three types of dynamic neural networks in natural language processing: skimming, mixture of experts, and early exit. They also highlight the current challenges facing dynamic neural networks and directions for future research.
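To make "adjusting the computational path according to the input" concrete, here is a minimal sketch of early-exit inference in plain Python. It is an illustration only, not the method of the survey or of any specific paper: the confidence threshold, the toy `double` layer, the `identity` classifier, and the function name `early_exit_predict` are all assumptions chosen for the example.

```python
import math
from typing import Callable, List, Sequence, Tuple

Vector = List[float]


def softmax(logits: Sequence[float]) -> Vector:
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def early_exit_predict(
    x: Vector,
    layers: Sequence[Callable[[Vector], Vector]],
    classifiers: Sequence[Callable[[Vector], Vector]],
    threshold: float = 0.9,
) -> Tuple[int, int]:
    """Run layers in order and stop as soon as the intermediate
    classifier attached to the current layer is confident enough.

    Returns (predicted_label, number_of_layers_executed).
    Assumes `layers` is non-empty and matches `classifiers` in length.
    """
    hidden = x
    probs: Vector = []
    depth = 0
    for depth, (layer, clf) in enumerate(zip(layers, classifiers), start=1):
        hidden = layer(hidden)
        probs = softmax(clf(hidden))
        if max(probs) >= threshold:
            break  # confident: skip the remaining layers
    return probs.index(max(probs)), depth


# Toy stand-ins (hypothetical): each "layer" doubles the hidden values,
# and each "classifier" uses the hidden values directly as logits.
def double(h: Vector) -> Vector:
    return [2.0 * v for v in h]


def identity(h: Vector) -> Vector:
    return list(h)


layers = [double, double, double]
classifiers = [identity, identity, identity]

easy = early_exit_predict([3.0, 0.0], layers, classifiers)    # exits after 1 layer
medium = early_exit_predict([1.0, 0.0], layers, classifiers)  # exits after 2 layers
hard = early_exit_predict([0.1, 0.0], layers, classifiers)    # runs all 3 layers
```

In this sketch an "easy" input leaves the network after one layer while a "hard" input uses all three, so the computation spent depends on the input rather than being fixed by the model's depth.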





Paper download: https://arxiv.org/pdf/2202.07101.pdf