当前位置:网站首页>Icml2022 | utility theory of sequential decision making
Icml2022 | utility theory of sequential decision making
2022-06-30 21:02:00 【Zhiyuan community】

Thesis link :https://arxiv.org/pdf/2206.12562.pdf
Large based on Transformer The model has shown superior performance in various naturallanguageprocessing and computer vision tasks . However , These models contain a large number of parameters , This limits their deployment in real applications . To reduce the size of the model , Researchers prune these models according to the importance score of the weight . However , These scores are usually estimated in small batches during training , Due to small batch sampling and complex training dynamics , This brings a lot of variability / uncertainty . Because of this uncertainty , Common pruning methods prune some key weights , Make training unstable , It is not conducive to generalization . To solve this problem , We proposed PLATON Algorithm , The algorithm uses the upper confidence limit of importance estimation (upper confidence bound, UCB) To capture the uncertainty of the importance score . Especially for the weight with low importance score but high uncertainty ,PLATON Tend to keep them and explore their capacity . We are in natural language understanding 、 Question answering and image classification are based on transformer A large number of experiments have been carried out on the model , To verify PLATON The effectiveness of the . It turns out that , At different sparsity levels ,PLATON The algorithm has been significantly improved .

边栏推荐
猜你喜欢
随机推荐
Label Contrastive Coding based Graph Neural Network for Graph Classification
Comparison between QT and other GUI Libraries
RP原型资源分享-购物类App
修改已经上线的小程序名称
vncserver: Failed command ‘/etc/X11/Xvnc-session‘: 256!
uniapp-路由uni-simple-router
学习总结
Game 81 biweekly
MySQL高级篇3
SQL Server 提取字符串中的纯数字
Huffman tree (I) basic concept and C language implementation
Peking University ACM problems 1004:financial management
Peking University ACM problems 1006:biorhythms
WebRTC系列-网络传输之本地scoket端口
加密与解密以及OpenSSL的应用
On the charm of code language
Peking University ACM problems 1001:exposition
Lumiprobe细胞生物学丨DiA,亲脂性示踪剂说明书
企业保护 API 安全迫在眉睫
元宇宙可能成为互联网发展的新方向


![[1175. prime number arrangement]](/img/f2/d427db03da151786ea1dfb7a76328a.png)






