当前位置:网站首页>Icml2022 | utility theory of sequential decision making
Icml2022 | utility theory of sequential decision making
2022-06-30 21:02:00 【Zhiyuan community】

Thesis link :https://arxiv.org/pdf/2206.12562.pdf
Large based on Transformer The model has shown superior performance in various naturallanguageprocessing and computer vision tasks . However , These models contain a large number of parameters , This limits their deployment in real applications . To reduce the size of the model , Researchers prune these models according to the importance score of the weight . However , These scores are usually estimated in small batches during training , Due to small batch sampling and complex training dynamics , This brings a lot of variability / uncertainty . Because of this uncertainty , Common pruning methods prune some key weights , Make training unstable , It is not conducive to generalization . To solve this problem , We proposed PLATON Algorithm , The algorithm uses the upper confidence limit of importance estimation (upper confidence bound, UCB) To capture the uncertainty of the importance score . Especially for the weight with low importance score but high uncertainty ,PLATON Tend to keep them and explore their capacity . We are in natural language understanding 、 Question answering and image classification are based on transformer A large number of experiments have been carried out on the model , To verify PLATON The effectiveness of the . It turns out that , At different sparsity levels ,PLATON The algorithm has been significantly improved .

边栏推荐
- Flinksql两个kafka 流可以进行join么?
- No "history of blood and tears" in home office | community essay solicitation
- 19.04 分配器
- Lumiprobe dye hydrazide - BDP FL hydrazide solution
- Gartner聚焦中国低代码发展 UniPro如何践行“差异化”
- On inline function
- 浅谈代码语言的魅力
- Lumiprobe biotin phosphimide (hydroxyproline) instructions
- Lvalue reference and lvalue reference
- 电子方案开发——智能跳绳方案
猜你喜欢
随机推荐
二叉查找树(一) - 概念与C语言实现
Lumiprobe生物素亚磷酰胺(羟脯氨酸)说明书
多表操作-外键约束
电子方案开发——智能跳绳方案
ICML2022 | 序列决策的效用理论
微信小程序怎么实现圆心进度条
On inline function
修改已经上线的小程序名称
Flutter 嵌套地狱?不存在的,ConstraintLayout 来解救!
减少嵌入式软件调试时间的三个技巧
Deflection lock / light lock / heavy lock lock is healthier. How to complete locking and unlocking
文本生成模型退化怎么办?SimCTG 告诉你答案
RP原型资源分享-购物类App
对多态的理解
Double solid histogram / double y-axis
Oracle 数据库表结构 Excel 导出
有趣网站汇总
coredns 修改upstream
Lumiprobe 聚乙二醇化和 PEG 接头丨碘-PEG3-酸研究
文本生成模型退化怎麼辦?SimCTG 告訴你答案








