当前位置:网站首页>Icml2022 | utility theory of sequential decision making
Icml2022 | utility theory of sequential decision making
2022-06-30 21:02:00 【Zhiyuan community】

Thesis link :https://arxiv.org/pdf/2206.12562.pdf
Large based on Transformer The model has shown superior performance in various naturallanguageprocessing and computer vision tasks . However , These models contain a large number of parameters , This limits their deployment in real applications . To reduce the size of the model , Researchers prune these models according to the importance score of the weight . However , These scores are usually estimated in small batches during training , Due to small batch sampling and complex training dynamics , This brings a lot of variability / uncertainty . Because of this uncertainty , Common pruning methods prune some key weights , Make training unstable , It is not conducive to generalization . To solve this problem , We proposed PLATON Algorithm , The algorithm uses the upper confidence limit of importance estimation (upper confidence bound, UCB) To capture the uncertainty of the importance score . Especially for the weight with low importance score but high uncertainty ,PLATON Tend to keep them and explore their capacity . We are in natural language understanding 、 Question answering and image classification are based on transformer A large number of experiments have been carried out on the model , To verify PLATON The effectiveness of the . It turns out that , At different sparsity levels ,PLATON The algorithm has been significantly improved .

边栏推荐
猜你喜欢

Label Contrastive Coding based Graph Neural Network for Graph Classification

树基本概念

Study on lumiprobe dye NHS ester BDP FL NHS ester

Basic components of STL

Lumiprobe染料酰肼丨BDP FL 酰肼方案

Oracle 数据库表结构 Excel 导出

Adobe Photoshop (PS) - script development - remove file bloated script

Lumiprobe cell biology - dia, instructions for lipophilic tracer

文本识别-SVTR论文解读

BioVendor sRAGE Elisa试剂盒测试原理和注意事项
随机推荐
文本识别-SVTR论文解读
How can I get the stock account opening discount link? In addition, is it safe to open a mobile account?
go搭建服务器基础
19.04 分配器
MySQL:SQL概述及数据库系统介绍 | 黑马程序员
WebRTC系列-网络传输之本地scoket端口
sqlserver 字符串类型转换成小数或者整数类型
我想知道股票开户要认识谁?另外,手机开户安全么?
Playwright - scroll bar operation
开发技术-使用easyexcel导入文件(简单示例)
企业保护 API 安全迫在眉睫
Flinksql两个kafka 流可以进行join么?
Lumiprobe蛋白质定量丨QuDye 蛋白定量试剂盒
vncserver: Failed command ‘/etc/X11/Xvnc-session‘: 256!
SqlServer 获取字符串中数字,中文及字符部分数据
Comparison between QT and other GUI Libraries
将博客搬至CSDN
多表操作-外键约束
B_QuRT_User_Guide(31)
What bank card do you need to open an account online? In addition, is it safe to open an account online now?