当前位置:网站首页>Naacl 2022 | TAMT: search the transportable Bert subnet through downstream task independent mask training
Naacl 2022 | TAMT: search the transportable Bert subnet through downstream task independent mask training
2022-06-27 13:50:00 【Zhiyuan community】
Paper title :Learning to Win Lottery Tickets in BERT Transfer via Task-agnostic Mask Training

chart 2 TAMT On pre training tasks (MLM Or knowledge distillation ) Learning sub network structure , Then migrate it to different downstream tasks for fine-tuning
Based on the above motives , We propose downstream task independent mask training (Task-Agnostic Mask Training,TAMT) Method . Pictured 2 Shown ,TAMT Optimize pre training tasks BERT The structure of the subnetwork ( Do not change the pre training parameter value ), So the sub network has better performance in the pre training task . Subsequently, the searched sub network will be migrated to a variety of downstream tasks for fine-tuning training .
边栏推荐
- Step by step expansion of variable parameters in class templates
- Debug tool
- Gaode map IP positioning 2.0 backup
- [a complete human-computer interface program framework]
- Awk concise tutorial
- Pytorch learning 3 (test training model)
- Why must Oracle cloud customers self test after the release of Oracle cloud quarterly update?
- 打印输出数(递归方法解决)
- 【业务安全-01】业务安全概述及测试流程
- 快讯:华为启动鸿蒙开发者大赛;腾讯会议发布“万室如意”计划
猜你喜欢
随机推荐
Daily 3 questions (1): find the nearest point with the same X or Y coordinate
Axi bus
enable_ if
mysql 锁机制与四种隔离级别
JVM parameter setting and analysis
微服务如何拆分
Type 'image' is not a subtype of type 'imageprovider < object > solution
国产数据库乱象
Shell concise tutorial
With the advent of the era of Internet of everything, Ruijie released a scenario based wireless zero roaming scheme
类模板中可变参的逐步展开
crane:字典项与关联数据处理的新思路
Journal quotidien des questions (6)
ENSP cloud configuration
PLM还能怎么用?
Pytorch learning 1 (learning documents on the official website)
POSIX AIO -- Introduction to glibc version asynchronous IO
【业务安全-04】万能用户名及万能密码实验
The second part of the travel notes of C (Part II) structural thinking: Zen is stable; all four advocate structure
How to set postman to Chinese? (Chinese)




![[WUSTCTF2020]girlfriend](/img/a8/33fe5feb7bcbb73ba26a94d226cc4d.png)




