当前位置:网站首页>Naacl 2022 | TAMT: search the transportable Bert subnet through downstream task independent mask training
Naacl 2022 | TAMT: search the transportable Bert subnet through downstream task independent mask training
2022-06-27 13:50:00 【Zhiyuan community】
Paper title :Learning to Win Lottery Tickets in BERT Transfer via Task-agnostic Mask Training

chart 2 TAMT On pre training tasks (MLM Or knowledge distillation ) Learning sub network structure , Then migrate it to different downstream tasks for fine-tuning
Based on the above motives , We propose downstream task independent mask training (Task-Agnostic Mask Training,TAMT) Method . Pictured 2 Shown ,TAMT Optimize pre training tasks BERT The structure of the subnetwork ( Do not change the pre training parameter value ), So the sub network has better performance in the pre training task . Subsequently, the searched sub network will be migrated to a variety of downstream tasks for fine-tuning training .
边栏推荐
猜你喜欢

Number of printouts (solved by recursive method)

Openhgnn releases version 0.3

Quick news: Huawei launched the Hongmeng developer competition; Tencent conference released the "Wanshi Ruyi" plan

Step by step expansion of variable parameters in class templates
![[weekly replay] the 81st biweekly match of leetcode](/img/66/03ee4dbb88b0be7486b71cd4059f44.png)
[weekly replay] the 81st biweekly match of leetcode

CCID Consulting released the database Market Research Report on key application fields during the "14th five year plan" (attached with download)

Does Xinhua San still have to rely on ICT to realize its 100 billion enterprise dream?

《预训练周刊》第51期:重构预训练、零样本自动微调、一键调用OPT

Deep understanding of bit operations

What is the difference between the FAT32 and NTFS formats on the USB flash disk
随机推荐
[business security-04] universal user name and universal password experiment
Array related knowledge
Bidding announcement: Oracle all-in-one machine software and hardware maintenance project of Shanghai R & D Public Service Platform Management Center
Quickly set up a website to visit foreign countries, set up SS and start BBR to quickly surf the Internet
Does Xinhua San still have to rely on ICT to realize its 100 billion enterprise dream?
简析国内外电商的区别
NLP - monocleaner
芯片供给过剩之际,进口最多的中国继续减少进口,美国芯片慌了
mysql 锁机制与四种隔离级别
OpenHGNN发布0.3版本
高效率取幂运算
实现WordPress上传图片自动重命名的方法
[a complete human-computer interface program framework]
CMOS级电路分析
面试官:Redis的共享对象池了解吗?
每日3题(2):检查二进制字符串字段
Half find (half find)
赛迪顾问发布《“十四五” 关键应用领域之数据库市场研究报告》(附下载)
美国芯片再遭重击,继Intel后又一家芯片企业将被中国芯片超越
crane:字典项与关联数据处理的新思路