当前位置:网站首页>Naacl-22 | introduce the setting of migration learning on the prompt based text generation task
Naacl-22 | introduce the setting of migration learning on the prompt based text generation task
2022-07-04 22:32:00 【Zhiyuan community】
The pre training language model has made significant progress in text generation tasks through fine-tuning , But in the scenario of sparse data , It is usually impossible to fine tune directly . therefore , In this paper, based on prompt Setting of transfer learning . The author first learns one for different tasks in the source domain prompt, Thus construct prompt pool , Then migrate in the target task . In order to consider both task level and instance level information , The author designed an adaptive attention mechanism , For each instance sample in the target task , The model will select the most relevant source task for it prompt. The author has carried out experiments on various generation tasks and data sets , The results show that the migration method proposed by the author can improve the generation effect on the target task very well .

Paper title :
Learning to Transfer Prompts for Text Generation
Thesis link :
https://arxiv.org/abs/2205.01543
边栏推荐
- A large number of virtual anchors in station B were collectively forced to refund: revenue evaporated, but they still owe station B; Jobs was posthumously awarded the U.S. presidential medal of freedo
- Sqlserver encrypts and decrypts data
- Deployment of JVM sandbox repeater
- Force buckle_ Palindrome number
- PHP short video source code, thumb animation will float when you like it
- Solana链上应用Crema因黑客攻击停运
- PostgreSQL server programming aggregation and grouping
- Logo special training camp section III initial creative techniques
- Common shortcut keys for hbuilder x
- Short video system source code, click the blank space of the screen, the keyboard does not automatically stow
猜你喜欢

Logo special training camp Section IV importance of font design

LOGO特训营 第五节 字体结构与设计常用技法

广电五舟与华为签署合作协议,共同推进昇腾AI产业持续发展

Huawei Nova 10 series released Huawei application market to build a solid application security firewall

UML diagram memory skills

国产数据库乱象

LOGO特训营 第二节 文字与图形的搭配关系

UML图记忆技巧

B站大量虚拟主播被集体强制退款:收入蒸发,还倒欠B站;乔布斯被追授美国总统自由勋章;Grafana 9 发布|极客头条

The use of complex numbers in number theory and geometry - Cao Zexian
随机推荐
LOGO特训营 第二节 文字与图形的搭配关系
Test will: bug classification and promotion solution
More than 30 institutions jointly launched the digital collection industry initiative. How will it move forward in the future?
Apachecn translation, proofreading, note sorting activity progress announcement 2022.7
虚拟人产业面临的挑战
Mysql root 账号如何重置密码
Zhiyang innovation signed a cooperation agreement with Huawei to jointly promote the sustainable development of shengteng AI industry
Redis sentinel simply looks at the trade-offs between distributed high availability and consistency
How to transfer to software testing, one of the high paying jobs in the Internet? (software testing learning roadmap attached)
leetcode 72. Edit distance edit distance (medium)
sqlserver对数据进行加密、解密
Logo special training camp section III initial creative techniques
Solana chain application crema was shut down due to hacker attacks
Locust performance test - environment construction and use
现在mysql cdc2.1版本在解析值为0000-00-00 00:00:00的datetime类
达梦数据凭什么被称为国产数据库“第一股”?
Interview question 01.08 Zero matrix
Logo Camp d'entraînement section 3 techniques créatives initiales
不同环境相同配置项的内容如何diff差异?
Force buckle 3_ 383. Ransom letter