当前位置:网站首页>Naacl-22 | introduce the setting of migration learning on the prompt based text generation task
Naacl-22 | introduce the setting of migration learning on the prompt based text generation task
2022-07-04 22:32:00 【Zhiyuan community】
The pre training language model has made significant progress in text generation tasks through fine-tuning , But in the scenario of sparse data , It is usually impossible to fine tune directly . therefore , In this paper, based on prompt Setting of transfer learning . The author first learns one for different tasks in the source domain prompt, Thus construct prompt pool , Then migrate in the target task . In order to consider both task level and instance level information , The author designed an adaptive attention mechanism , For each instance sample in the target task , The model will select the most relevant source task for it prompt. The author has carried out experiments on various generation tasks and data sets , The results show that the migration method proposed by the author can improve the generation effect on the target task very well .
Paper title :
Learning to Transfer Prompts for Text Generation
Thesis link :
https://arxiv.org/abs/2205.01543
边栏推荐
- 虚拟人产业面临的挑战
- 傳智教育|如何轉行互聯網高薪崗比特之一的軟件測試?(附軟件測試學習路線圖)
- Radio and television Wuzhou signed a cooperation agreement with Huawei to jointly promote the sustainable development of shengteng AI industry
- 抖音实战~评论数量同步更新
- Machine learning notes mutual information
- Easy to use app recommendation: scan QR code, scan barcode and view history
- Scala download and configuration
- 使用 BlocConsumer 同时构建响应式组件和监听状态
- Logo Camp d'entraînement section 3 techniques créatives initiales
- Flask 上下文详解
猜你喜欢
Introduction and application of bigfilter global transaction anti duplication component
Logo Camp d'entraînement section 3 techniques créatives initiales
UML diagram memory skills
With this PDF, we finally got offers from eight major manufacturers, including Alibaba, bytek and Baidu
并发网络模块化 读书笔记转
Close system call analysis - Performance Optimization
Ascendex launched Walken (WLKN) - an excellent and leading "walk to earn" game
The use of complex numbers in number theory and geometry - Cao Zexian
TLA+ 入门教程(1):形式化方法简介
赋能数字经济 福昕软件出席金砖国家可持续发展高层论坛
随机推荐
醒悟的日子,我是怎么一步一步走向软件测试的道路
leetcode 72. Edit Distance 编辑距离(中等)
You don't have to run away to delete the library! Detailed MySQL data recovery
Energy momentum: how to achieve carbon neutralization in the power industry?
都说软件测试很简单有手就行,但为何仍有这么多劝退的?
High school physics: linear motion
傳智教育|如何轉行互聯網高薪崗比特之一的軟件測試?(附軟件測試學習路線圖)
How can the advertising system of large factories be upgraded without the presence of large models
凭借了这份 pdf,最终拿到了阿里,字节,百度等八家大厂 offer
Sqlserver encrypts and decrypts data
国产数据库乱象
传智教育|如何转行互联网高薪岗位之一的软件测试?(附软件测试学习路线图)
DevEco Device Tool 3.0 Release带来5大能力升级,让智能设备开发更高效
Now MySQL cdc2.1 is parsing the datetime class with a value of 0000-00-00 00:00:00
短视频系统源码,点击屏幕空白处键盘不自动收起
HUAWEI nova 10系列发布 华为应用市场筑牢应用安全防火墙
Embedded development: skills and tricks -- seven skills to improve the quality of embedded software code
How to transfer to software testing, one of the high paying jobs in the Internet? (software testing learning roadmap attached)
[cooking record] - stir fried 1000 pieces of green pepper
可视化任务编排&拖拉拽 | Scaleph 基于 Apache SeaTunnel的数据集成