Peking University, UC Berkeley, et al. | Domain-Adaptive Text Classification with Structured Knowledge from Unlabeled Data
2022-06-23 21:52:00 [BAAI Community (智源社區)]
Authors: Tian Li, Xiang Chen, Zhen Dong, et al.
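Summary: Domain-adaptive text classification is a challenging problem for large-scale pretrained language models, because they usually need expensive additional labeled data to adapt to a new domain. Existing work typically fails to exploit the implicit relationships among words across domains. In this paper, the authors propose a new method, Domain Adaptation with Structured Knowledge (DASK), which strengthens domain adaptation by exploiting word-level semantic relationships. DASK first builds a knowledge graph that captures the relationships between pivot terms (domain-independent words) and non-pivot terms in the target domain. Then, during training, DASK injects pivot-related knowledge graph information into the source-domain texts. For the downstream task, these knowledge-injected texts are fed into a BERT variant capable of processing knowledge-injected text data. Thanks to the knowledge injection, the model learns domain-invariant features for non-pivots according to their relationships with pivots. DASK ensures that pivots behave in a domain-invariant way by inferring them dynamically from the polarity scores of candidate pivots during training with pseudo-labels. The authors validate DASK on a wide range of cross-domain sentiment classification tasks and observe an absolute performance improvement of up to 2.9% over baselines across 20 different domain pairs. Code will be made available at https://github.com/hikaru-nara/DASK.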
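
The pipeline described in the summary (select pivots by polarity, link them to target-domain words, inject those links into source texts) can be illustrated with a toy sketch. This is not the authors' implementation: the summary does not give the polarity formula, the graph construction, or the injection format, so the scoring rule `|P(pos|w) - P(neg|w)|`, the co-occurrence graph, the bracketed injection format, the threshold, and all function names below are assumptions made for illustration only. The BERT variant that consumes the injected text is omitted.

```python
from collections import Counter, defaultdict


def polarity_scores(docs, labels):
    """Assumed score: |P(pos | w) - P(neg | w)| over (pseudo-)labeled docs."""
    pos, total = Counter(), Counter()
    for text, label in zip(docs, labels):
        for word in set(text.lower().split()):
            total[word] += 1
            pos[word] += label  # label is 1 (positive) or 0 (negative)
    return {w: abs(2 * pos[w] / total[w] - 1) for w in total}


def select_pivots(src_docs, src_labels, tgt_docs, tgt_pseudo_labels, threshold=0.9):
    """Keep words seen in both domains that are strongly polar in both."""
    src = polarity_scores(src_docs, src_labels)
    tgt = polarity_scores(tgt_docs, tgt_pseudo_labels)
    return {w for w in src.keys() & tgt.keys() if min(src[w], tgt[w]) >= threshold}


def build_graph(tgt_docs, pivots):
    """Link each pivot to non-pivot words co-occurring with it in target text."""
    graph = defaultdict(Counter)
    for text in tgt_docs:
        words = set(text.lower().split())
        for pivot in words & pivots:
            for other in words - pivots:
                graph[pivot][other] += 1
    return graph


def inject(text, graph, top_k=1):
    """Append the top-k related target-domain words after each pivot."""
    out = []
    for word in text.split():
        out.append(word)
        neighbors = graph.get(word.lower())
        if neighbors:
            related = [w for w, _ in neighbors.most_common(top_k)]
            out.append("[" + " ".join(related) + "]")
    return " ".join(out)


# Toy data: a labeled source domain (books) and a pseudo-labeled target
# domain (electronics). Both are invented for this example.
source_docs = ["great story", "terrible story"]
source_labels = [1, 0]
target_docs = ["great battery", "terrible glare"]
target_pseudo_labels = [1, 0]

pivots = select_pivots(source_docs, source_labels, target_docs, target_pseudo_labels)
graph = build_graph(target_docs, pivots)
print(inject("great story", graph))
```

In this toy setting "story" appears with both labels, so its assumed polarity score is 0 and it is not treated as a pivot, while "great" and "terrible" are. Re-running `select_pivots` as pseudo-labels change during training is one simple way to mimic the dynamic pivot inference the summary mentions.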


Paper download: https://arxiv.org/pdf/2206.09591.pdf