当前位置:网站首页>Paper notes ACL 2022 unified structure generation for universal information extraction
Paper notes ACL 2022 unified structure generation for universal information extraction
2022-06-21 17:58:00 【hlee-top】
List of articles
1 brief introduction
Thesis title :Unified Structure Generation for Universal Information Extraction
Source of the paper :ACL 2022
Organization : Software Institute Baidu
Thesis link :https://arxiv.org/pdf/2203.12277.pdf
Code link :https://github.com/universal-ie/UIE
1.1 motivation
- The task specific information extraction methods hinder the structural development of information extraction systems 、 Knowledge sharing and cross domain migration .
1.2 innovation
- A unified text-to-structure Generate schema , Different information can be extracted (IE) Task modeling , Generate the target structure adaptively , And learn general information extraction ability from different knowledge resources . Is the first text-to-structure Pre training extraction model .
- A unified structure generation network is designed , Extracting languages from structures (structural extraction language) The heterogeneous information extraction structure is encoded into a unified representation , And through the structural model (structural schema instructor) Guiding mechanism control UIE Model recognition 、 Relate and generate .

2 Method
The overall framework of the model is shown in the figure below , It mainly includes structural schema instructor and structural extraction language Two parts , Given a specific predefined schema s And the text t, The model needs to generate a structure , The structure contains schema s Indicated text t Structure information required in .
2.1 Structured Extraction Language
structured exextraction language (SEL) Will be heterogeneous IE The structure is encoded as a unified representation , There are three semantic structures , An example is shown below :
- SPOTNAME: Indicates that the... Exists in the text Spot Name Information fragment of type ;
- ASSONAME: It indicates that there is an upper layer in the text and structure Spot Yes Asso Name Pieces of information about the relationship ;
- INFOSPAN: Express Spot Name perhaps Asso Name In the text span;

2.2 Structural Schema Instructor
Structural Schema Instructor(SSI) Describe the extraction objectives of the task , Construct a schema-based prompt. Contains three types of token:
- SPOTNAME: Target spot name.
- ASSONAME: Target association name.
- Special Symbols([spot], [asso],[text]): Add to each spot name、association name And before the text .

2.3 Structure Generation with UIE
text-to-SEL The generated process uses encoding - Decoding structure , The structure is Transformer, The encoding and decoding formulas are as follows :
![]() | ![]() |
3 Pre-training and Fine-tuning for UIE
3.1 Pre-training
UIE Need encoded text 、 Map text to structure 、 Decoding structure , The data set of pre training includes three types
- D p a i r D_{pair} Dpair: Text - A parallel corpus of structures , Each data includes token Sequence x And structural records y, Pre trained text to structure mapping ability (UIE), Some negative samples were randomly sampled during pre training (spots、association),loss The formula is as follows :

- D r e c o r d D_{record} Drecord: Structural corpus , Pre training the ability to generate structures ( decoder ),loss The formula is as follows :

- D t e x t D_{text} Dtext: Unstructured text corpus , Use masked language model Way to pre train semantic representation , loss The formula is as follows :

total loss The formula is as follows , At every batch Randomly select data for different tasks in .
3.2 On-Demand Fine-tuning
UIE Fine tune for different dirty tasks , D t a s k = ( s , x , y ) D_{task}={(s,x,y)} Dtask=(s,x,y),loss by teacher-forcing Cross entropy , To mitigate exposure bias , Set up Rejection Mechanism, Insert some randomly [NULL] Node as a negative example SPOTNAME and ASSONAME, Here's the picture 
4 experiment
The supervised experimental results are shown in the figure below :
The experimental results under low resources are shown in the figure below :
Ablation Experiment :

边栏推荐
- Are the two flame retardant standards of European furniture en 597-1 and en 597-2 the same?
- wcdma与LTE的区别
- [technical management] assembly number and sword team
- LeetCode_ String_ Simple_ 387. first unique character in string
- Lua导出为外部链接库并使用
- Bm19 looking for peak
- Stack cognition -- basic use of reverse IDA tools
- One trick: let logs help you make decisions through Yanrong SaaS data service platform +elk
- Bm95 points candy problem
- Application architecture principles
猜你喜欢

众安保险联合阿里健康、慧医天下 探索互联网慢病管理新模式

Lagrange interpolation

Addition of 3DE grid coordinate points and objects

Deeply understand the attention mechanism of map

Encryption crash, is there a future for Web3 games? In depth discussion of 5 papers
![[real topic of the Blue Bridge Cup provincial tournament 35] scratch water reflection children's programming scratch programming explanation of the real topic of the Blue Bridge Cup provincial tournam](/img/02/3a05b21a49036e3fba95fd41c4a048.png)
[real topic of the Blue Bridge Cup provincial tournament 35] scratch water reflection children's programming scratch programming explanation of the real topic of the Blue Bridge Cup provincial tournam

POSIX信号量

Simulation Implementation of list

堆栈认知——栈溢出实例(ret2shellcode)

基于AM4377的EtherCAT主站控制stm32从站
随机推荐
PingCAP 入选 2022 Gartner 云数据库“客户之声”,获评“卓越表现者”最高分
One trick: let logs help you make decisions through Yanrong SaaS data service platform +elk
POSIX信号量
SCAU Software Engineering Fundamentals
How to perform en45545 fire test for battery shell
3DE 三維模型視圖看不到怎麼調整
Bm22 compare version number
RT thread persimmon pie M7 Quanzhi f133 DDR running xboot
主动学习(Active Learning) 概述、策略和不确定性度量
POSIX共享内存
天天在都在谈的S3协议到底是什么?一文带你了解S3背后的故事
焱融科技 YRCloudFile 与安腾普完成兼容认证,共创存储新蓝图
Postman association to complete interface automation test
Bm19 looking for peak
《MATLAB 神经网络43个案例分析》:第27章 LVQ神经网络的预测——人脸朝向识别
服务端socket程序
From demand to open source, how to look at it with new eyes?
Stack cognition - Introduction to heap
Lua导出为外部链接库并使用
Simulation of vector

