当前位置:网站首页>Raki's notes on reading paper: Leveraging type descriptions for zero shot named entity recognition and classification
Raki's notes on reading paper: Leveraging type descriptions for zero shot named entity recognition and classification
2022-06-30 02:39:00 【Sleepy Raki】
Abstract & Introduction & Related Work
Research tasks
Named entity recognition and classificationExisting methods and related work
Facing the challenge
Innovative ideas
- The first method for zero-shot Of NERC Methods
- Introduce a new architecture , Use the text of many entity categories to describe naturally occurring facts
The experimental conclusion
The non entity words observed during the training may belong to one of the test categories , Based on this observation, we propose several different architectures , Based on the cross attention, we can use... Between statement and type description transformer, Combined with the pre training model
We explore modeling negative classes in the following ways
- Use the description of negative classes
- Directly establish the model of negative class
- Model negative classes using the representations generated for the class corresponding to the type

Zero-shot NERC
For each type d, The cross attention coder generates a vector representation v, For each in statement s The words in w
Make a linear transformation ,o Means the word t How likely are they to belong to entity classes c
In order to be able to do more than just classify entities , And identify them , every last token The score of is spliced with the score belonging to the negative category , Corresponds to not belonging to any of the types considered :
Choose the category with the highest score 
Cross-attention Encoder
use bert
Modelling the negative class
Just like No 1 What is discussed in section , Non entity classes create a challenging setup . The non entity words observed during the training may belong to one of the test classes .
Description-based encoding

However , This approach requires the description of something other than . This makes it difficult to make a wise decision on the most appropriate description in practice . in addition , Non physical markers are likely to differ in training and testing , So a fixed description is unlikely to perform well
Independent encoding
Negative classes can be modeled directly , Because it is observed in the training data . therefore , Without considering any description , Each tag is represented as a negative class in the context of the sentence , Instead of exploring cross attention 
Class-aware encoding
Description based coding and independent coding do not model the following facts , That is to say zero-shot Of NERC in , Not every entity marked as non entity during training is non entity during testing . contrary , We propose to model negative classes by combining the representations of other classes generated by the cross attention coder : v t , c 0 , . . . , v t , c k v_{t,c_0}, ..., v_{t,c_k} vt,c0,...,vt,ck. then , Each vector is transformed linearly , Use w n e g − c l w_{neg-cl} wneg−cl, Then concatenate them into a characteristic graph m
Training
To prevent the attention coder from over fitting a small number of category descriptions , We use a regularizer in the form of an entity mask , This regularization avoids vocabulary memory , The model is encouraged to learn the relationship between entity context and category description , While still learning to incorporate aspects of the entity itself ( For example, capital letters 、 shape 、 form ), And relate them to type descriptions
Because negative labels are not balanced , Use 
Evaluation setup



Experiments






Conclusions & Future work
This paper discusses zero point shooting with entity type description NERC The task of , Transfer knowledge from observed classes to unseen classes . We propose a multi class architecture to solve the problem of zero shooting NERC Specific challenges , This architecture uses class aware coding to model negative classes , Thus, the definition of non entity classes is not clear . These models are based on OntoNotes and MedMentions The zero adaptation of the data set was evaluated . Results show , The proposed model is superior to the powerful baseline , And further shows that high-quality entity description ( Note guide ) It is an effective way to transfer knowledge from observed to unseen classes . Future work will aim to incorporate the predicted dependencies between tags
Remark
This article is really about ,NER Just NER, I have to say something NERC, Attention is attention , I have to say cross attention , You seem to be novel Is it? ? I'm speechless
边栏推荐
- CMake教程系列-04-编译相关函数
- Unity3d ugui force refresh of layout components
- Unity TimeLine 数据绑定
- [Postgres] Postgres database migration
- Sitelock nine FAQs
- 三层交换机和二层交换机区别是什么
- SQL injection -day17
- How difficult is the PMP Exam under the new syllabus? Comprehensive analysis
- What about punctuation in the first column of unity text
- NPDP产品经理国际认证考试报名有什么要求?
猜你喜欢

AutoJS代码能加密吗?YES,AutoJS加密技巧展示

Differences among digicert, SECTIONO and globalsign code signing certificates

选购通配符SSL证书注意事项

Jupyter notebook显示k线图集合

Le Code autojs peut - il être chiffré? Oui, présentation des techniques de chiffrement autojs
![[dry goods sharing] the latest WHQL logo certification application process](/img/c3/37277572c70b0af944e594f0965a6c.png)
[dry goods sharing] the latest WHQL logo certification application process

归并排序

Detailed explanation of minimum stack

如何在 JupyterLab 中把 ipykernel 切换到不同的 conda 虚拟环境?

What is certificate transparency CT? How to query CT logs certificate logs?
随机推荐
JMeter obtains cookies across thread groups or JMeter thread groups share cookies
What files does a CA digital certificate contain? How to view SSL certificate information?
归并排序
Detailed explanation of minimum stack
[dry goods sharing] the latest WHQL logo certification application process
PR second training notes
CMake教程系列-02-使用cmake代码生成二进制
RAII内存管理
JS advanced -es6 syntax
Weekly recommended short video: why is the theory correct but can not get the expected results?
Enlightenment from the revocation of Russian digital certificate by mainstream CA: upgrade the SSL certificate of state secret algorithm to help China's network security to be autonomous and controlla
直接插入排序
Global and Chinese markets of liquid optical waveguides 2022-2028: Research Report on technology, participants, trends, market size and share
Unity timeline data binding
走进江苏作家诗人胭脂茉莉|世界读书日
【npm】解决使用npm安装TypeORM的报错问题
Wechat applet page Jump and parameter transfer
Unity TimeLine 数据绑定
threejs 镜子案例Reflector 创建镜子+房子搭建+小球移动
Precautions for purchasing wildcard SSL certificate