当前位置:网站首页>Thesis reading_ ICD code_ MSMN
Thesis reading_ ICD code_ MSMN
2022-07-03 04:43:00 【xieyan0811】
Introduce
English title :Code Synonyms Do Matter: Multiple Synonyms Matching Network for Automatic ICD Coding
Chinese title : Automatically ICD Encoded synonym matching network
Address of thesis :https://export.arxiv.org/pdf/2203.01515.pdf
field : natural language processing 、 Biomedical
Time of publication :2022
author :Zheng Yuan etc. , Tsinghua University , Alibaba
Source :ACL
Code and data : https://github.com/GanjinZero/ICD-MSMN
Reading time :2022.06.14
Journal entry
By substituting external resources UMLS, Papers collected Synonyms for each code , So as to make up for the electronic medical record and ICD The problem of different synonyms in coding description .
Its algorithm is not as sophisticated as some previous models , But after introducing external resources , The effect is indeed improved a lot .
extensive reading
- Aiming at problems :ICD One meaning multi word problem in coding
- The core approach :
- Put forward Multi synonym matching network (MSMN)
- Use LSTM+ Long attention
- Will encode synonyms As query Focus on different phrases in the description , So as to generate and ICD Coding related representations .
- Use Biaffine ICD code Text representation of similarity , For final classification .
- Understanding after extensive reading :
- After half an hour , Half an hour to tidy up ( This is a short passage )
Method
ICD Encoding synonyms
Use UMLS( Integrated medical language system ) Knowledge map , Yes ICD Code description for extension , First , Describe the code l1 And UMLS Concept unique identifier in CUIs alignment ; And then from UMLS The selections in have the same CUIs Synonyms of English terms , And by deleting hyphens and words “NOS” To add additional synonyms . To each of them ICD Code generation {l2,l3…lM} Text , The following is used N Indicates the number of words contained in each description .
code
Use LSTM As an encoder , Use the pre trained word vector to translate words wi mapping xi, Use d Two way of layer LSTM, Embed words as input , Calculate its hidden layer as a representation .
When encoding synonyms , Encode with the same encoder , Then get its representation with maximum pooling :

Multiple synonyms attention
Inspired by the attention of many heads , In this paper, we use Multiple synonyms attention , Cut the hidden layer into M block (M head ):

here , Use the expression of encoding synonyms qj To query Hj, use Hj and qj Linear transformation of Calculate attention score a; The relevant encoding of text and code synonyms is available Ha Get . Aggregate encoding based text representation v, When you only need to work with When a code matches , Use

classifier
The classifier is used to judge the text S Does it include ICD code l, Based on the previously calculated dependency coding The text means vl and Coded representation qj, Use double affine transformation to measure the similarity of classification .

Before, many models only relied on coding , Therefore, it is necessary to include instances of each coding in the training set , And here it is q Is a text representation based on encoding , therefore , What we learn is The relationship between texts , It has nothing to do with the specific code .
Training
Cross entropy is used to calculate the difference between the prediction probability and the actual label :

边栏推荐
- Kubernetes source code analysis (I)
- JS multidimensional array to one-dimensional array
- 2022-02-13 (347. Top k high frequency elements)
- [set theory] relational representation (relational matrix | examples of relational matrix | properties of relational matrix | operations of relational matrix | relational graph | examples of relationa
- [set theory] binary relationship (special relationship type | empty relationship | identity relationship | global relationship | divisive relationship | size relationship)
- How to choose cross-border e-commerce multi merchant system
- Handling record of electric skateboard detained by traffic police
- [set theory] binary relation (example of binary relation operation | example of inverse operation | example of composite operation | example of limiting operation | example of image operation)
- Ffmpeg tanscoding transcoding
- [tools run SQL blind note]
猜你喜欢

7. Integrated learning

MediaTek 2023 IC written examination approved in advance (topic)

2022 P cylinder filling test content and P cylinder filling simulation test questions

After reviewing MySQL for a month, I was stunned when the interviewer of Alibaba asked me

Leetcode simple question: the key with the longest key duration

The programmer went to bed at 12 o'clock in the middle of the night, and the leader angrily scolded: go to bed so early, you are very good at keeping fit

Employee attendance management system based on SSM

2022 Shandong Province safety officer C certificate examination content and Shandong Province safety officer C certificate examination questions and analysis
![[free completion] development of course guidance platform (source code +lunwen)](/img/14/7c1c822bda050a805fa7fc25b802a4.jpg)
[free completion] development of course guidance platform (source code +lunwen)

The simple problem of leetcode: dismantling bombs
随机推荐
雇佣收银员(差分约束)
The programmer went to bed at 12 o'clock in the middle of the night, and the leader angrily scolded: go to bed so early, you are very good at keeping fit
How to use kotlin to improve productivity: kotlin tips
论文阅读_中文NLP_ELECTRA
Number of 1 in binary (simple difficulty)
"Niuke brush Verilog" part II Verilog advanced challenge
2022 registration examination for safety production management personnel of hazardous chemical production units and examination skills for safety production management personnel of hazardous chemical
The least operation of leetcode simple problem makes the array increment
论文阅读_清华ERNIE
文献阅读_基于多模态数据语义融合的旅游在线评论有用性识别研究(中文文献)
First + only! Alibaba cloud's real-time computing version of Flink passed the stability test of big data products of the Institute of ICT
IPhone x forgot the boot password
Ffmpeg mix
Small sample target detection network with attention RPN and multi relationship detector (provide source code, data and download)
Some information about the developer environment in Chengdu
4 years of experience to interview test development, 10 minutes to end, ask too
Shell script -- condition judgment
[set theory] binary relationship (definition field | value field | inverse operation | inverse synthesis operation | restriction | image | single root | single value | nature of synthesis operation)
Market status and development prospect prediction of the global fire extinguisher industry in 2022
[set theory] binary relationship (special relationship type | empty relationship | identity relationship | global relationship | divisive relationship | size relationship)