当前位置:网站首页>Thesis reading_ ICD code_ MSMN
Thesis reading_ ICD code_ MSMN
2022-07-03 04:43:00 【xieyan0811】
Introduce
English title :Code Synonyms Do Matter: Multiple Synonyms Matching Network for Automatic ICD Coding
Chinese title : Automatically ICD Encoded synonym matching network
Address of thesis :https://export.arxiv.org/pdf/2203.01515.pdf
field : natural language processing 、 Biomedical
Time of publication :2022
author :Zheng Yuan etc. , Tsinghua University , Alibaba
Source :ACL
Code and data : https://github.com/GanjinZero/ICD-MSMN
Reading time :2022.06.14
Journal entry
By substituting external resources UMLS, Papers collected Synonyms for each code , So as to make up for the electronic medical record and ICD The problem of different synonyms in coding description .
Its algorithm is not as sophisticated as some previous models , But after introducing external resources , The effect is indeed improved a lot .
extensive reading
- Aiming at problems :ICD One meaning multi word problem in coding
- The core approach :
- Put forward Multi synonym matching network (MSMN)
- Use LSTM+ Long attention
- Will encode synonyms As query Focus on different phrases in the description , So as to generate and ICD Coding related representations .
- Use Biaffine ICD code Text representation of similarity , For final classification .
- Understanding after extensive reading :
- After half an hour , Half an hour to tidy up ( This is a short passage )
Method
ICD Encoding synonyms
Use UMLS( Integrated medical language system ) Knowledge map , Yes ICD Code description for extension , First , Describe the code l1 And UMLS Concept unique identifier in CUIs alignment ; And then from UMLS The selections in have the same CUIs Synonyms of English terms , And by deleting hyphens and words “NOS” To add additional synonyms . To each of them ICD Code generation {l2,l3…lM} Text , The following is used N Indicates the number of words contained in each description .
code
Use LSTM As an encoder , Use the pre trained word vector to translate words wi mapping xi, Use d Two way of layer LSTM, Embed words as input , Calculate its hidden layer as a representation .
When encoding synonyms , Encode with the same encoder , Then get its representation with maximum pooling :

Multiple synonyms attention
Inspired by the attention of many heads , In this paper, we use Multiple synonyms attention , Cut the hidden layer into M block (M head ):

here , Use the expression of encoding synonyms qj To query Hj, use Hj and qj Linear transformation of Calculate attention score a; The relevant encoding of text and code synonyms is available Ha Get . Aggregate encoding based text representation v, When you only need to work with When a code matches , Use

classifier
The classifier is used to judge the text S Does it include ICD code l, Based on the previously calculated dependency coding The text means vl and Coded representation qj, Use double affine transformation to measure the similarity of classification .

Before, many models only relied on coding , Therefore, it is necessary to include instances of each coding in the training set , And here it is q Is a text representation based on encoding , therefore , What we learn is The relationship between texts , It has nothing to do with the specific code .
Training
Cross entropy is used to calculate the difference between the prediction probability and the actual label :

边栏推荐
- [set theory] relational representation (relational matrix | examples of relational matrix | properties of relational matrix | operations of relational matrix | relational graph | examples of relationa
- Arthas watch grabs a field / attribute of the input parameter
- [free completion] development of course guidance platform (source code +lunwen)
- 2022 Shandong Province safety officer C certificate examination content and Shandong Province safety officer C certificate examination questions and analysis
- Ffmpeg mix
- [set theory] binary relation (example of binary relation on a | binary relation on a)
- Leetcode simple question: check whether two string arrays are equal
- Preparation for school and professional cognition
- Dive into deep learning - 2.1 data operation & Exercise
- Bugku CTF daily question baby_ flag. txt
猜你喜欢

Career planning of counter attacking College Students

2022 P cylinder filling test content and P cylinder filling simulation test questions

论文阅读_中文医疗模型_ eHealth

Joint set search: merge intervals and ask whether two numbers are in the same set

MPM model and ab pressure test

Jincang KFS data bidirectional synchronization scenario deployment

2022 chemical automation control instrument examination summary and chemical automation control instrument certificate examination

Basic use of continuous integration server Jenkins

How to retrieve the password for opening word files

I've been in software testing for 8 years and worked as a test leader for 3 years. I can also be a programmer if I'm not a professional
随机推荐
Factor stock selection scoring model
RSRS index timing and large and small disc rotation
How to choose cross-border e-commerce multi merchant system
Basic use of continuous integration server Jenkins
[set theory] binary relationship (special relationship type | empty relationship | identity relationship | global relationship | divisive relationship | size relationship)
Leetcode simple question: the key with the longest key duration
Arthas watch grabs a field / attribute of the input parameter
2022 P cylinder filling test content and P cylinder filling simulation test questions
Why should programmers learn microservice architecture if they want to enter a large factory?
Sdl2 + OpenGL glsl practice (Continued)
Small program animation realizes the running lantern and animation object
String matching: find a substring in a string
GFS distributed file system (it's nice to meet it alone)
Know that Chuangyu cloud monitoring - scanv Max update: Ecology OA unauthorized server request forgery and other two vulnerabilities can be detected
X-ray normal based contour rendering
C Primer Plus Chapter 10, question 14 3 × 5 array
Library management system based on SSM
带有注意力RPN和多关系检测器的小样本目标检测网络(提供源码和数据及下载)...
[set theory] binary relationship (definition field | value field | inverse operation | inverse synthesis operation | restriction | image | single root | single value | nature of synthesis operation)
Two drawing interfaces - 1 Matlab style interface