当前位置:网站首页>Bilstm and CRF
Bilstm and CRF
2022-06-25 17:38:00 【Green Lantern swordsman】
I recently watched Mr. Huang's video class , All of a sudden, I feel strange to these contents , So I looked for it . Now take a note of
One 、 frame
The choice of frame , I think Lao Huang chose the same picture , Just the author has something to expect , So link directly Add link description Enclosed .
Here's a little , The diagram here is very important . Be careful ,LSTM In the output of , Output per word as label Probability .
Two 、LSTM Parameters of
- Parameter calculation
- LSTM Official documents of
- LSTM The structure diagram of is as follows

lstm The calculation formula of is :
- GRU The structure diagram of is :

In the picture zt and rt They represent update gate and reset gate respectively . The update gate is used to control the extent to which the previous status information is brought into the current status , The larger the value of the update door is, the less the status information of the previous time is brought in . The reset gate controls how much information is written to the current candidate set from the previous state h~t On , The smaller the reset door is , The less information about the previous state is written .
The update door is GRU Main essence of . Formula analysis , It mainly looks at the writing of the renewal door
Be careful :rt and zt from h(t-1) and xt from , In fact, it includes the correlation between them .
3、 ... and 、CRF Detailed introduction
Refer to the big brother's blog Blog , I found the most impressive English explanation Add link description . Thus deepening the understanding of chapter one Understanding in .
Besides , I remember relying on templates ,crf It can be learned by machine itself ( Add the template ,U and B Templates ) To carry out BIO Study . My notes have .
BILSTM What we're doing is Each word for each label (BIO) The launch probability of ,CRF What we're doing is The transition probability between words .
loss function yes : The most path And full path functions .
forecast : viterbi algorithm .
Why does Viterbi algorithm not exist The label bias problem of maximum entropy model ?
answer : because viterbi algorithm Normalization of yes Global normalization of all paths ; Normalization of maximum entropy model It's from previous Local normalization of departure , Local normalization can cause local problems , namely Label offset problem , For details, see https://www.bbsmax.com/A/D854D91p5E/.
边栏推荐
猜你喜欢
随机推荐
College Students' hot summer exchange, Rog star product phantom 16 flipped version / phantom 13 / phantom x appointment
【Matlab】数据统计分析
golang list to string
上线移动ERP系统有哪些步骤?环环紧扣很重要
CVPR small target detection: context and attention mechanisms improve small target detection (attached with paper Download)
Solution to the problem of incorrect clock in FreeRTOS kernel
ES6知识点
golang sort slice int
【Matlab】数值微积分与方程求解
杰理之唤醒口使用注意事项【篇】
杰理之如何给外界输出一个时钟源使用【篇】
DDD concept is complex and difficult to understand. How to design code implementation model in practice?
Mathematical modeling -- integer programming
Sword finger offer II 035 Minimum time difference
学习太极创客 — MQTT(一)MQTT 是什么
Uncover ges super large scale graph computing engine hyg: Graph Segmentation
HMS Core机器学习服务实现同声传译,支持中英文互译和多种音色语音播报
cgi通过odbc连接数据库
Sentinel哨兵机制
RuntimeError: Trying to backward through the graph a second time (or directly access saved variable








