Some superficial understanding of word2vec
2022-07-07 10:29:00 【strawberry47】
Recently a friend asked me what word2vec is all about, so I went back over the relevant material and am writing down some of my thoughts here so I don't forget them.
word2vec is a method for learning word vectors; it is an improvement on the neural network language model (NNLM).
The model being trained is essentially a neural network with a single hidden layer.
It comes in two forms: ① skip-gram, which predicts the surrounding context words from the center word; ② CBOW (continuous bag-of-words), which predicts the center word from its surrounding context words.
Note that these are just two different training schemes. In both cases, what we keep at the end is the weight matrix between the input layer and the hidden layer, and its rows serve as the word vectors.
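To make the difference between the two schemes concrete, here is a minimal sketch of how the training pairs could be built from one sentence. The function name `training_pairs`, the English toy sentence, and the window size are my own illustrative choices, not part of any library.

```python
def training_pairs(tokens, window=2):
    """Return (cbow_pairs, skipgram_pairs) for one tokenized sentence."""
    cbow, skipgram = [], []
    for i, center in enumerate(tokens):
        # Context = up to `window` tokens on each side of the center token.
        left = tokens[max(0, i - window):i]
        right = tokens[i + 1:i + 1 + window]
        context = left + right
        if not context:
            continue
        # CBOW: the whole context predicts the center word.
        cbow.append((context, center))
        # Skip-gram: the center word predicts each context word separately.
        skipgram.extend((center, c) for c in context)
    return cbow, skipgram


tokens = ["the", "weather", "is", "really", "nice", "today"]
cbow_pairs, skipgram_pairs = training_pairs(tokens, window=1)
print(cbow_pairs[1])       # (['the', 'is'], 'weather')
print(skipgram_pairs[:3])  # [('the', 'weather'), ('weather', 'the'), ('weather', 'is')]
```

The same sentence yields one CBOW example per position but one skip-gram example per (center, context) pair, which is why skip-gram produces more training examples from the same text.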
Take CBOW training as an example. Suppose the corpus is "今天的天气真好" ("The weather is really nice today"). The input to the model is the one-hot vectors of the six context characters "今 天 的 天 真 好", and the output is a probability distribution over the vocabulary; we want the probability of "气" to be the largest.
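The forward pass for this example can be sketched in plain Python. This is only a rough illustration under my own assumptions: the embedding size `D = 4`, the random initialization, and the names `W_in`, `W_out`, and `cbow_forward` are all made up for the sketch, and it uses a full softmax with no training loop.

```python
import math
import random

random.seed(0)

sentence = list("今天的天气真好")   # 7 characters
vocab = sorted(set(sentence))       # 6 distinct characters
index = {ch: i for i, ch in enumerate(vocab)}
V, D = len(vocab), 4                # vocabulary size, embedding size (D is arbitrary here)

# W_in: input -> hidden weights (V x D); row i is the word vector of vocab[i].
W_in = [[random.uniform(-0.5, 0.5) for _ in range(D)] for _ in range(V)]
# W_out: hidden -> output weights (V x D); one row per candidate output character.
W_out = [[random.uniform(-0.5, 0.5) for _ in range(D)] for _ in range(V)]


def cbow_forward(context):
    # Multiplying a one-hot vector by W_in just selects a row, so we index directly.
    rows = [W_in[index[ch]] for ch in context]
    # Hidden layer = average of the context vectors (no non-linearity).
    hidden = [sum(col) / len(rows) for col in zip(*rows)]
    scores = [sum(w * h for w, h in zip(row, hidden)) for row in W_out]
    # Numerically stable softmax over the whole vocabulary.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


context = [ch for ch in sentence if ch != "气"]   # 今 天 的 天 真 好
probs = cbow_forward(context)
loss = -math.log(probs[index["气"]])             # cross-entropy for the target "气"
print(len(context), len(probs))
```

With untrained weights the probabilities are close to uniform; training adjusts `W_in` and `W_out` to push this loss down, and the rows of `W_in` are what gets kept as word vectors.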
In practice you usually just call the gensim library: feed it a tokenized corpus and it trains the word vectors for you.
A few training tricks make this fast: negative sampling, and hierarchical softmax (which is built on a Huffman tree).
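As a rough sketch of the first trick: instead of a softmax over the whole vocabulary, negative sampling scores the true target against a handful of randomly drawn "noise" words. The helper names, the toy data, and the vector size below are my own assumptions; the 0.75 exponent on the unigram counts is the value used in the original word2vec paper.

```python
import math
import random
from collections import Counter

random.seed(0)

tokens = list("今天的天气真好")
counts = Counter(tokens)
vocab = sorted(counts)
# Noise distribution: unigram counts raised to the 0.75 power.
weights = [counts[w] ** 0.75 for w in vocab]


def sample_negatives(target, k):
    """Draw k noise words, rejecting the true target."""
    negatives = []
    while len(negatives) < k:
        w = random.choices(vocab, weights=weights, k=1)[0]
        if w != target:
            negatives.append(w)
    return negatives


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def negative_sampling_loss(hidden, out_vectors, target, negatives):
    """-log sigmoid(v_target . h) - sum over negatives of log sigmoid(-v_neg . h)."""
    loss = -math.log(sigmoid(dot(out_vectors[target], hidden)))
    for w in negatives:
        loss -= math.log(sigmoid(-dot(out_vectors[w], hidden)))
    return loss


D = 4  # arbitrary vector size for the sketch
out_vectors = {w: [random.uniform(-0.5, 0.5) for _ in range(D)] for w in vocab}
hidden = [random.uniform(-0.5, 0.5) for _ in range(D)]

negatives = sample_negatives("气", k=3)
loss = negative_sampling_loss(hidden, out_vectors, "气", negatives)
print(negatives, loss)
```

Each update then touches only the target and the k sampled output vectors rather than all of them, which is where the speed-up comes from. Hierarchical softmax gets a similar saving differently, by replacing the flat softmax with a sequence of binary decisions along a Huffman tree path.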
References: "[NLP] Understanding Word Vectors in Seconds: The Essence of Word2vec"; "A Summary of word2vec" (a blog post written by a senior labmate)