Some superficial understanding of word2vec

2022-07-07 10:29:00 【strawberry47】

Recently a friend asked me how word2vec works, so I went back over the relevant material and am writing down some of my thoughts so I don't forget them ~

word2vec is a method for obtaining word vectors; it is an improvement on NNLM (the neural network language model).
The model being trained is essentially a neural network with only one hidden layer.

It comes in two forms: ① skip-gram: predict the surrounding words from the middle word; ② CBOW: predict the middle word from the surrounding words.
Note that these two forms are just two different ways of training; in the end, the weights of the input layer -> hidden layer are taken as the word vectors.
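
To see why the input -> hidden weights can serve directly as word vectors, here is a minimal sketch in plain Python. The vocabulary, the matrix values, and the helper names (`one_hot`, `hidden_layer`) are all made up for illustration. Multiplying a one-hot vector by the weight matrix simply picks out one row, so each row is the vector of one word.

```python
# Toy vocabulary and input -> hidden weight matrix (values invented for illustration).
vocab = ["today", "weather", "nice"]
W_in = [
    [0.1, 0.2],   # row 0: vector for "today"
    [0.3, 0.4],   # row 1: vector for "weather"
    [0.5, 0.6],   # row 2: vector for "nice"
]

def one_hot(index, size):
    """Return a one-hot list with a 1.0 at position `index`."""
    return [1.0 if i == index else 0.0 for i in range(size)]

def hidden_layer(x, W):
    """Compute the row-vector-times-matrix product x @ W."""
    return [sum(x[i] * W[i][j] for i in range(len(W))) for j in range(len(W[0]))]

idx = vocab.index("weather")
hidden = hidden_layer(one_hot(idx, len(vocab)), W_in)
word_vector = W_in[idx]   # the same values, obtained by a plain row lookup
```

In practice the multiplication is never carried out; implementations just index into the matrix.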

During training, taking CBOW as an example, suppose the corpus is "今天的天气真好" ("The weather is really nice today"). The input to the model is the one-hot vectors of the six characters "今 天 的 天 真 好", and the output is a set of probabilities, one for each entry in the vocabulary; we want the probability of "气" to be the largest.
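
The CBOW step above can be sketched in plain Python. This is only an illustration: it assumes the example sentence is 今天的天气真好 split into characters with 气 as the centre character, and the hidden size, the random initial weights, and the function name `cbow_probabilities` are my own choices, not taken from any library.

```python
import math
import random

# Characters of the example sentence; 气 is the centre character to predict.
vocab = ["今", "天", "的", "气", "真", "好"]
context = ["今", "天", "的", "天", "真", "好"]
target = "气"

dim = 4                  # hidden-layer size (arbitrary for this sketch)
rng = random.Random(0)   # fixed seed so the run is repeatable

# W_in: input -> hidden weights, W_out: hidden -> output weights (one row per character).
W_in = [[rng.uniform(-0.5, 0.5) for _ in range(dim)] for _ in vocab]
W_out = [[rng.uniform(-0.5, 0.5) for _ in range(dim)] for _ in vocab]

def cbow_probabilities(context_chars):
    # Averaging the context rows of W_in is the same as feeding the
    # averaged one-hot vectors through the input -> hidden layer.
    rows = [W_in[vocab.index(c)] for c in context_chars]
    hidden = [sum(col) / len(rows) for col in zip(*rows)]
    # One score per vocabulary entry, then a softmax to turn scores into probabilities.
    scores = [sum(h * w for h, w in zip(hidden, out_row)) for out_row in W_out]
    max_score = max(scores)
    exps = [math.exp(s - max_score) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

probs = cbow_probabilities(context)
# Cross-entropy loss for the true centre character; training lowers this value.
loss = -math.log(probs[vocab.index(target)])
```

With untrained weights the probabilities are close to uniform; gradient descent on `loss` is what pushes the probability of 气 up.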

When writing code, you usually just call the gensim library: feed it the corpus and it trains the word vectors.

Some small training tricks: negative sampling, and hierarchical softmax (built on a Huffman tree).
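
As a rough illustration of negative sampling, here is a sketch of the loss for a single training example in plain Python. Instead of a softmax over the whole vocabulary, only the true word and a few sampled "negative" words are scored. The function names and the numbers are my own; how the negatives are sampled is left out.

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def negative_sampling_loss(hidden, positive_vec, negative_vecs):
    """Loss for one example: pull the true word's output vector toward the
    hidden vector and push the sampled negative words' vectors away from it."""
    loss = -math.log(sigmoid(dot(hidden, positive_vec)))
    for neg in negative_vecs:
        loss -= math.log(sigmoid(-dot(hidden, neg)))
    return loss

# Invented vectors: one hidden vector, one true word, two sampled negatives.
hidden = [0.2, -0.1, 0.4]
positive = [0.3, 0.0, 0.5]
negatives = [[-0.2, 0.1, -0.3], [0.1, 0.4, -0.2]]
loss = negative_sampling_loss(hidden, positive, negatives)
```

The cost per example now depends on the number of negatives rather than on the vocabulary size, which is where the speed-up comes from.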
References: "[NLP] The essence of Word2vec word vectors"; "A summary of word2vec" (a blog post written by a senior labmate)


Copyright notice
This article was written by [strawberry47]. Please include a link to the original when reposting. Thanks.
https://yzsam.com/2022/188/202207070814305572.html
