当前位置:网站首页>Some superficial understanding of word2vec
Some superficial understanding of word2vec
2022-07-07 10:29:00 【strawberry47】
Recently, a friend asked word2vec What's the matter , So I reviewed the relevant knowledge again , Record some of your thoughts , Prevent forgetting ~
word2vec Is the means to obtain word vectors , It's in NNLM Improved on the basis of .
The training model is essentially a neural network with only one hidden layer .
It comes in two forms ① skip-gram: Predict the middle from both sides ② C-BOW: Predict both sides from the middle ;
Be careful , These two forms only represent two different training methods , Finally, the input layer is taken -> The weight of the hidden layer , As word vector .
During training , With CBOW For example , Suppose the corpus is “ It's a fine day today ”; The input to the model is " today God Of God really good " Six word one-hot vector, The output is a bunch of probabilities , We hope “ gas ” The probability of occurrence is the greatest .
When writing code , Usually called gensim library , The word vector can be trained by inputting the corpus .
Some small training trick:Negative Sampling, Huffman tree
Reference resources :[NLP] Second vector Word2vec The essence of , summary word2vec( Blog written by lab senior brother )
边栏推荐
猜你喜欢
PDF文档签名指南
Chris LATTNER, the father of llvm: why should we rebuild AI infrastructure software
OpenGL glLightfv 函数的应用以及光源的相关知识
1321:【例6.3】删数问题(Noip1994)
High number_ Chapter 1 space analytic geometry and vector algebra_ Quantity product of vectors
Appx代码签名指南
求方程ax^2+bx+c=0的根(C语言)
JMeter about setting thread group and time
ThreadLocal会用可不够
Experience sharing of software designers preparing for exams
随机推荐
基于HPC场景的集群任务调度系统LSF/SGE/Slurm/PBS
Socket通信原理和实践
2022.7.3DAY595
AHB bus in stm32_ Apb2 bus_ Apb1 bus what are these
P1031 [NOIP2002 提高组] 均分纸牌
每周推荐短视频:L2级有哪些我们日常中经常会用到的功能?
Postman interface test VI
Study summary of postgraduate entrance examination in September
P2788 数学1(math1)- 加减算式
HDU-2196 树形DP学习笔记
5个chrome简单实用的日常开发功能详解,赶快解锁让你提升更多效率!
Study summary of postgraduate entrance examination in August
Study summary of postgraduate entrance examination in July
The mobile terminal automatically adjusts the page content and font size by setting rem
The method of word automatically generating directory
施努卡:机器视觉定位技术 机器视觉定位原理
浅谈日志中的返回格式封装格式处理,异常处理
P1223 排队接水/1319:【例6.1】排队接水
Download Text, pictures and ab packages used by unitywebrequest Foundation
小程序跳转H5,配置业务域名经验教程