当前位置:网站首页>Some superficial understanding of word2vec
Some superficial understanding of word2vec
2022-07-07 10:29:00 【strawberry47】
Recently, a friend asked word2vec What's the matter , So I reviewed the relevant knowledge again , Record some of your thoughts , Prevent forgetting ~
word2vec Is the means to obtain word vectors , It's in NNLM Improved on the basis of .
The training model is essentially a neural network with only one hidden layer .
It comes in two forms ① skip-gram: Predict the middle from both sides ② C-BOW: Predict both sides from the middle ;
Be careful , These two forms only represent two different training methods , Finally, the input layer is taken -> The weight of the hidden layer , As word vector .
During training , With CBOW For example , Suppose the corpus is “ It's a fine day today ”; The input to the model is " today God Of God really good " Six word one-hot vector, The output is a bunch of probabilities , We hope “ gas ” The probability of occurrence is the greatest .
When writing code , Usually called gensim library , The word vector can be trained by inputting the corpus .
Some small training trick:Negative Sampling, Huffman tree
Reference resources :[NLP] Second vector Word2vec The essence of , summary word2vec( Blog written by lab senior brother )
边栏推荐
- High number_ Chapter 1 space analytic geometry and vector algebra_ Quantity product of vectors
- Postman interface test VII
- 浅谈日志中的返回格式封装格式处理,异常处理
- Word自动生成目录的方法
- Leetcode exercise - 113 Path sum II
- JMeter installation
- Methods of adding centerlines and centerlines in SolidWorks drawings
- Serial communication relay Modbus communication host computer debugging software tool project development case
- P1223 排队接水/1319:【例6.1】排队接水
- LeetCode 练习——113. 路径总和 II
猜你喜欢
随机推荐
Study summary of postgraduate entrance examination in August
优雅的 Controller 层代码
Encrypt and decrypt stored procedures (SQL 2008/sql 2012)
Prototype and prototype chain
Study summary of postgraduate entrance examination in September
[second on] [jeecgboot] modify paging parameters
【华为机试真题详解】高矮个子排队
XML configuration file parsing and modeling
The Hal library is configured with a general timer Tim to trigger ADC sampling, and then DMA is moved to the memory space.
[email protected]能帮助我们快速拿到日志对象
php \n 换行无法输出
Embedded background - chip
EasyExcel读取写入简单使用
Trajectory planning for multi-robot systems: Methods and applications 综述阅读笔记
搭建物联网硬件通信技术几种方案
Kotlin realizes wechat interface switching (fragment exercise)
IO模型复习
About hzero resource error (groovy.lang.missingpropertyexception: no such property: weight for class)
对存储过程进行加密和解密(SQL 2008/SQL 2012)
SolidWorks工程图中添加中心线和中心符号线的办法