当前位置:网站首页>Some superficial understanding of word2vec
Some superficial understanding of word2vec
2022-07-07 10:29:00 【strawberry47】
Recently, a friend asked word2vec What's the matter , So I reviewed the relevant knowledge again , Record some of your thoughts , Prevent forgetting ~
word2vec Is the means to obtain word vectors , It's in NNLM Improved on the basis of .
The training model is essentially a neural network with only one hidden layer .
It comes in two forms ① skip-gram: Predict the middle from both sides ② C-BOW: Predict both sides from the middle ;
Be careful , These two forms only represent two different training methods , Finally, the input layer is taken -> The weight of the hidden layer , As word vector .
During training , With CBOW For example , Suppose the corpus is “ It's a fine day today ”; The input to the model is " today God Of God really good " Six word one-hot vector, The output is a bunch of probabilities , We hope “ gas ” The probability of occurrence is the greatest .
When writing code , Usually called gensim library , The word vector can be trained by inputting the corpus .
Some small training trick:Negative Sampling, Huffman tree
Reference resources :[NLP] Second vector Word2vec The essence of , summary word2vec( Blog written by lab senior brother )
边栏推荐
猜你喜欢
小程序跳转H5,配置业务域名经验教程
LLVM之父Chris Lattner:为什么我们要重建AI基础设施软件
成为优秀的TS体操高手 之 TS 类型体操前置知识储备
Experience sharing of software designers preparing for exams
字符串格式化
Remote meter reading, switching on and off operation command
【acwing】786. Number k
MySQL insert data create trigger fill UUID field value
Programming features of ISP, IAP, ICP, JTAG and SWD
Pdf document signature Guide
随机推荐
String formatting
Pdf document signature Guide
反射效率为什么低?
5个chrome简单实用的日常开发功能详解,赶快解锁让你提升更多效率!
小程序跳转H5,配置业务域名经验教程
Study summary of postgraduate entrance examination in July
Encrypt and decrypt stored procedures (SQL 2008/sql 2012)
Sword finger offer 38 Arrangement of strings [no description written]
.NET配置系统
Word自动生成目录的方法
When there are pointer variable members in the custom type, the return value and parameters of the assignment operator overload must be reference types
施努卡:机器人视觉抓取工作原理 机器视觉抓取
LeetCode 练习——113. 路径总和 II
对存储过程进行加密和解密(SQL 2008/SQL 2012)
Appx code signing Guide
leetcode-303:区域和检索 - 数组不可变
Appx代码签名指南
Postman interface test IV
Chris LATTNER, the father of llvm: why should we rebuild AI infrastructure software
关于hzero-resource报错(groovy.lang.MissingPropertyException: No such property: weight for class)