NCE in detail: a softmax approximation
2022-07-03 03:15:00 【ChaoFeiLi】
While studying contrastive learning I came across the NCE loss. I was afraid the original blog post might disappear, so I am recording it here.
Source: "Can someone explain NCE loss in plain terms?" - Zhihu
Classification problems in deep learning involve computing a softmax. When the number of target classes is small, the standard softmax formula can be evaluated directly. When the number of classes is large, an approximation is needed to avoid the cost of the normalization term in the softmax.
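To make the cost concrete, here is a minimal sketch of the standard softmax in NumPy. The vocabulary size and the random scores are assumptions chosen only for illustration. Note that the normalizer touches every class, so getting even one probability requires all the scores.

```python
import numpy as np

def softmax(scores):
    """Standard softmax over a 1-D array of class scores."""
    # Subtract the max for numerical stability; the result is unchanged.
    shifted = scores - np.max(scores)
    exp_scores = np.exp(shifted)
    # The normalizer sums over every class, so its cost grows with the class count.
    return exp_scores / exp_scores.sum()

# Hypothetical scores for a 50,000-word vocabulary (random stand-ins for model outputs).
rng = np.random.default_rng(0)
vocab_size = 50_000
scores = rng.normal(size=vocab_size)
probs = softmax(scores)
```

In a language model this sum is recomputed for every predicted position, which is why large vocabularies motivate approximations.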
Taking language modeling in natural language processing as an example, this post explains NCE, a sampling-based method for approximating the softmax, from theory to practice.
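As a rough preview, the sketch below follows the commonly used binary-classification formulation of NCE: the true word is labelled 1, k words drawn from a noise distribution q are labelled 0, and the model score is treated as an unnormalized log-probability. The function and argument names are hypothetical, and the uniform noise distribution and random scores are assumptions for illustration, not something taken from the original post.

```python
import numpy as np

def log_sigmoid(x):
    # Numerically stable log(sigmoid(x)) = -log(1 + exp(-x)).
    return -np.logaddexp(0.0, -x)

def nce_loss(target_score, noise_scores, target_noise_prob, noise_noise_probs):
    """NCE loss for one (context, target word) pair.

    target_score      : unnormalized model score s(w) of the true word
    noise_scores      : scores s(w_i) of k words sampled from the noise distribution q
    target_noise_prob : q(w) of the true word
    noise_noise_probs : q(w_i) of each sampled noise word
    """
    k = len(noise_scores)
    # Logit of "this word came from the data rather than from the noise".
    true_logit = target_score - np.log(k * target_noise_prob)
    noise_logits = noise_scores - np.log(k * noise_noise_probs)
    # Binary cross-entropy; log(1 - sigmoid(x)) equals log_sigmoid(-x).
    return -(log_sigmoid(true_logit) + log_sigmoid(-noise_logits).sum())

# Toy usage with assumed values.
rng = np.random.default_rng(0)
vocab_size, k = 10_000, 5
q = np.full(vocab_size, 1.0 / vocab_size)   # uniform noise distribution (assumption)
scores = rng.normal(size=vocab_size)        # stand-in for model outputs
target = 42
noise_ids = rng.choice(vocab_size, size=k, p=q)
loss = nce_loss(scores[target], scores[noise_ids], q[target], q[noise_ids])
```

Only k + 1 scores enter the loss, instead of the full sum over the vocabulary that the softmax normalizer requires.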
Theoretical review
Logistic regression and softmax regression are two basic classification models, and both are linear models. The former mainly handles binary classification; the latter mainly handles multi-class classification. In fact, softmax regression is the general form of logistic regression.
Logistic Regression
The logistic regression model (hypothesis function) is:
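In standard notation, the hypothesis is the sigmoid of a linear score. The symbols θ for the parameter vector and x for the input are assumed here, since the original formula is not reproduced in this text.

```latex
h_\theta(x) = \sigma(\theta^{\top} x) = \frac{1}{1 + e^{-\theta^{\top} x}},
\qquad
P(y = 1 \mid x; \theta) = h_\theta(x),
\quad
P(y = 0 \mid x; \theta) = 1 - h_\theta(x)
```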
Softmax Regression
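In standard notation, softmax regression assigns each of K classes its own parameter vector and normalizes the exponentiated scores. The symbols θ_k, x, and K are assumptions for illustration.

```latex
P(y = k \mid x; \theta) = \frac{e^{\theta_k^{\top} x}}{\sum_{j=1}^{K} e^{\theta_j^{\top} x}},
\qquad k = 1, \dots, K
```

With K = 2 this collapses to the sigmoid form of logistic regression, with the difference of the two parameter vectors acting as the single parameter:

```latex
P(y = 1 \mid x; \theta)
= \frac{e^{\theta_1^{\top} x}}{e^{\theta_1^{\top} x} + e^{\theta_2^{\top} x}}
= \frac{1}{1 + e^{-(\theta_1 - \theta_2)^{\top} x}}
```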