当前位置:网站首页>Maximum likelihood estimation, divergence, cross entropy
Maximum likelihood estimation, divergence, cross entropy
2022-07-03 05:48:00 【code bean】
If this message , You can put things that were very uncertain before , To determine the , It shows that this information is very informative !

This picture is to explain , How mathematicians define the process of information quantity :

in other words , The amount of information about Argentina winning the championship = The amount of information about Argentina reaching the finals + The amount of information that Argentina won the finals , however x Itself is probability , So here 1/8 yes x1*x2 Result . And :
![]()
This characteristic is log Peculiar . Then the base number is greater than 1 When ,log Monotone increasing , So add a minus sign to make it decrease separately .
The definition of this information quantity is completed .
Next, let's look at the definition of information entropy :

The process of seeking information entropy is the process of seeking expectation : Multiply the amount of information by the probability of the event , Then sum it .

Information entropy , Describe the “ news ” The degree of uncertainty or confusion . And here it is “ news ” It can be regarded as a probability model . And find the information entropy of the probability model , Is to find the expectation of the probability model !
If you want to compare the distribution differences between the two models , We need a concept called relative entropy (KL The divergence )


Here is a proof , Cross entropy must be greater than entropy :
So if you want the two models to be the closest , That is to find the minimum value of cross entropy . What deep learning does is to make the neural network model approach the human brain model . So the cross entropy of neural network and human brain model , It can be used as a loss function !( The smaller the cross entropy , The closer the two models are )
Then the following function , It's actually , The expansion of cross entropy of a binary classification :

here Y It's human judgment , There are only two possibilities ,1 perhaps 0, and yhead, Is the output value of neural network , It's a probability . The cross entropy formed between them constitutes the loss function of two classifications .

Reference resources :
《PyTorch Deep learning practice 》 Complete the collection _ Bili, Bili _bilibili
边栏推荐
- Altaro virtual machine replication failed: "unsupported file type vmgs"
- 今天很多 CTO 都是被干掉的,因为他没有成就业务
- "C and pointer" - Chapter 13 function of function pointer 1 - callback function 1
- Source insight License Activation
- Notepad++ wrap by specified character
- Introduction to redis using Lua script
- pytorch 多分类中的损失函数
- Configure DTD of XML file
- Apt update and apt upgrade commands - what is the difference?
- 最大似然估计,散度,交叉熵
猜你喜欢

Kubernetes resource object introduction and common commands (V) - (configmap)
![[teacher Zhao Yuqiang] MySQL flashback](/img/93/75998e28fd309880661ea723dc8de6.jpg)
[teacher Zhao Yuqiang] MySQL flashback
![[teacher Zhao Yuqiang] RDB persistence of redis](/img/cc/5509b62756dddc6e5d4facbc6a7c5f.jpg)
[teacher Zhao Yuqiang] RDB persistence of redis

Final review (Day5)
![[video of Teacher Zhao Yuqiang's speech on wot] redis high performance cache and persistence](/img/a7/2140744ebad9f1dc0a609254cc618e.jpg)
[video of Teacher Zhao Yuqiang's speech on wot] redis high performance cache and persistence

"C and pointer" - Chapter 13 advanced pointer int * (* (* (*f) () [6]) ()
![[branch and cycle] | | super long detailed explanation + code analysis + a trick game](/img/aa/543d4f0dcbcd664be963579af77ec9.jpg)
[branch and cycle] | | super long detailed explanation + code analysis + a trick game
![[escape character] [full of dry goods] super detailed explanation + code illustration!](/img/33/ec5a5e11bfd43f53f2767a9a0f0cc9.jpg)
[escape character] [full of dry goods] super detailed explanation + code illustration!

Beaucoup de CTO ont été tués aujourd'hui parce qu'il n'a pas fait d'affaires
![[teacher Zhao Yuqiang] index in mongodb (Part 1)](/img/2d/277ec737f2a7065831a19d036e61e1.jpg)
[teacher Zhao Yuqiang] index in mongodb (Part 1)
随机推荐
[function explanation (Part 1)] | | knowledge sorting + code analysis + graphic interpretation
2022.6.30DAY591
Source insight automatic installation and licensing
深度学习,从一维特性输入到多维特征输入引发的思考
The server data is all gone! Thinking caused by a RAID5 crash
Redhat7 system root user password cracking
2022.7.2 simulation match
Can altaro back up Microsoft teams?
Use telnet to check whether the port corresponding to the IP is open
配置xml文件的dtd
[untitled]
Export the altaro event log to a text file
[Zhao Yuqiang] deploy kubernetes cluster with binary package
32GB Jetson Orin SOM 不能刷机问题排查
Installation du plug - in CAD et chargement automatique DLL, Arx
MySQL startup error: several solutions to the server quit without updating PID file
[teacher Zhao Yuqiang] MySQL high availability architecture: MHA
If function of MySQL
期末复习DAY8
Ensemble, série shuishu] jour 9