当前位置:网站首页>Maximum likelihood estimation, divergence, cross entropy
Maximum likelihood estimation, divergence, cross entropy
2022-07-03 05:48:00 【code bean】
If this message , You can put things that were very uncertain before , To determine the , It shows that this information is very informative !

This picture is to explain , How mathematicians define the process of information quantity :

in other words , The amount of information about Argentina winning the championship = The amount of information about Argentina reaching the finals + The amount of information that Argentina won the finals , however x Itself is probability , So here 1/8 yes x1*x2 Result . And :
![]()
This characteristic is log Peculiar . Then the base number is greater than 1 When ,log Monotone increasing , So add a minus sign to make it decrease separately .
The definition of this information quantity is completed .
Next, let's look at the definition of information entropy :

The process of seeking information entropy is the process of seeking expectation : Multiply the amount of information by the probability of the event , Then sum it .

Information entropy , Describe the “ news ” The degree of uncertainty or confusion . And here it is “ news ” It can be regarded as a probability model . And find the information entropy of the probability model , Is to find the expectation of the probability model !
If you want to compare the distribution differences between the two models , We need a concept called relative entropy (KL The divergence )


Here is a proof , Cross entropy must be greater than entropy :
So if you want the two models to be the closest , That is to find the minimum value of cross entropy . What deep learning does is to make the neural network model approach the human brain model . So the cross entropy of neural network and human brain model , It can be used as a loss function !( The smaller the cross entropy , The closer the two models are )
Then the following function , It's actually , The expansion of cross entropy of a binary classification :

here Y It's human judgment , There are only two possibilities ,1 perhaps 0, and yhead, Is the output value of neural network , It's a probability . The cross entropy formed between them constitutes the loss function of two classifications .

Reference resources :
《PyTorch Deep learning practice 》 Complete the collection _ Bili, Bili _bilibili
边栏推荐
- Mapbox tasting value cloud animation
- 中职网络子网划分例题解析
- Source insight operation manual installation trial
- How to use source insight
- Using the ethtool command by example
- Altaro o365 total backup subscription plan
- 为什么网站打开速度慢?
- How to create and configure ZABBIX
- Jetson AgX Orin platform porting ar0233 gw5200 max9295 camera driver
- PHP笔记超详细!!!
猜你喜欢
![[function explanation (Part 1)] | | knowledge sorting + code analysis + graphic interpretation](/img/c2/991b8febd262cf9237017adc9d1221.jpg)
[function explanation (Part 1)] | | knowledge sorting + code analysis + graphic interpretation

@Import annotation: four ways to import configuration classes & source code analysis

今天很多 CTO 都是被干掉的,因为他没有成就业务

今天很多 CTO 都是被幹掉的,因為他沒有成就業務
![[advanced pointer (1)] | detailed explanation of character pointer, pointer array, array pointer](/img/9e/a4558e8e53c9655cbc1a38e8c0536e.jpg)
[advanced pointer (1)] | detailed explanation of character pointer, pointer array, array pointer

Understand one-way hash function
![[teacher Zhao Yuqiang] MySQL high availability architecture: MHA](/img/a7/2140744ebad9f1dc0a609254cc618e.jpg)
[teacher Zhao Yuqiang] MySQL high availability architecture: MHA

"C and pointer" - Chapter 13 function pointer 1: callback function 2 (combined with template to simplify code)

PHP notes are super detailed!!!

Notepad++ wrap by specified character
随机推荐
2022.7.2day594
大二困局(复盘)
pytorch 搭建神经网络最简版
一起上水硕系列】Day 9
NG Textarea-auto-resize
[set theory] relational closure (reflexive closure | symmetric closure | transitive closure)
[teacher Zhao Yuqiang] MySQL high availability architecture: MHA
Installation du plug - in CAD et chargement automatique DLL, Arx
Shanghai daoning, together with American /n software, will provide you with more powerful Internet enterprise communication and security component services
Final review (Day5)
【一起上水硕系列】Day 7 内容+Day8
[escape character] [full of dry goods] super detailed explanation + code illustration!
redis 遇到 NOAUTH Authentication required
期末复习(Day5)
Redis cannot connect remotely.
Final review (Day7)
Why is the website slow to open?
[teacher Zhao Yuqiang] kubernetes' probe
Using the ethtool command by example
[written examination question analysis] | | get [sizeof and strlen] [pointer and array] graphic explanation + code analysis