当前位置:网站首页>Maximum likelihood estimation, divergence, cross entropy
Maximum likelihood estimation, divergence, cross entropy
2022-07-03 05:48:00 【code bean】
If this message , You can put things that were very uncertain before , To determine the , It shows that this information is very informative !

This picture is to explain , How mathematicians define the process of information quantity :

in other words , The amount of information about Argentina winning the championship = The amount of information about Argentina reaching the finals + The amount of information that Argentina won the finals , however x Itself is probability , So here 1/8 yes x1*x2 Result . And :
![]()
This characteristic is log Peculiar . Then the base number is greater than 1 When ,log Monotone increasing , So add a minus sign to make it decrease separately .
The definition of this information quantity is completed .
Next, let's look at the definition of information entropy :

The process of seeking information entropy is the process of seeking expectation : Multiply the amount of information by the probability of the event , Then sum it .

Information entropy , Describe the “ news ” The degree of uncertainty or confusion . And here it is “ news ” It can be regarded as a probability model . And find the information entropy of the probability model , Is to find the expectation of the probability model !
If you want to compare the distribution differences between the two models , We need a concept called relative entropy (KL The divergence )


Here is a proof , Cross entropy must be greater than entropy :
So if you want the two models to be the closest , That is to find the minimum value of cross entropy . What deep learning does is to make the neural network model approach the human brain model . So the cross entropy of neural network and human brain model , It can be used as a loss function !( The smaller the cross entropy , The closer the two models are )
Then the following function , It's actually , The expansion of cross entropy of a binary classification :

here Y It's human judgment , There are only two possibilities ,1 perhaps 0, and yhead, Is the output value of neural network , It's a probability . The cross entropy formed between them constitutes the loss function of two classifications .

Reference resources :
《PyTorch Deep learning practice 》 Complete the collection _ Bili, Bili _bilibili
边栏推荐
- Use telnet to check whether the port corresponding to the IP is open
- 牛客网 JS 分隔符
- Linux登录MySQL出现ERROR 1045 (28000): Access denied for user ‘root‘@‘localhost‘ (using password: YES)
- Final review (Day7)
- Today, many CTOs were killed because they didn't achieve business
- [teacher Zhao Yuqiang] index in mongodb (Part 2)
- [function explanation (Part 2)] | [function declaration and definition + function recursion] key analysis + code diagram
- Source insight License Activation
- Complete set of C language file operation functions (super detailed)
- Sorry, this user does not exist!
猜你喜欢

Why is the website slow to open?
![[teacher Zhao Yuqiang] MySQL flashback](/img/93/75998e28fd309880661ea723dc8de6.jpg)
[teacher Zhao Yuqiang] MySQL flashback

PHP笔记超详细!!!

Strategy pattern: encapsulate changes and respond flexibly to changes in requirements

Sophomore dilemma (resumption)

一起上水碩系列】Day 9

Understand one-way hash function

Apt update and apt upgrade commands - what is the difference?

Introduction to redis using Lua script

最大似然估计,散度,交叉熵
随机推荐
1. 两数之和
理解 期望(均值/估计值)和方差
redis 无法远程连接问题。
2022.6.30DAY591
"C and pointer" - Chapter 13 function pointer 1: callback function 2 (combined with template to simplify code)
Source insight automatic installation and licensing
Redhat7 system root user password cracking
Altaro VM backup getting started
Altaro set grandfather parent child (GFS) archiving
PHP笔记超详细!!!
ansible防火墙firewalld设置
Apache+php+mysql environment construction is super detailed!!!
今天很多 CTO 都是被干掉的,因为他没有成就业务
[teacher Zhao Yuqiang] the most detailed introduction to PostgreSQL architecture in history
[trivia of two-dimensional array application] | [simple version] [detailed steps + code]
Latest version of source insight
NG Textarea-auto-resize
[advanced pointer (2)] | [function pointer, function pointer array, callback function] key analysis + code explanation
[together Shangshui Shuo series] day 7 content +day8
[function explanation (Part 2)] | [function declaration and definition + function recursion] key analysis + code diagram