当前位置:网站首页>Maximum likelihood estimation, divergence, cross entropy
Maximum likelihood estimation, divergence, cross entropy
2022-07-03 05:48:00 【code bean】
If this message , You can put things that were very uncertain before , To determine the , It shows that this information is very informative !

This picture is to explain , How mathematicians define the process of information quantity :

in other words , The amount of information about Argentina winning the championship = The amount of information about Argentina reaching the finals + The amount of information that Argentina won the finals , however x Itself is probability , So here 1/8 yes x1*x2 Result . And :
![]()
This characteristic is log Peculiar . Then the base number is greater than 1 When ,log Monotone increasing , So add a minus sign to make it decrease separately .
The definition of this information quantity is completed .
Next, let's look at the definition of information entropy :

The process of seeking information entropy is the process of seeking expectation : Multiply the amount of information by the probability of the event , Then sum it .

Information entropy , Describe the “ news ” The degree of uncertainty or confusion . And here it is “ news ” It can be regarded as a probability model . And find the information entropy of the probability model , Is to find the expectation of the probability model !
If you want to compare the distribution differences between the two models , We need a concept called relative entropy (KL The divergence )


Here is a proof , Cross entropy must be greater than entropy :
So if you want the two models to be the closest , That is to find the minimum value of cross entropy . What deep learning does is to make the neural network model approach the human brain model . So the cross entropy of neural network and human brain model , It can be used as a loss function !( The smaller the cross entropy , The closer the two models are )
Then the following function , It's actually , The expansion of cross entropy of a binary classification :

here Y It's human judgment , There are only two possibilities ,1 perhaps 0, and yhead, Is the output value of neural network , It's a probability . The cross entropy formed between them constitutes the loss function of two classifications .

Reference resources :
《PyTorch Deep learning practice 》 Complete the collection _ Bili, Bili _bilibili
边栏推荐
- Installation of CAD plug-ins and automatic loading of DLL and ARX
- Method of finding prime number
- 1. 两数之和
- Azure file synchronization of altaro: the end of traditional file servers?
- [teacher Zhao Yuqiang] redis's slow query log
- [function explanation (Part 2)] | [function declaration and definition + function recursion] key analysis + code diagram
- JS implements the problem of closing the current child window and refreshing the parent window
- AtCoder Beginner Contest 258(A-D)
- Xaml gradient issue in uwp for some devices
- Strategy pattern: encapsulate changes and respond flexibly to changes in requirements
猜你喜欢

Redis使用Lua脚本简介
![[teacher Zhao Yuqiang] MySQL flashback](/img/93/75998e28fd309880661ea723dc8de6.jpg)
[teacher Zhao Yuqiang] MySQL flashback

Redhat7 system root user password cracking

【一起上水硕系列】Day 7 内容+Day8

2022.DAY592

Altaro o365 total backup subscription plan

Personal outlook | looking forward to the future from Xiaobai's self analysis and future planning

@Import annotation: four ways to import configuration classes & source code analysis
![[teacher Zhao Yuqiang] Alibaba cloud big data ACP certified Alibaba big data product system](/img/cc/5509b62756dddc6e5d4facbc6a7c5f.jpg)
[teacher Zhao Yuqiang] Alibaba cloud big data ACP certified Alibaba big data product system

Map的扩容机制
随机推荐
NG Textarea-auto-resize
How to create and configure ZABBIX
QT read write excel -- qxlsx insert chart 5
70 shell script interview questions and answers
"C and pointer" - Chapter 13 function of function pointer 1 - callback function 1
C 语言文件操作函数大全 (超详细)
Notepad++ wrap by specified character
How to install and configure altaro VM backup for VMware vSphere
1. 两数之和
Btrfs and ext4 - features, strengths and weaknesses
Life is a process of continuous learning
为什么网站打开速度慢?
Detailed explanation of iptables (1): iptables concept
Linux登录MySQL出现ERROR 1045 (28000): Access denied for user ‘root‘@‘localhost‘ (using password: YES)
ES 2022 正式发布!有哪些新特性?
2022.7.2day594
The server data is all gone! Thinking caused by a RAID5 crash
2022.6.30DAY591
If function of MySQL
[teacher Zhao Yuqiang] kubernetes' probe