Maximum Likelihood Estimation, Divergence, Cross-Entropy
2022-07-03 05:45:00 【code bean】
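If a piece of information turns something that was previously very uncertain into something certain, then that piece of information carries a lot of information.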

This figure walks through how mathematicians arrived at the definition of information content:

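In other words, the information content of "Argentina wins the championship" = the information content of "Argentina reaches the final" + the information content of "Argentina wins the final". But x itself is a probability, so the 1/8 here is the product x1 * x2. That is:
f(x1 · x2) = f(x1) + f(x2)
Turning a product into a sum is a property specific to the logarithm. When the base is greater than 1, log is monotonically increasing, so a minus sign is added to make it monotonically decreasing: the less likely an event is, the more information it carries. This gives f(x) = -log(x).
That completes the definition of information content.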
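The additivity above can be checked numerically. This is a minimal sketch: base 2 is an assumption (the text only requires a base greater than 1), and the probabilities 1/4 and 1/2 are illustrative values chosen so that their product is the 1/8 mentioned above.

```python
import math

def information(p: float) -> float:
    """Information content in bits of an event with probability p (0 < p <= 1)."""
    return -math.log2(p)

# Assumed, illustrative probabilities whose product is 1/8.
p_reach_final = 1 / 4
p_win_final = 1 / 2
p_champion = p_reach_final * p_win_final

total = information(p_champion)
parts = information(p_reach_final) + information(p_win_final)
print(total, parts)  # 3.0 3.0
```

The rarer event (probability 1/8) carries 3 bits, which is exactly the 2 bits and 1 bit of its two stages added together.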
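Next, the definition of information entropy:

Computing the information entropy is computing an expectation: multiply the information content of each event by the probability of that event, then sum.

Information entropy describes how uncertain, or how disordered, a "message" is. The "message" here can be viewed as a probability model, so the information entropy of a probability model is simply the expected information content under that model.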
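As a minimal sketch of that expectation, the function below sums probability times information content over all outcomes. Base 2 and the three coin distributions are assumptions used only for illustration.

```python
import math

def entropy(probs):
    """Shannon entropy in bits: sum of p * (-log2 p); outcomes with p == 0 contribute nothing."""
    return sum(-p * math.log2(p) for p in probs if p > 0)

fair_coin = [0.5, 0.5]    # maximally uncertain for two outcomes
biased_coin = [0.9, 0.1]  # fairly predictable
certain = [1.0, 0.0]      # no uncertainty at all

for name, dist in [("fair", fair_coin), ("biased", biased_coin), ("certain", certain)]:
    print(name, entropy(dist))
```

The fair coin has the highest entropy (1 bit), the biased coin less, and the certain outcome none, which matches the reading of entropy as a degree of uncertainty.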
To compare how the distributions of two models differ, we need a concept called relative entropy (KL divergence).
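For reference, the standard definition of KL divergence can be written in terms of the information content defined earlier. The symbols P, Q, p_i and q_i are notation introduced here for illustration (P is the reference model, Q the model being compared), not taken from the original figures.

```latex
D_{KL}(P \,\|\, Q)
  = \sum_i p_i \bigl[ (-\log q_i) - (-\log p_i) \bigr]
  = \underbrace{\sum_i p_i (-\log q_i)}_{H(P,Q)\ \text{(cross-entropy)}}
  - \underbrace{\sum_i p_i (-\log p_i)}_{H(P)\ \text{(entropy)}}
```

Read this way, KL divergence is the expected extra information content incurred by describing events from P with the model Q, and it splits into a cross-entropy term minus the entropy of P.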


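There is a proof that cross-entropy is always greater than or equal to entropy, with equality only when the two distributions are identical:
So, because the entropy of the reference model is fixed, making two models as close as possible means minimizing their cross-entropy. What deep learning does is make the neural-network model approximate the human-brain model, so the cross-entropy between the neural network and the human-brain model can serve as the loss function (the smaller the cross-entropy, the closer the two models).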
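The inequality is not proved here, but it can be checked numerically. The sketch below assumes discrete distributions in which `q` is nonzero wherever `p` is nonzero; the three example distributions are made up for illustration.

```python
import math

def entropy(p):
    """Entropy of p in bits."""
    return sum(-pi * math.log2(pi) for pi in p if pi > 0)

def cross_entropy(p, q):
    """Cross-entropy of q relative to p in bits (q must be > 0 wherever p > 0)."""
    return sum(-pi * math.log2(qi) for pi, qi in zip(p, q) if pi > 0)

def kl_divergence(p, q):
    """KL divergence as cross-entropy minus entropy."""
    return cross_entropy(p, q) - entropy(p)

p = [0.5, 0.25, 0.25]    # reference model
q_near = [0.4, 0.3, 0.3] # close to p
q_far = [0.1, 0.1, 0.8]  # far from p

print(entropy(p))
print(cross_entropy(p, q_near), kl_divergence(p, q_near))
print(cross_entropy(p, q_far), kl_divergence(p, q_far))
```

The cross-entropy never drops below the entropy of `p`, and it shrinks as `q` moves toward `p`, which is why it works as a quantity to minimize.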
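The following function is in fact the expansion of a cross-entropy for binary classification:

Here y is the human judgment, which has only two possible values, 1 or 0, and ŷ (y-hat) is the output of the neural network, which is a probability. The cross-entropy between them forms the loss function for binary classification.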
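A minimal sketch of that loss, using the standard binary cross-entropy form -[y·log(ŷ) + (1 - y)·log(1 - ŷ)]. Averaging over samples, the natural logarithm, the `eps` clipping, and the example labels and predictions are all assumptions made for this illustration.

```python
import math

def binary_cross_entropy(y_true, y_pred, eps=1e-12):
    """Mean binary cross-entropy between 0/1 labels and predicted probabilities."""
    total = 0.0
    for y, y_hat in zip(y_true, y_pred):
        # Clip predictions away from 0 and 1 so that log() stays finite.
        y_hat = min(max(y_hat, eps), 1 - eps)
        total += -(y * math.log(y_hat) + (1 - y) * math.log(1 - y_hat))
    return total / len(y_true)

labels = [1, 0, 1, 1]                   # human judgments
good_model = [0.9, 0.1, 0.8, 0.7]       # predictions that agree with the labels
bad_model = [0.4, 0.6, 0.3, 0.2]        # predictions that mostly disagree

print(binary_cross_entropy(labels, good_model))
print(binary_cross_entropy(labels, bad_model))
```

For each sample only one of the two terms is active: when y is 1 the loss is -log(ŷ), and when y is 0 it is -log(1 - ŷ), so predictions that agree with the labels give the smaller loss.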
