当前位置:网站首页>Bayesian network explanation
Bayesian network explanation
2022-06-26 17:50:00 【lmn_】

0x01 Overview of Bayesian networks
Sometimes we need to calculate the probability of an uncertain cause and give some observed evidence , In these cases, Bayesian methods can be used .
Bayesian network (bayesian network)
Is a probabilistic graphical model , It explicitly captures the known conditional dependencies of directed edges in the graphical model , It passes through a directed acyclic graph (DAG) Represents a set of variables and their conditional dependencies .
Bayesian networks are well suited for capturing events that have occurred and predicting any of several possible known causes . for example , Bayesian networks can represent the probabilistic relationship between diseases and symptoms . Given symptoms , The network can be used to calculate the probability of the existence of various diseases .
Efficient algorithms can be used for reasoning and learning in Bayesian networks .
Sequence of variables ( for example Speech signal or protein sequence ) The Bayesian network for modeling is called dynamic Bayesian network , The generalization of Bayesian networks that can represent and solve decision problems under uncertainty is called influence graph .

Unlike manual builds , Automatic learning does not require expert knowledge of the underlying domain . Bayesian networks can be automatically learned directly from databases using experience based algorithms that are usually built into appropriate software . But the disadvantage is that the automation construction requires high data .
0x02 Bayesian network model
stay 0x01 Already mentioned in , Bayesian networks are directed acyclic graphs (DAG), Its nodes represent variables in the Bayesian sense .
- There are many types of variables , There are observable quantities 、 Potential variables 、 Unknown parameters or assumptions .
- Use edges to represent conditional dependencies ; Unconnected nodes represent variables that are conditionally independent of each other .
Each node is associated with a probability function , The probability function takes a set of specific values of the node's parent variables as input , The probability of variables represented by nodes is given .
Simplest graph , It can be expressed in this way :

Wiki gives an example
There are two ways to make grass wet : Watering car & It's raining .
Rain will affect the travel of sprinkler , There are two results when the grass becomes wet :T( It means true ) and F( On behalf of false ).
Chain rule of probability for joint probability function :
![]()
- G =“ Grass becomes wet ( really / false )”
- S =“ Sprinkler on ( really / false )”
- R =“ It's raining ( really / false )”
The model can answer questions about the existence effect ( The so-called inverse probability ) Whether there is a problem of cause .
By using the conditional probability formula and summing all the variables :

Use the expansion of the joint probability function Pr(G,S,R)} And the conditional probability table described in the figure (CPT) Conditional probabilities in , Each term in the numerator and denominator can be evaluated .

Then the numerical results ( Subscript by the value of the relevant variable ) yes

Infer unobserved variables
Bayesian networks are often used to answer probabilistic queries about them , When there are other variables , The network can be used to update the knowledge of variable subset states . This process of calculating the posterior distribution is called probabilistic reasoning . When selecting values for a subset of variables , Some expected loss functions can be minimized , For example, the probability of wrong decision . So Bayesian network can be considered as a complicated problem with the mechanism of automatically applying Bayesian theorem .
Parameter learning
Specified in X The parent node of is conditional X Probability distribution of ,X The distribution conditional on the parent can take any form . Discrete or Gaussian distributions are usually used , Because it simplifies the calculation . Sometimes you only know the limits of distribution ; The maximum entropy principle can then be used to determine the individual distributions , That is, the distribution with maximum entropy under given constraints .
Structural learning
The model of Bayesian network can be divided into several forms :
Direct connection : Indicates that the direction of a directed graph is in a straight line , And point in the same direction .
Expressed as :G -> S -> R

Huilian : Two independent nodes point to the same node .
Expressed as :G -> S <- R

Successive : One node points to two different nodes .
Expressed as :G <- S -> R

A particularly fast and accurate BN The learning method is to transform the problem into an optimization problem , And use integer programming to solve . In the solution process, the acyclic constraint is added to the integer program in the form of cutting plane (IP) in , This method can handle up to 100 A variable problem .
Software
Useful software for processing graphical models , There are several options . The most common software packages are Genie、Hugin、BUGS etc. .
边栏推荐
- padding百分比操作
- Army chat -- registration of Registration Center
- Troubleshooting ideas that can solve 80% of faults!
- KDD 2022 | 如何在跨域推荐中使用对比学习?
- Detailed explanation of browser storage methods: the origin and difference of cookies, localstorage and sessionstorage
- Introduction to distributed cache / cache cluster
- 力扣每日一题-第28天-566.重塑矩阵
- Vue--vuerouter cache routing component
- 14 MySQL tutorial insert insert data
- [recommendation system learning] technology stack of recommendation system
猜你喜欢

【推荐系统学习】推荐系统架构

14 MySQL tutorial insert insert data

解决pycharm里面每个字母占一格空格的问题

关于FlowUs这一款国民好笔记

Uncover the secret of Agora lipsync Technology: driving portraits to simulate human speech through real-time voice

pycharm如何修改多行注释快捷键

玩转Linux,轻松安装配置MySQL

Viewing the task arrangement ability of monorepo tool from turborepo

物联网协议的王者:MQTT

Halcon's region: features of multiple regions (5)
随机推荐
Viteconfigure project path alias
Using redis for user access data statistics hyperloglog and bitmap advanced data types
一起备战蓝桥杯与CCF-CSP之大模拟炉石传说
直播预告|程序员进击,如何提升研发效能?6月21日晚视频号、B站同步直播,不见不散!
Leetcode HOT100 (22--- bracket generation)
17.13 补充知识、线程池浅谈、数量谈、总结
I want to know. I am in Zhaoqing. Where can I open an account? Is it safe to open an account online?
Jouer avec Linux et installer et configurer MySQL facilement
用redis做用户访问数据统计HyperLogLog及Bitmap高级数据类型
MySQL add column failed because there was data before, not null by default
Strength and appearance Coexist -- an exclusive interview with Liu Yu, a member of Apache pulsar PMC
LM06丨仅用成交量构造抄底摸顶策略的奥秘
The king of Internet of things protocol: mqtt
9、智慧交通项目(2)
Ndroid development from introduction to mastery Chapter 2: view and ViewGroup
Treasure and niche CTA animation material website sharing
在国金证券开户怎么样?开户安全吗?
RuntimeError: CUDA error: out of memory自己的解决方法(情况比较特殊估计对大部分人不适用)
非对称密码体制详解
VSCode使用 - Remote-SSH 配置说明