当前位置:网站首页>Understanding disentangling in β- VAE paper reading notes
Understanding disentangling in β- VAE paper reading notes
2022-07-06 18:46:00 【zeronose】
List of articles
Preface
article :Understanding disentangling in β-VAE
Link to the original text : link
Understanding disentangling in β-VAE Is based on β-VAE An article from .
First ,β-VAE There are several problems in :
1.β-VAE Just by KL Item adds a super parameter β, It is found that the model has decoupling characteristics , But there is no good explanation for adding a super parameter β Will produce decoupling characteristics .
2.β-VAE Find out , When the decoupling effect is good, the reconstruction effect is not good , When the reconstruction effect is good, the decoupling effect is poor , So we need to balance decoupling and reconstruction .
Based on this ,Understanding disentangling in β-VAE adopt Information bottleneck theory given β-VAE Explanation of decoupling , And for β-VAE We need to balance decoupling and reconstruction , They put forward their own training methods ---- Gradually increase the amount of information of potential variables in the training process .
The original text also introduces VAE And β-VAE, It's just more here , Interested can see my previous article
VAE
β-VAE
One 、 What is the information bottleneck ?
The figure below is the explanation of the information bottleneck in the original text , In fact, information bottleneck describes a constrained optimization goal , The goal is to maximize potential bottlenecks Z And tasks Y Mutual information between , At the same time, discard the input X About Y All irrelevant information .
Drawing is troublesome , Let's make do with it , As shown in the figure below
You can see β-VAE The loss function of is :
L(θ, ϕ, β; x, z) = Eqϕ(z|x)ln pθ(x|z)− βKL (qϕ(z|x) || p(z))
The first term on the right of the equation is the reconstruction term , The second term is the regular term . The second is the information bottleneck of the first , Increase the weight of the second term, that is β Value , Also is to let qϕ(z|x) Closer to the p(z), because p(z) It's the standard Zhengtai distribution , At this time, the implicit variables are limited z It contains x The amount of information , Therefore, the decoupling effect is good, but the reconstruction effect is poor , On the contrary, the decoupling effect is poor , The effect of refactoring is good .
Two 、 New training goals
1. Loss function
among ,γ Fixed to a larger number 1000,C Is a variable number . In the process of training ,C Gradually increase from zero to a value large enough to produce high-quality reconstruction .
The training process here is similar to β-VAE Different ,β-VAE During training , It needs to be fixed first β Value then train , change β You need to retrain after the value . there C Is a variable parameter , It also becomes the amount of information , He is in the process of training , from 0 Gradually increasing .
summary
By controlling the increase of potential posterior coding ability in the training process , Allow the previous average KL The difference increases gradually from zero , Not the original β-VAE Fixed in the target β weighting KL Increase of items . Compared with the result of the original formula , It promotes Robust learning of disentangled representation , Combined with better reconstruction fidelity .
边栏推荐
- About NPM install error 1
- Collection of penetration test information -- use with nmap and other tools
- 上海部分招工市場對新冠陽性康複者拒絕招錄
- 使用cpolar建立一个商业网站(1)
- 287. 寻找重复数
- AcWing 3537.树查找 完全二叉树
- Stm32+mfrc522 completes IC card number reading, password modification, data reading and writing
- Grafana 9.0 is officially released! It's the strongest!
- 使用block实现两个页面之间的传统价值观
- STM32+ENC28J60+UIP协议栈实现WEB服务器示例
猜你喜欢
44所高校入选!分布式智能计算项目名单公示
ORACLE进阶(四)表连接讲解
徐翔妻子应莹回应“股评”:自己写的!
The role of applet in industrial Internet
一种用于夜间和无袖测量血压手臂可穿戴设备【翻译】
There is a sound prompt when inserting a USB flash disk under win10 system, but the drive letter is not displayed
Blue Bridge Cup real question: one question with clear code, master three codes
根据PPG估算血压利用频谱谱-时间深度神经网络【翻】
视频化全链路智能上云?一文详解什么是阿里云视频云「智能媒体生产」
美庐生物IPO被终止:年营收3.85亿 陈林为实控人
随机推荐
Test 1234
wx小程序学习笔记day01
Mathematics in machine learning -- common probability distribution (XIII): Logistic Distribution
[.Net core] solution to error reporting due to too long request length
CSRF漏洞分析
Jdbc driver, c3p0, druid and jdbctemplate dependent jar packages
STM32+MFRC522完成IC卡号读取、密码修改、数据读写
Markdown syntax for document editing (typera)
Summary of performance knowledge points
朗坤智慧冲刺科创板:年营收4亿 拟募资7亿
用于远程医疗的无创、无袖带血压测量【翻译】
With the implementation of MapReduce job de emphasis, a variety of output folders
Wchars, coding, standards and portability - wchars, encodings, standards and portability
Introduction to the use of SAP Fiori application index tool and SAP Fiori tools
epoll()无论涉及wait队列分析
上海部分招工市场对新冠阳性康复者拒绝招录
巨杉数据库首批入选金融信创解决方案!
node の SQLite
Unity资源顺序加载的一个方法
Cobra quick start - designed for command line programs