当前位置:网站首页>[Code] neural symbol generation machine
[Code] neural symbol generation machine
2022-06-10 18:50:00 【User 1908973】
https://github.com/JindongJiang/GNM
Abstract
Harmonizing notation and distributed representation is a critical challenge , It can potentially solve the limitations of current deep learning . lately , Through the generation object centric representation model , Significant progress has been made in this direction . Although the learning recognition model infers the object-centered symbolic representation from the original image in an unsupervised way , Like the bounding box , But no such model can provide another important capability for generating models , That is, according to the structure of the world density of learning ( sampling ). In this paper , We propose a neural symbol generation machine , This is a generation model that combines the advantages of distributed and symbolic representation , Support structured representation of symbolic components and density based generation . These two key attributes are realized through two potential layers , Global distributed potential and structured symbolic potential diagrams for flexible density modeling . To increase the flexibility of the model in this hierarchy , We also proposed StructDRAW prior. Experiments show that , This model is obviously superior to the previous structured representation model and the latest unstructured generation model in terms of structural accuracy and image generation quality . Our code 、 Data sets and training models are available at the following web site https://github.com/JindongJiang/GNM
Introduce
The two core capabilities of human and machine intelligence are the abstract representation of the learning world , And generate imagination in a way that reflects the causal structure of the world . Deep latent variable model , Such as variational automatic encoder (VAEs) [31,39] Provides an elegant probabilistic framework , Learn both skills unsupervised and end-to-end trainable . However , In most VAEs The single distribution vector representation used in provides only weak or implicit structures induced by independent priors in practice . therefore , In expressing complex 、 High dimensional and structured observation , For example, scene images containing various objects , This representation is difficult to express useful structural properties , E.g. modularity 、 Composability and interpretability . However , These features are considered to be the key to solve the current limitations of deep learning in various systems 2 [29] Reasoning and other related abilities [6], Causal learning [40,37], Accountability [13], And the distributed generalization ability of the system [3,46]. By learning to represent observations as combinations of their entity representations , Especially the object - centered scene image mode , Significant progress has been made in addressing this challenge [15,32,18,45,8,17,14,12,33,11,26,48]. These models are equipped with more explicit inductive bias , Such as the spatial position of the object 、 Symbolic representation and synthetic scene modeling , It provides a method to identify and generate a given observation by composition based on the representation of interactive entities . However , Most of these models do not support another key capability of generating models : Generate hypothetical observations by learning the density of observation data . Although this ability to imagine according to the density of possible worlds plays a crucial role in the world models required for planning and model-based reinforcement, for example
[22, 21, 1, 36, 24, 38, 23], In the past, most entity based models can only synthesize artificial images by manually configuring the representation , Not according to the observed density of the bottom layer . although VAEs Support this function [31,19], Lack of explicit synthetic structure in its representation , When generating complex images , It is easy to lose the global structure consistency in practice [44,19]. In this paper , We propose a neural symbol generation machine (GNM), This is a probability generation model , By supporting symbolic entity based representation and distributed representation , It combines the advantages of the two worlds . therefore , The model can express the observed values by symbolic components , And the observation value can be generated according to the basic density . We have two potential levels in GNM Both of these key attributes are implemented in : The top level generates a globally distributed potential representation for flexible density modeling , The underlying layer generates potential structure diagrams based on entity and symbol representation from the global potential . Besides , We proposed StructDRAW, A structural feature graph supported by autoregressive prior , To improve the expression ability of potential structure diagram . In the experiment , We found that in terms of structural accuracy and image clarity , This model is obviously superior to the previous structured representation model and the highly expressive unstructured generation model .
Please refer to the original text for more information .
边栏推荐
- After the qtmqtt source code compilation is set to keepalive, the Ping package timeout error does not return a problem repair (qmqtt:: mqttnopingresponse, qmqtt:: clientprivate:: onpingtimeo)
- [QNX hypervisor 2.2 user manual] 3.2.3 ACPI table and FDT
- MySQL索引失效场景
- Adobe Premiere基础-介绍,配置,快捷键,创建项目,创建序列(一)
- AgI foundation, uncertain reasoning, subjective logic ppt2
- 锐捷x32pro刷openwrt开启无线160MHz
- 如何设置 SaleSmartly 以进行 Google Analytics(分析)跟踪
- 华为云鲲鹏DevKit代码迁移实战
- [QNX hypervisor 2.2 user manual] 3.3 configure guest
- flutter系列之:UI layout简介
猜你喜欢

连续六年稳居中国SDN(软件)市场份额第一

AFL fuzzy multithreading

Adobe Premiere基礎-工具使用(選擇工具,剃刀工具,等常用工具)(三)

Adobe Premiere foundation - Import and export, merge materials, source file compilation, offline (II)

TestNG的HelloWorld例子以及如何在命令行下运行

实时商业智能BI(二):合理的ETL架构设计实现准实时商业智能BI

Data URL

SaleSmartly | 再添新渠道Slack,助你拉近客户关系

两部门发文明确校外培训机构消防安全条件

当前有哪些主流的全光技术方案?-下篇
随机推荐
干货 | 一文搞定 uiautomator2 自动化测试工具使用
Uniapp native JS to convert the Gregorian calendar to the lunar calendar
The value of Bi in the enterprise: business analysis and development decision
企业数据质量管理:如何进行数据质量评估?
[QNX hypervisor 2.2 user manual] 3.2.3 ACPI table and FDT
华为云HCDE上云之路第二期:华为云如何助力制造业中小企业数字化转型?
How can bi help enterprises reduce labor, time and management costs?
Seata安装Window环境
[QNX hypervisor 2.2 user manual] 3.2.2 VM configuration example
Huawei cloud hcde Cloud Road phase II: how does Huawei cloud help small and medium-sized manufacturing enterprises' digital transformation?
Cross domain error: when allowcredentials is true, allowedorigins cannot contain the special value "*“
半导体硅片持续供不应求,胜高长期合约价上涨30%!
nfs网络挂载制作服务器镜像
VMware esxi version number comparison table
AFL fuzzy multithreading
[代码]神经符号生成机器
三部曲套路解bob活命问题
3. Golang并发入门
AgI foundation, uncertain reasoning, subjective logic ppt2
mysql备份和shell脚本手动执行没问题,crontab定时执行失败