当前位置:网站首页>Large scale visual relationship understanding
Large scale visual relationship understanding
2022-06-29 01:36:00 【Respect the heaven and turn the spirit into God】
Thesis link : The paper
Brief introduction of the paper
This is an article AAAI 2019 The paper of , This paper aims at a large-scale visual understanding problem , In fact, it deals with the wide distribution of visual relations and the imbalance of data . This paper develops a new relationship detection model , Embed objects and relationships into two vector spaces , At the same time, it retains the distinguishing ability and semantic affinity . This article has learned a visual and semantic module , Mapping the features of two forms to a shared space , In this space , Matching feature pairs must distinguish those that do not match , At the same time, similar feature pairs should be as close as possible .
Paper notes
①、 Object categories are usually semantically related , This connection is more subtle for the relationship between objects ( The first time I read this sentence, I was a little confused , Later, I read the examples given in the following article ).<person,ride,horse> and <person,ride,elephant> The image features should be similar ( People ride an animal ), and <person,ride,horse> and <person,walk with,horse> Although they have the same subject and object , But the image features are completely different . Here we are talking about relationship recognition with object,subject On condition that , however object recognition Independent of relationship .
②、Visual Module The design of is mainly intended to object and subject Independent of relationship Space , At the same time, it involves object and subject Of relationship It also contains the characteristics of these two objects .
③、 Network structure

The main design idea is to make <object,subject> Independent of relationship Study , but relationship And <object,subject> There's a big connection , So in relationship Every step of the branch is fused object and subject Information . That is, you want to learn the mapping of visual features to two independent semantic spaces ( Objects and relationships ).
④、Semantic Module
The purpose of this module is to map word vectors into an embedded space , This embedded space is more different than the original word vector space , While maintaining semantic similarity . As object / It is important that relational tags provide a good word vector representation , Because it provides an appropriate initialization that is easy to tune . About word vector The choice of , Initial use Pretrained word2vec embeddings, And then use Relationship-level co-occurrence embeddings To deal with , Maximize P (P |S, O) 、 P (S|P, O) and P(O|S, P), Is to maximize the basis of <object,predict,subject> Two of them determine the distribution of the other .
边栏推荐
- Statistical learning method (4/22) naive Bayes
- Basic use of Sqlalchemy
- Callback function of unity after importing resources
- What is the reason why easycvr can't watch the device video when it is connected to the home protocol?
- How to encrypt anti copy program
- How to choose source code encryption software
- [solution] longest common subsequence
- Design and development of VB mine sweeping game
- The function of Schottky diode in preventing reverse connection of power supply
- Using autogluon to forecast house price
猜你喜欢

Easycvr service private What should I do if the PEM file is emptied and cannot be started normally?

Statistical learning method (2/22) perceptron

统计学习方法(3/22)K近邻法

Battle drag method 1: moderately optimistic and build self-confidence (2)

Callback function of unity after importing resources

免疫组化和免疫组学之间的区别是啥?

多维分析预汇总应该怎样做才管用?

Server antivirus

Docker中安装Oracle数据库

4276. 擅长C
随机推荐
Kuboardv3与监控套件安装
Exclusive analysis | about resume and interview
NOIP2006-2018 提高组 初赛试题完善程序题 CSP-S 2019 2020 初赛试题完善程序题
致我们曾经遇到过的接口问题
独家分析 | 软件测试关于简历和面试的真实情况
Magic Quadrant of motianlun's 2021 China Database
Fibonacci sequence
【Proteus仿真】4x4矩阵键盘中断方式扫描 +数码管显示
Rasa dialogue robot helpdesk (V)
Advanced Installer Architect创作工具
Installing Oracle database in docker
Sword finger offer 16 Integer power of numeric value
TypeScript(6)函数
0和1的歧义问题
How many locks are added to an update statement? Take you to understand the underlying principles
Basic use of Sqlalchemy
TypeScript(7)泛型
一种全面屏手势适配方案
be based on. NETCORE development blog project starblog - (13) add friendship link function
Vulnerability mining | routine in password retrieval