当前位置:网站首页>No manual prior is required! HKU & Tongji & lunarai & Kuangshi proposed self supervised visual representation learning based on semantic grouping, which significantly improved the tasks of target dete
No manual prior is required! HKU & Tongji & lunarai & Kuangshi proposed self supervised visual representation learning based on semantic grouping, which significantly improved the tasks of target dete
2022-06-26 17:41:00 【Zhiyuan community】
This article shares papers 『Self-Supervised Visual Representation Learning with Semantic Grouping』, No manual priors are required ! HKU & Tongji &LunarAI& Open vision ( Zhangxiangyu team ) Self supervised visual representation learning based on semantic grouping is proposed , Significantly improve target detection 、 Instance segmentation and semantic segmentation tasks !

Thesis link :
https://arxiv.org/abs/2205.15288
Abstract
In this paper , The author solves the problem of learning visual representation from unlabeled scene centric data . Existing work has demonstrated the potential of utilizing complex underlying structures in scenario centric data ; For all that , They usually rely on hand-made object priors or special excuse tasks to build learning frameworks , This could undermine universality .
contrary , The author proposes to carry out contrastive learning from data-driven semantic slots , namely SlotCon, For joint semantic grouping and representation learning . Semantic grouping is performed by assigning pixels to a set of learnable prototypes , These prototypes can be adapted to each sample by concentrating features , And form new grooves (slot). Slots based on the learned data , Use comparative goals to express learning , Enhanced feature resolution , This in turn facilitates the grouping of semantically related pixels .
Compared with previous work , By optimizing semantic grouping and contrastive learning at the same time , The method in this paper bypasses the shortcoming of manual prior , Be able to learn objects from scene centric images / Group level representation . Experiments show that , This method can effectively decompose complex scenes into semantic groups for feature learning , And for downstream tasks ( Including target detection 、 Instance segmentation and semantic segmentation ) It's obviously helpful .

边栏推荐
- Microservice architecture practice: user login and account switching design, order query design of the mall
- The high concurrency system is easy to play, and Alibaba's new 100 million level concurrent design quick notes are really fragrant
- Cloud native 02: Alibaba cloud cloud efficient flow pipeline
- map和filter方法对于稀缺数组的处理
- Leetcode daily [2022 - 02 - 16]
- 14《MySQL 教程》INSERT 插入数据
- In those years, interview the abused red and black trees
- #25class的类继承
- Over the weekend: 20000 words! Summary of JVM core knowledge, 18 serial cannons as a gift
- COMP5216 Mobile Computing Assignment 1 - Extending ToDoList app
猜你喜欢
![[recommendation system learning] recommendation system architecture](/img/a8/448f6e708227555bb6b32cdc652435.png)
[recommendation system learning] recommendation system architecture

The latest masterpiece of Alibaba, which took 182 days to produce 1015 pages of distributed full stack manual, is so delicious

Troubleshooting ideas that can solve 80% of faults!

背包问题求方案数

SIGIR 2022 | 港大等提出超图对比学习在推荐系统中的应用

Play with Linux and easily install and configure MySQL

Number of solutions for knapsack problem

Live broadcast preview | how can programmers improve R & D efficiency? On the evening of June 21, the video number and station B will broadcast live at the same time. See you or leave!

Daily record 2

Turtle cartography
随机推荐
Redis' 43 serial cannons, try how many you can carry
Romance of the Three Kingdoms: responsibility chain model
sparksql如何通过日期返回具体周几-dayofweek函数
玩转Linux,轻松安装配置MySQL
What is the difference between digital collections and NFT
vue--vuerouter缓存路由组件
一起备战蓝桥杯与CCF-CSP之大模拟炉石传说
Daily record 2
类型多样的石膏PBR多通道贴图素材,速来收藏!
Basic requirements: 7 problems in singleton mode
Alibaba's "high concurrency" tutorial "basic + actual combat + source code + interview + Architecture" is a god class
【uniapp】uniapp手机端使用uni.navigateBack失效问题解决
Uncover the secret of Agora lipsync Technology: driving portraits to simulate human speech through real-time voice
【推荐系统学习】推荐系统的技术栈
js强制转换
QPushButton 样式使用示例(以及按钮setmenu添加下拉菜单的方法)
Fire evacuation and self rescue... This safety production and fire training is full!
Redis and database data consistency
Number of solutions for knapsack problem
Viewing the task arrangement ability of monorepo tool from turborepo