Tsinghua, SenseTime, Shanghai AI Lab, and CUHK propose Siamese Image Modeling, combining strong linear probing and dense prediction performance
2022-06-26 17:32:00 [BAAI Community (智源社区)]
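This post covers the paper "Siamese Image Modeling for Self-Supervised Vision Representation Learning", from Tsinghua (Gao Huang's group), SenseTime (Jifeng Dai's group), Shanghai AI Lab, and CUHK. The proposed method, Siamese Image Modeling, performs well on both linear probing and dense prediction.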

Paper link:
http://arxiv.org/abs/2206.01204
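Abstract
Self-supervised learning (SSL) has delivered strong performance on a wide range of downstream vision tasks. Two mainstream SSL frameworks have been proposed: instance discrimination (ID) and masked image modeling (MIM). ID pulls together the representations of different views of the same image. It performs well on linear probing but poorly on detection. MIM, on the other hand, reconstructs the original content from a masked image. It excels at dense prediction but performs poorly on linear probing. Each framework's weakness stems from neglecting one of two representation requirements: semantic alignment or spatial sensitivity.
Specifically, the authors observe that: (1) semantic alignment requires semantically similar views to be projected into nearby representations, which can be achieved by contrasting different views under strong data augmentation; (2) spatial sensitivity requires modeling the local structures within an image. Predicting dense representations from a masked image is therefore beneficial, because it models the conditional distribution of the image content.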
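To make the contrast between the two objectives concrete, the following is a minimal sketch, not the paper's implementation. It assumes a simplified ID loss (one minus the cosine similarity of two view embeddings, with no negative samples) and a simplified MIM loss (mean squared error over masked patches only). The function names and toy inputs are hypothetical and chosen purely for illustration.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity between two non-zero vectors given as lists of floats."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def instance_discrimination_loss(z1, z2):
    """Simplified ID-style loss (assumption: no negatives, no temperature).

    z1 and z2 are embeddings of two augmented views of the same image.
    Minimizing this value pulls the two representations together.
    """
    return 1.0 - cosine_similarity(z1, z2)


def masked_reconstruction_loss(patches, predicted, mask):
    """Simplified MIM-style loss: mean squared error on masked patches only.

    patches and predicted are lists of patches (each a list of floats);
    mask[i] is True when patch i was hidden from the model.
    """
    errors = [
        (p - q) ** 2
        for patch, pred, is_masked in zip(patches, predicted, mask)
        if is_masked
        for p, q in zip(patch, pred)
    ]
    if not errors:
        raise ValueError("mask must select at least one patch")
    return sum(errors) / len(errors)


# Toy inputs (hypothetical values, for illustration only).
aligned_loss = instance_discrimination_loss([1.0, 0.0], [1.0, 0.0])
orthogonal_loss = instance_discrimination_loss([1.0, 0.0], [0.0, 1.0])
mim_loss = masked_reconstruction_loss(
    patches=[[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]],
    predicted=[[0.0, 0.0], [3.0, 4.0], [5.0, 5.0]],
    mask=[True, False, True],
)
print(aligned_loss, orthogonal_loss, mim_loss)
```

In this sketch the ID loss only looks at one global vector per view, which is why it rewards semantic alignment but carries no per-location signal. The MIM loss is computed patch by patch, which makes it sensitive to local structure but gives it no term that aligns different views of the same image.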
