当前位置:网站首页>Contrastive learning of Class-agnostic Activation Map for Weakly Supervised Object Localization and
Contrastive learning of Class-agnostic Activation Map for Weakly Supervised Object Localization and
2022-07-01 02:32:00 【Rainylt】
One sentence summary : Think of the prospect and background feature Natural similarity is low , And the front of the same texture / background feature High similarity , So directly in feature Then the segmentation head predicts the background score before , And feature Before multiplying and taking out / background feature. In a batch Before inner lowering - background pair The similarity , Improve prospects - Foreground and background - Background similarity , But according to the similarity ranking, reduce the pair Of loss The weight .
In detail , By reducing the dimension of features, we can find
(1) A foreground with a similar texture feature The similarity is high ,
(2) The similarity between foreground and background is low
(3) The background with similar texture has high similarity
therefore , In pre training encoder Extract feature after , Before direct prediction / Background score , And reinforce the above conclusion with comparative loss , So that the former / Background prediction is more accurate
As shown in the figure above , h ( ⋅ ) h(\cdot) h(⋅) yes Preliminary training Of encoder, z j z_j zj It's the extracted feature map , adopt Random initialization Binary segmentation header φ ( ⋅ ) \varphi{(\cdot)} φ(⋅) Get random ( At the beginning of the ) Segmentation results , Such as fraction P
notes : I don't know at this time P Does it represent the prospect or the background , Don't worry about... For the time being
At this point through P or (1-P) And feature Multiply , Can take out random ( At the beginning of the ) front / background feature.
In a batch Inside , Calculate the foreground - background pair The similarity , and Minimize the similarity :
For the same kind ( Same as foreground 、 Background class )feature, First calculate the similarity , ranking , Assign weight according to ranking :
The higher the ranking ( The smaller it is ) Of pair The greater the weight of , The smaller the vice . This is because Not all backgrounds / The prospects are similar , Only those with similar textures are similar .
This means that I hope to begin with feature The above assumption is satisfied , In this way, we can find the similar texture directly through the similarity region, Then by comparing the losses, the score cuts the head to predict the prospect - background pair Small similarity ( This actually requires the segmentation head to accurately segment the foreground / background region?), At the same time, it requires prospects - The foreground similarity is large , In fact, it is a requirement batch Inside cross-image Objects with similar textures are similar .
Counterexample
(1) Suppose a batch Inside all the foreground / The background textures are not similar
(2) Suppose a batch There are only a few sample Before / The background texture is similar
(3) Before hypothesis / There are multiple objects in the background , More complicated , The mixed texture of multiple objects still satisfies the assumption ?
Of course , In the end, this just completes the binary segmentation , For multi class goals , Can only assist CAM To do it :
边栏推荐
- 鼠标悬停效果十
- Thread Detach
- AI edge computing platform - beaglebone AI 64 introduction
- Pytoch -- foundation refers to the north_ II. What high school students can understand [back propagation and gradient descent]
- 鼠标悬停效果二
- Alphabet-Rearrange-Inator 3000(字典树自定义排序)
- 手机edge浏览器无法打开三方应用
- 使用ipmitool配置X86服务器的BMC网络和用户信息
- 股票开账户如何优惠开户?还有,在线开户安全么?
- Calculate special bonus
猜你喜欢
Qu'est - ce que le PMP?
RestCloud ETL WebService数据同步到本地
AI 边缘计算平台 - BeagleBone AI 64 简介
(translation) use eyebrow shaped text to improve Title click through rate
Pycharm 打开远程目录 Remote Host
Comment réaliser la liaison entre la serrure intelligente et la lampe, la scène du moteur de rideau intelligent dans le timing intelligent?
The latest CSDN salary increase technology stack in 2022 overview of APP automated testing
js中的图片预加载
Gartner研究:在中国,混合云的采用已成为主流趋势
SWT/ANR问题--Binder Stuck
随机推荐
My PMP learning test experience
Pytorch —— 基础指北_贰 高中生都能看懂的[反向传播和梯度下降]
Analysis and solution of anr problems
Qu'est - ce que le PMP?
How do I open an account on my mobile phone? Also, is it safe to open an account online?
Alphabet rearrange inator 3000 (dictionary tree custom sorting)
js防抖和节流
I want to know how to open a stock account? Is it safe to open an account online?
Leetcode(524)——通过删除字母匹配到字典里最长单词
522. Longest special sequence II
ANR问题的分析与解决思路
现在开户有优惠吗?另外,手机开户安全么?
鼠标悬停效果八
Zero foundation self-study SQL course | window function
项目管理是什么?
SAP ALV summary is inconsistent with exported excel summary data
如果我在北京,到哪里开户比较好?另外,手机开户安全么?
在国内如何买港股的股?用什么平台安全一些?
The image variables in the Halcon variable window are not displayed, and it is useless to restart the software and the computer
Résumé des styles de développement d'applets Wechat