当前位置:网站首页>Asgnet paper and code interpretation 2
Asgnet paper and code interpretation 2
2022-07-01 03:29:00 【It's seventh uncle】
Address of thesis :Adaptive Prototype Learning and Allocation for Few-Shot Segmentation
Paper code :ASGNet
Abstract
Prototype learning is widely used in small sample segmentation . Usually , By averaging the global object information , From supporting features (support feature) Get a single prototype in . However , Using a prototype to represent all the information may lead to ambiguity . In this paper , We propose two new modules : Super pixel guided clustering (SGC) And guided prototype allocation (GPA), Used for extraction and allocation of various prototypes . say concretely ,SGC It is a method without parameters and training , It extracts more representative prototypes by aggregating similar feature vectors , and GPA Be able to select matching prototypes to provide more accurate guidance . By way of SGC and GPA Bind together , We propose an adaptive super-pixel guidance network (ASGNet), This is a lightweight model , Able to adapt to changes in object size and shape . Besides , Our network can be easily extended to k-shot Division , There are significant improvements and no additional computational costs . special , We use COCO Data set evaluation shows ,ASGNet stay 5-shot The accuracy of segmentation is higher than that of the most advanced methods 5%.
Existing problems and solutions Introduction
current Few-Shot Segmentation networks usually extract features from query images and support images , Then different feature matching methods and target mask transmission methods from support image to query image are proposed . Feature matching and mask passing usually use prototype feature learning technology . Prototype learning technology will support the mask target object of image (masked object features) Compressed into one or several prototype eigenvectors . then , Search the pixel position of similar features in the query image to segment the target .
- One of the main advantages of prototype learning is that prototype features are more robust to noise than pixel features . However ,
Prototype features inevitably lose spatial information, This is very important when the appearance of objects that support images and query images are quite different . Besides , Most prototype learning networks only generate a single prototype through mask average pooling, thus losing the ability to distinguish information .
In this work , We propose a new prototype learning technology , To address some of the existing major shortcomings . especially , We want to adaptively change the number of prototypes and their spatial range according to the image content , Make the prototype have the ability of content adaptation and spatial awareness . This adaptive multi prototype strategy is very important to deal with the huge changes of object size and shape in different images . Intuitively , When an object occupies a large part of the image , It carries more information , So more prototypes are needed to represent all the necessary information . contrary , If the object is small , The proportion of background is relatively large , Then one or more prototypes are enough . Besides , We want the support area of each prototype ( The scope of space ) It can adapt to the object information appearing in the supporting image . say concretely , Our goal is to divide the supporting features into several representative regions according to feature similarity . We also hope to be able to adaptively select more important prototypes to find more similar features in query images . Different object parts may appear in different image regions and different query images , Therefore, we hope to dynamically allocate different prototypes in the query image for feature matching . for example , Some parts of the object may be occluded in the query image , We want to dynamically select the prototype corresponding to the visible part of the query image .
We use adaptive super pixels to guide the network (ASGNet) To achieve this adaptation 、 Multi archetypal learning and distribution ,ASGNet Use super pixels to adapt to the number of prototypes and support areas . Specially , We propose the composition ASGNet Two modules of the core : Super pixel guided clustering (SGC) And guided prototype allocation (GPA).
- SGC The module carries out feature-based super-pixel fast extraction for supporting images , Got super image plain quality heart do by primary type , sign \color{red}{ Super pixel centroid as prototype feature } super image plain quality heart do by primary type , sign . The shape and number of super pixels are adaptive to the image content , Therefore, the generated prototype also becomes adaptive .
- GPA The module uses a mechanism similar to attention to branch with Big many Count phase Turn off Of the a primary type , sign \color{red}{ Assign most relevant supporting prototype features } branch with Big many Count phase Turn off Of the a primary type , sign .
in summary ,SGC The module provides adaptive prototype learning in terms of the number of prototypes and their spatial expansion ,GPA The module provides adaptive allocation of learned prototypes when dealing with query features . These two modules make ASGNet Highly flexible and adaptable to variable object shapes and sizes , Allow it to better generalize invisible object classes .
Proposed Method
In this part , We first introduce two prototype generation and matching modules , That is, the super-pixel guided clustering module (SGC) And guide prototype allocation module (GPA). then , We discuss the adaptive ability of these two modules . then , We introduced the whole network architecture , It is called adaptive super pixel guidance network (ASGNet), It will SGC and GPA Modules are integrated in one model . The overall structure is shown in the figure 2 Shown . Last , We explained ASGNet Medium k-shot Set up .
Superpixel-guided Clustering( Super pixel guided clustering )
SGC The core idea of super pixel sampling network (SSN)[13] and MaskSLIC[12] Inspired by the .SSN Is the first end-to-end trainable depth network for super pixel segmentation .SSN The key contribution of is to SLIC[1] The nearest neighbor operation in is transformed into differentiable operation . Conventional SLIC The super-pixel algorithm uses k Mean iterative clustering , In two steps : Pixel super pixel Association and super pixel centroid update . Based on color similarity and proximity , Assign pixels to different superpixel centroids . To be specific , The input image I∈ Rn×5 Usually located with n A five-dimensional space of pixels (labxy), among lab Express CIELAB Pixel vector in color space ,xy Indicates the pixel position . After iterative clustering , The algorithm outputs the correlation graph , Each of these pixels n Assigned to m One of the super pixels .
This simple method inspired us with a profound idea , That is, the feature map is aggregated into multiple super pixel centroids by clustering , Here the super pixel centroid can be used as a prototype . therefore , We do not calculate the super pixel centroid in the image space , Instead, it is estimated by clustering similar feature vectors , Classify in feature space . Algorithm 1 Describe the whole SGC The process :
边栏推荐
- 几行事务代码,让我赔了16万
- Nacos
- LeetCode 31下一个排列、LeetCode 64最小路径和、LeetCode 62不同路径、LeetCode 78子集、LeetCode 33搜索旋转排序数组(修改二分法)
- 数据交换 JSON
- C语言多线程编程入门学习笔记
- How to use hybrid format to output ISO files? isohybrid:command not found
- How do spark tasks of 10W workers run? (Distributed Computing)
- Clion and C language
- 第03章_用戶與權限管理
- 手把手带你了解一块电路板,从设计到制作(干货)
猜你喜欢

Best used trust automation script (shell)

Redis tutorial
![[nine day training] content III of the problem solution of leetcode question brushing Report](/img/7e/1e76181e56ef7feb083f9662df71c7.jpg)
[nine day training] content III of the problem solution of leetcode question brushing Report

实现pow(x,n)函数

POI exports excel and displays hierarchically according to parent-child nodes

BluePrism注册下载并安装-RPA第一章

Research on target recognition and tracking based on 3D laser point cloud

Chapitre 03 Bar _ Gestion des utilisateurs et des droits

How to achieve 0 error (s) and 0 warning (s) in keil5

ctfshow爆破wp
随机推荐
倍福TwinCAT3 Ads相关错误详细列表
Pyramid Scene Parsing Network【PSPNet】论文阅读
雪崩问题以及sentinel的使用
Common interview questions for performance test
split(),splice(),slice()傻傻分不清楚?
家居网购项目
Introduction to core functions of webrtc -- an article on understanding SDP PlanB unifiedplan (migrating from PlanB to unifiedplan)
How to verify whether the contents of two files are the same
Detailed explanation of ES6 deconstruction grammar
ES6解构语法详解
JUC learning
后台系统页面左边菜单按钮和右边内容的处理,后台系统页面出现双滚动
5、【WebGIS实战】软件操作篇——服务发布及权限管理
Promise中finally的用法
别再说不会解决 “跨域“ 问题啦
pytorch训练深度学习网络设置cuda指定的GPU可见
Hal library setting STM32 interrupt
[深度学习]激活函数(Sigmoid等)、前向传播、反向传播和梯度优化;optimizer.zero_grad(), loss.backward(), optimizer.step()的作用及原理
C language EXECL function
监听器 Listener