当前位置:网站首页>RGB-D Salient Object Detection withCross-Modality Modulation and Selection
RGB-D Salient Object Detection withCross-Modality Modulation and Selection
2022-06-11 06:51:00 【A Xuan is going to graduate~】
Abstract :
in the light of RGB-D Salient object detection (SOD) problem , An effective method is proposed to Integrate and improve cross channel complementarity . The network mainly solves two challenging problems :1) How to effectively integrate information from RGB Complementary information of the image and its corresponding depth map ,2) How to adaptively select more significantly related features . First , We propose a cross modal eigen modulation (cmFM) modular , Take the depth feature as a priori , modeling RGB-D Data complementarity Relationship , Enhanced feature representation . secondly , We propose adaptive feature selection (AFS) Module to select significantly related features and suppress poor features .AFS The module adopts multi-modal spatial feature fusion , Consider the self modal and cross modal interdependencies of channel characteristics . Third , We use Position edge attention of saliency guidance (sg-PEA) modular , Encourage our network to pay more attention to areas related to saliency . The above modules as a whole , be called cmMS block , It is convenient to refine the salient features in a coarse to fine way . Plus a bottom-up inference , Subtle salient features make SOD Accuracy and edge preservation are possible . A lot of experiments have proved that , Our network is in 6 A popular RGB-D SOD The benchmark is superior to the most advanced saliency detector .
2. introduction
First , We came up with one Cross modal characteristic modulation (cmFM) modular , Take the corresponding depth characteristics as a priori , enhance RGB Characteristic means . This is fused with execution input [30]、 Early integration [19] Or late fusion [18] The popular strategy of , The latter roughly connects or adds multimodal information . The proposed modulation design realizes the effective integration of multimodal information through feature transformation , It clearly simulates the indivisible cross modal relationship , It reduces the interference caused by the internal inconsistency of multimodal data .
secondly , We designed adaptive feature selection (AFS) modular , The importance of different channel characteristics in self mode and cross mode is emphasized , At the same time, multi-modal spatial features are fused by gating . This is different from the past RGB-D SOD Different algorithms , Previous algorithms treat the channel characteristics of different modes equally and independently . Relax these assumptions , So that our method can Adaptively select more significantly related features from spatial features and channel features , And suppress poor features . It can also mitigate the negative effects of improper depth map capture . therefore , Our network has additional flexibility in handling different information . We also passed Introduce significance oriented position edge attention (sg-PEA) modular To emphasize the position and edge of significance , The module collects the attention weights from the predicted saliency map and saliency edge map .
The unique feature of this method is that the feature modulation and attention mechanism can be changed from coarse to fine (coarse-to-fine) In a way that is tightly coupled . say concretely , The fusion First of all cmFM Module execution , To provide a rich feature representation . With our AFS Module coordination , Highlight relevant features to be emphasized , Redundant features are suppressed . Characteristics related to significance are defined by sg-PEA Further refinement of modules .cmFM、AFS and sg-PEA The elaborate design of the module Allows cross modal complementarity through modulation in a coarse to fine fashion 、 Select and refine , Provide precise salient features for our network . Plus bottom-up reasoning , Precise saliency features enable us to perform more accurately 、 More robust SOD.
contribution
An effective method is proposed RGB-D SOD Methods . Effectively integrated cross complementation , Adaptive selection of saliency related features . This is achieved by designing a fusion from coarse to fine , The fusion includes
1) A cross modal feature modulation module , The optimal affine transformation parameters are learned by taking the corresponding depth features as a priori , Flexible multi-level modulation RGB features , enhance RGB Characteristic means .
2) Adaptive feature selection module , While fusing important multi-channel spatial features , Gradually emphasize the importance of self channel features and cross channel features , Effectively capture the relationship between different modes . In six popular RGB-D SOD In benchmarking , This method is always superior to the most advanced SOD Method .
3. Related work
3.1 Significance target detection
The work of this paper :
1) use Depth feature as a priori Learn the optimal affine transformation parameters , Flexible multi-level modulation RGB features ;
2) At the same time, the self modal and cross modal channel characteristics as well as the multi-modal spatial characteristics are considered , Effectively capture the relationship between different patterns .
3.2 Characteristic modulation
suffer FiLM Inspired by the , In this paper, we adjust the multi-level feature representation based on the corresponding depth feature . Besides , A pixel based cross modal feature modulation is designed , Provides fine-grained and fine-grained control of features .
3.3 Attention mechanism
The application of attention mechanism is becoming more and more diversified , Such as space attention 、 Double attention 、 Pay attention to yourself 、 Multi level attention 、 Pay attention to the passage . by comparison , In this paper, the attention mechanism is used in the adaptive feature selection module , While fusing important multi-channel spatial features in a gated manner , The interdependency between self - modal and cross - modal channel characteristics is explored .
边栏推荐
- Flat design, blog website (VIII) code source code
- Redux learning (III) -- using Redux saga, writing middleware functions, and splitting reducer files
- Flutter 内外边距
- 563. slope of binary tree
- Won't virtual DOM be available in 2022? Introduction to virtual Dom and complete implementation of diff and patch
- AppClips&Tips(持续更新)
- Starting from scratch (IV) enemy aircraft flying out of the border disappear automatically
- Luogu p1091 chorus formation (longest ascending subsequence)
- text-overflow失效
- 347. top k high frequency elements
猜你喜欢

UEFI查找PCI设备
![Handwritten promise [05] - exception capture of promise method and optional parameters of then method implementation](/img/e7/87069f921ae003511e32b23653703c.jpg)
Handwritten promise [05] - exception capture of promise method and optional parameters of then method implementation

What are the differences and usages of break and continue?

洛谷P1091合唱队形(最长上升子序列)
![[]==![]](/img/65/ab724c74b080da319ed5c01c93fdb7.png)
[]==![]

Differences between FindIndex and indexof

双周投融报:资本抢滩元宇宙游戏

saltstack部署lnmp

JS two methods to determine whether there are duplicate values in the array
![Resolve typeerror: ctx injections. tableRoot.$ scopedSlots[ctx.props.column.slot] is not a function](/img/29/de95cef86bd75107555b212f01ebb9.jpg)
Resolve typeerror: ctx injections. tableRoot.$ scopedSlots[ctx.props.column.slot] is not a function
随机推荐
538. convert binary search tree to cumulative tree
How to make time planning
动态import
Latex various arrows / arrows with text labels / variable length arrows
News web page display
Warning: Each child in a list should have a unique “key“ prop.
saltstack部署zabbix状态文件编写
Flutter 内外边距
搜狐员工遭遇工资补助诈骗 黑产与灰产有何区别 又要如何溯源?
Reconstruction and preheating of linked list information management system (2) how to write the basic logic using linear discontinuous structure?
Moment time plug-in tips -js (super detailed)
3.1 测试函数的命名规则
563. 二叉树的坡度
Do you know what the quotation for it talent assignment service is? It is recommended that programmers also understand
关于 QtCreator的设计器QtDesigner完全无法正常拽托控件 的解决方法
Do you use typescript or anyscript
JS two methods to determine whether there are duplicate values in the array
[matlab printed character recognition] OCR printed letter + number recognition [including source code 1861]
双周投融报:资本抢滩元宇宙游戏
Starting from scratch (IV) enemy aircraft flying out of the border disappear automatically