[Point Cloud Processing Paper Reading, Frontier Edition 13] GAPNet: Graph Attention based Point Neural Network for Exploiting Local Feature of Point Cloud

2022-07-03 09:14:00 【LingbinBu】

GAPNet: Graph Attention based Point Neural Network for Exploiting Local Feature of Point Cloud

Abstract
Method
- GAPLayer
- Attention pooling layer
GAPNet architecture
Experiments
- Classification
- Semantic part segmentation

Abstract

Method: This paper presents GAPNet, a new neural network for point clouds that learns local geometric representations by embedding a graph attention mechanism within stacked Multi-Layer-Perceptron (MLP) layers.
1. It introduces the GAPLayer, which learns an attention feature for each point by assigning different attention weights to the points in its neighborhood.
2. It uses a multi-head mechanism so that the GAPLayer can aggregate different features from independent heads.
3. It proposes an attention pooling layer over the neighborhood to obtain a local signature, which improves the robustness of the network.
Code: TensorFlow version

Method

Let $X=\left\{x_{i} \in \mathbb{R}^{F}, i=1,2, \ldots, N\right\}$ denote the input point cloud. In this paper $F = 3$, representing the coordinates $(x, y, z)$.

GAPLayer

Local structure representation

Since point clouds in real applications can contain a very large number of points, a $k$-nearest-neighbor search is used to construct a directed graph $G = (V, E)$, where $V=\{1,2, \ldots, N\}$ denotes the nodes, $E \subseteq V \times N_{i}$ the edges, and $N_{i}$ the set of neighbors of point $x_{i}$. The edge feature is defined as $y_{i j}=\left(x_{i}-x_{i j}\right)$, where $i \in V$, $j \in N_{i}$, and $x_{i j}$ denotes the neighboring point $x_{j}$ of $x_{i}$.
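The graph construction above can be sketched in a few lines of NumPy. This is only an illustration, not the authors' code: the helper names `knn_indices` and `edge_features` are hypothetical, the search is brute force, and excluding each point from its own neighborhood is an assumption of this sketch.

```python
import numpy as np

def knn_indices(points, k):
    """Return the indices of the k nearest neighbors of each point, shape (N, k)."""
    # Pairwise squared Euclidean distances, shape (N, N).
    diff = points[:, None, :] - points[None, :, :]
    dist2 = np.sum(diff ** 2, axis=-1)
    # Assumption of this sketch: a point is not its own neighbor.
    np.fill_diagonal(dist2, np.inf)
    # Sort each row by distance and keep the k closest points.
    return np.argsort(dist2, axis=1)[:, :k]

def edge_features(points, neighbors):
    """Edge features y_ij = x_i - x_ij, shape (N, k, F)."""
    return points[:, None, :] - points[neighbors]

rng = np.random.default_rng(0)
X = rng.normal(size=(8, 3))   # N = 8 points, F = 3 coordinates
N_idx = knn_indices(X, k=3)   # neighborhoods N_i
Y = edge_features(X, N_idx)   # edge features y_ij
print(N_idx.shape, Y.shape)   # (8, 3) (8, 3, 3)
```

The brute-force distance matrix costs $O(N^2)$ memory, so for large point clouds a spatial index would normally replace it.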

Single-head GAPLayer

The structure of the single-head GAPLayer is shown in Figure 2(b).

To assign attention to each neighbor, the authors propose a self-attention mechanism and a neighboring-attention mechanism, which together yield the attention coefficient of each point with respect to its neighbors, as shown in Figure 1. Specifically, the self-attention mechanism learns self-coefficients by considering the self-geometric information of each point, while the neighboring-attention mechanism learns local-coefficients by considering its neighborhood.

As an initialization step, the vertices and edges of the point cloud are mapped to higher-dimensional features with output dimension $F^{\prime}$:
$\begin{aligned} x_{i}^{\prime} &=h\left(x_{i}, \theta\right) \\ y_{i j}^{\prime} &=h\left(y_{i j}, \theta\right) \end{aligned}$
where $h()$ is a parameterized nonlinear function, chosen in the experiments to be a single-layer neural network, and $\theta$ is the set of learnable parameters of the filter.

The final attention coefficients are obtained by fusing the self-coefficients $h\left(x_{i}^{\prime}, \theta\right)$ and the local-coefficients $h\left(y_{i j}^{\prime}, \theta\right)$, where $h\left(x_{i}^{\prime}, \theta\right)$ and $h\left(y_{i j}^{\prime}, \theta\right)$ are single-layer neural networks with 1-dimensional output and LeakyReLU() is the activation function:
$c_{i j}=\operatorname{LeakyReLU}\left(h\left(x_{i}^{\prime}, \theta\right)+h\left(y_{i j}^{\prime}, \theta\right)\right)$

These coefficients are normalized over the neighborhood with softmax:
$\alpha_{i j}=\frac{\exp \left(c_{i j}\right)}{\sum_{k \in N_{i}} \exp \left(c_{i k}\right)}$
The goal of the single-head GAPLayer is to compute a contextual attention feature for each point. To this end, the normalized coefficients are used to update the vertex feature $\hat{x}_{i} \in \mathbb{R}^{F^{\prime}}$:
$\hat{x}_{i}=f\left(\sum_{j \in N_{i}} \alpha_{i j} y_{i j}^{\prime}\right)$
where $f()$ is a nonlinear activation function; ReLU is used in the experiments.
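A minimal NumPy sketch of one single-head GAPLayer forward pass, following the four equations above, may help make the shapes concrete. It is not the authors' TensorFlow implementation. Several details are assumptions of this sketch: each use of $h$ gets its own randomly initialized weights, the feature-mapping layers use ReLU as their nonlinearity, the two 1-dimensional score layers are kept as plain affine maps so that LeakyReLU is not a no-op, and the LeakyReLU slope is 0.2. All function and dictionary key names are hypothetical.

```python
import numpy as np

def affine(x, w, b):
    return x @ w + b

def relu(x):
    return np.maximum(x, 0.0)

def leaky_relu(x, slope=0.2):
    return np.where(x > 0, x, slope * x)

def softmax(x, axis):
    x = x - x.max(axis=axis, keepdims=True)  # subtract the max for stability
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def single_head_gap(x, y, p):
    """x: (N, F) points, y: (N, k, F) edge features y_ij."""
    x_p = relu(affine(x, *p["vertex"]))           # x'_i,  shape (N, F')
    y_p = relu(affine(y, *p["edge"]))             # y'_ij, shape (N, k, F')
    self_c = affine(x_p, *p["self"])              # self-coefficients,  (N, 1)
    local_c = affine(y_p, *p["local"])            # local-coefficients, (N, k, 1)
    c = leaky_relu(self_c[:, None, :] + local_c)  # c_ij, (N, k, 1)
    alpha = softmax(c, axis=1)                    # normalize over neighbors j
    x_hat = relu(np.sum(alpha * y_p, axis=1))     # attention feature, (N, F')
    return x_hat, y_p, alpha

rng = np.random.default_rng(0)
N, k, F, F_out = 6, 4, 3, 16

def init(n_in, n_out):
    # Random weights and zero bias, purely for illustration.
    return rng.normal(scale=0.1, size=(n_in, n_out)), np.zeros(n_out)

params = {"vertex": init(F, F_out), "edge": init(F, F_out),
          "self": init(F_out, 1), "local": init(F_out, 1)}
x = rng.normal(size=(N, F))
y = rng.normal(size=(N, k, F))   # stand-in for edge features x_i - x_ij
x_hat, y_p, alpha = single_head_gap(x, y, params)
print(x_hat.shape, y_p.shape, alpha.shape)  # (6, 16) (6, 4, 16) (6, 4, 1)
```

The function also returns $y_{ij}^{\prime}$ (the graph feature) because the later multi-head and pooling stages consume it alongside the attention feature.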

Multi-head mechanism

To obtain sufficient structural information and stabilize the network, $M$ independent single-head GAPLayers are concatenated, producing multi-attention features with $M \times F^{\prime}$ channels:
$\hat{x}_{i}^{\prime}=\|_{m}^{M} \hat{x}_{i}^{(m)}$
As shown in Figure 2, the outputs of the multi-head GAPLayer are the multi-attention features and the multi-graph features. $\hat{x}_{i}^{(m)}$ is the attention feature of the $m$-th head, $M$ is the number of heads, and $\|$ denotes concatenation along the feature channels.
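The concatenation itself is a one-liner. The sketch below uses random arrays as stand-ins for the outputs of $M$ independent heads, so the variable names and sizes are illustrative assumptions only; it just checks that $M$ heads of width $F^{\prime}$ give $M \times F^{\prime}$ channels.

```python
import numpy as np

M, N, k, F_out = 4, 6, 5, 16
rng = np.random.default_rng(0)

# Stand-ins for the outputs of M independent single-head GAPLayers.
head_attention = [rng.normal(size=(N, F_out)) for _ in range(M)]   # x_hat_i^(m)
head_graph = [rng.normal(size=(N, k, F_out)) for _ in range(M)]    # y'_ij^(m)

# The || operator: concatenate the heads along the feature channel.
multi_attention = np.concatenate(head_attention, axis=-1)  # (N, M * F')
multi_graph = np.concatenate(head_graph, axis=-1)          # (N, k, M * F')
print(multi_attention.shape, multi_graph.shape)  # (6, 64) (6, 5, 64)
```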

Attention pooling layer

To improve the stability and performance of the network, an attention pooling layer is defined over the neighboring channel of the multi-graph features:
$Y_{i}=\|_{m}^{M} \max _{j \in N_{i}} y_{i j}^{\prime(m)}$
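As a small worked example of this formula, the sketch below takes the per-head graph features, applies a max over the neighbors of each point, and concatenates the results across heads. The function name `attention_pooling` and the toy numbers are made up for illustration.

```python
import numpy as np

def attention_pooling(head_graph_features):
    """head_graph_features: list of M arrays of shape (N, k, F').

    Returns the local signature Y of shape (N, M * F').
    """
    # Max over the neighbor axis (j in N_i), separately for each head.
    pooled = [y.max(axis=1) for y in head_graph_features]
    # Concatenate the M pooled features along the channel axis.
    return np.concatenate(pooled, axis=-1)

# Toy input: M = 2 heads, N = 1 point, k = 2 neighbors, F' = 2 channels.
y_head1 = np.array([[[1, 5], [3, 2]]])
y_head2 = np.array([[[0, -1], [-2, 4]]])
Y_sig = attention_pooling([y_head1, y_head2])
print(Y_sig)  # [[3 5 0 4]]
```

Because the max is taken per channel, the result does not depend on the order in which the neighbors are listed.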

GAPNet architecture

This architecture differs from PointNet in three ways:

1. It uses an attention-aware spatial transform network to make the point cloud invariant to certain transformations.
2. It does not process points individually; instead, it extracts local features.
3. It uses the attention pooling layer to obtain a local signature, which is connected to the intermediate layers to obtain the global descriptor.