当前位置:网站首页>Detailed explanation of deeplab series (simple and practical annual summary)
Detailed explanation of deeplab series (simple and practical annual summary)
2022-07-27 12:03:00 【Full stack programmer webmaster】
Hello everyone , I meet you again , I'm your friend, Quan Jun .
1、 Why convolution neural network has translation invariance ?
Invariance can be divided into :
- Translation invariance :Translation Invariance
- rotate / Perspective invariance :Ratation/Viewpoint Invariance
- Scale invariance :Size Invariance
- Illumination invariance :Illumination Invariance
CNN = Convolution + Pooling
The features of the image are translated , So in the process of convolution , Feature extraction is translated accordingly .
During pooling , It returns the maximum or average value in the receptive field , The receptive field still corresponds to the characteristic information of the image .
therefore ,CNN It has translation invariance .
2、 Why is it difficult for convolutional neural network to deal with pixel level classification ?
- Repeated pooling and down sampling lead to a sharp decline in resolution , Location information is lost and difficult to recover
- Spatial invariance leads to the loss of detail information
In fact, convolution and pooling lead to the loss of a lot of image detail location information .
3、deeplab v1 What has been done ?
problem :DCNNs The invariance of is not enough for semantic segmentation .
Method : combination DCNNs And probability graph models , namely DCNNs The last layer of response and conditional random fields solves the segmentation problem .
4、deeplab v2 What has been done ?
problem : Semantic segmentation problem .
Method : Cavity convolution 、 Pyramids pool 、DCNNs+CRF.
1、 Use sampling filter or null Hole convolution (Atrous Convolution) Highlight convolution , It is a powerful tool in intensive prediction tasks . Void convolution is allowed in DCNNs The calculated characteristic response shows the control resolution . At the same time, the filtering receptive field is effectively expanded to mix richer context information without increasing the amount of parameters and computational complexity .
2、 Put forward a Pyramid with empty space pooling(ASPP) Segment the target stably on multiple scales .ASPP A filter with multiple sampling rates and effective field of view is used to detect the incoming convolution feature layer , So as to capture the object and image context of multiple scales .
3、 Propose the location of the target boundary , By combining DCNNs And probability models . Ordinary DCNNs in max-pooling And mining to maintain invariance, but has an impact on positioning accuracy , use DCNNs and CRF Combine to solve the problem of positioning accuracy .
Void convolution : Enhance intensive forecasting 、 Expand the feeling field .
Pyramidal pooling of empty space : Multiscale image representation .( Multi scale feature extraction + Information fusion )
DCNNs+CRF effect : Structural prediction of precise boundaries .
5、deeplab v3 What has been done ?
problem : The accuracy of semantic segmentation .
Method : Ed - Decoding structure , Introduce decoding module ; combination Xception model And depth separable convolution (depthwise separable convolution) and ASPP、 Decoding module .
DeepLabv3, By increasing the Simple and effective decoding module fine segmentation results Especially the object boundary . further , Use Xception model And depth separable convolution (depthwise separable convolution), combination ASPP And the decoding module gets a faster 、 Stronger editing - Decoding network .
6、deeplab Series of work ?
Deeplabv1 Use CRF post-processing , Improve the accuracy of segmentation boundary ;
Deeplabv2 Use void convolution to expand receptive field , Using hole space pyramid pooling to realize multi-scale prediction and context information extraction , At the same time, post-processing is used CRF;
Deeplabv3 Do not use post-treatment , Use of editing - Decoding structure improves segmentation boundary prediction , Use depth to separate convolution and Xception modular .
Publisher : Full stack programmer stack length , Reprint please indicate the source :https://javaforall.cn/128203.html Link to the original text :https://javaforall.cn
边栏推荐
- 【机器学习-白板推导系列】学习笔记---条件随机场
- LNMP architecture setup (deploy discuz Forum)
- Leetcode 01: t1. sum of two numbers; T1108. IP address invalidation; T344. Reverse string
- Finding the finite zero point of transfer function under different sampling periods
- [machine learning whiteboard derivation series] learning notes - probability graph model and exponential family distribution
- compute_ class_ weight() takes 1 positional argument but 3 were given
- Could not load dynamic library ‘libcudnn.so.8‘;
- 二分查找判定树(二分查找树平均查找长度)
- Sword finger offer notes: t58 - I. flip word order
- Firewall firewall
猜你喜欢

Japan Fukushima waste dump safety monitoring agreement will recognize the "safety" of the sea discharge plan

Why is ack=seq+1 when TCP shakes hands three times

解决方案:Can not issue executeUpdate() or executeLargeUpdate() for SELECTs

Strictly control outdoor operation time! Foshan housing and Urban Rural Development Bureau issued a document: strengthening construction safety management during high temperature

图像分割 vs Adobephotoshop(PS)

In the first half of the year, the number of fires decreased by 27.7%. Guangdong will improve the fire safety quality of the whole people in this way

Introduction to box diagram

LNMP architecture setup (deploy discuz Forum)

Keil MDK compilation appears..\user\stm32f10x H (428): error: # 67: expected a "}" wrong solution

MySQL数据库主从复制集群原理概念以及搭建流程
随机推荐
[machine learning whiteboard derivation series] learning notes - support vector machine and principal component analysis
上半年火灾起数下降27.7%,广东将这样提升全民消防安全素质
shell编程之免交互
Proteus8专业版破解后用数码管闪退的解决
EfficientNet
Newton-Raphson迭代法
Sword finger offer notes: t57 - ii Continuous positive sequence with sum s
剑指 Offer 笔记: T45. 把数组排成最小的数
Unexpected harvest of epic distributed resources, from basic to advanced are full of dry goods, big guys are strong!
Interaction free shell programming
kazoo使用教程
你尚未连接代理服务器可能有问题或地址不正确(如何查看代理服务器ip)
The first case of monkeypox in pregnant women in the United States: the newborn was injected with immunoglobulin and was safely born
JS-寄生组合式继承
基于反馈率的控制系统原理
torch‘ has no attribute ‘inference_ mode‘
A possibility that ch340 module cannot be recognized / burned
Greek alphabet reading
Keil MDK编译出现..\USER\stm32f10x.h(428): error: #67: expected a “}“错误的解决办法
Shake quickly to rescue the "frustrated person"