当前位置:网站首页>Semantic segmentation | learning record (1) semantic segmentation Preface
Semantic segmentation | learning record (1) semantic segmentation Preface
2022-07-08 02:09:00 【coder_ sure】
Semantic segmentation | Learning record (1) Semantic segmentation Preface
Tips : come from up Lord thunderbolt Wz, I'm just taking study notes , Original video
List of articles
- Semantic segmentation | Learning record (1) Semantic segmentation Preface
- Preface
- One 、 What is semantic segmentation ?
- Two 、 Learning Planning
- Two 、 Common dataset formats for semantic segmentation tasks
- 3、 ... and 、 The specific form of the result obtained by semantic segmentation
- Four 、 Semantic segmentation evaluation index
- 5、 ... and 、 Semantic segmentation annotation tool
- 6、 ... and 、 Reference material
Preface
The preface of semantic segmentation mainly introduces the content involved in this paper :
- What is semantic segmentation
- Tentative learning objectives
- Common dataset formats for semantic segmentation tasks
- The specific form of the result obtained by semantic segmentation
- Common evaluation indicators for semantic segmentation
- Semantic segmentation annotation tool
One 、 What is semantic segmentation ?
Semantic segmentation is one of the common segmentation tasks , Common segmentation tasks have the following three aspects :
- Semantic segmentation (semantic segmentation)FCN
- Instance segmentation (Instance segmentation)Mask R-CNN
- Panoramic segmentation (Panoramic segmentation) Panoptic FPN


Panoramic segmentation is not only to distinguish the background and foreground , Moreover, the background should be classified and segmented in some columns .
The difficulty of the above three segmentation tasks increases in turn .
Two 、 Learning Planning
Several semantic segmentation algorithm source code introduction 
Two 、 Common dataset formats for semantic segmentation tasks
1.PASCAL VOC

PASCAL VOC What is provided in semantic segmentation is actually a PNG picture , In this PNG The file records the category of each pixel , there PNG Pictures are stored in palette format ( The original picture is a 1 Grayscale image of the channel ), The corresponding pixel value is mapped to the corresponding color value . such as :
- Pixels 0 The corresponding is (0,0,0) black
- Pixels 1 The corresponding is (127,0,0) Deep red
- Pixels 255 The corresponding is (224,224,129)
This 255 It's necessary to explain : When we calculate the loss, we will ignore that the pixel value is 255 These pixels , Because it's hard to say which category the edge of the target strictly belongs to , Including some goals that are not easy to divide , We also have 255 Fill in . such as , The figure above has a quadrilateral , It's actually the tail of an airplane , This segmentation is very difficult , We just ignore it .
2.MS COCO

The feature is that each target is given a polygon , And record the coordinates of each corner of the polygon .
MS COCO Data set introduction and pycocotools Easy to use
3、 ... and 、 The specific form of the result obtained by semantic segmentation

Why not directly display grayscale images , But to turn it into color ?
for instance The plane is a pixel value corresponding to 1, Person correspondence is 15, The difference between them is very big , If in the form of gray , It's hard for us to see the difference .
So we map the pixel value to the color format , also Each pixel value corresponds to the category index .
Four 、 Semantic segmentation evaluation index
Pixel Accuracy(Global Acc): pre measuring just indeed Of image plain individual Count total Of image plain individual Count \frac { Predict the correct number of pixels }{ Total number of pixels } total Of image plain individual Count pre measuring just indeed Of image plain individual Count
Σ i n i i Σ i t i \frac{\Sigma_{i}n_{ii}}{\Sigma_{i}t_{i}} ΣitiΣinii
mean Accuracy: Average the accuracy of each category of pixels
1 n c l s Σ i n i i t i \frac{1}{n_{cls}}\Sigma_{i}\frac{n_{ii}}{t_{i}} ncls1Σitinii
mean IoU: Yes IoU averaging
1 n c l s Σ i n i i t i + Σ j n j i − n i i \frac{1}{n_{cls}}\Sigma_{i}\frac{n_{ii}}{t_{i}+\Sigma_{j}n_{ji}-n_{ii}} ncls1Σiti+Σjnji−niinii
Yan color heavy Stack District Domain Of Noodles product total Noodles product \frac { Area of color overlapping area }{ Total area } total Noodles product Yan color heavy Stack District Domain Of Noodles product 
among :
- n i j n_{ij} nij: Category i Predicted into categories j The number of pixels
- n c l s n_{cls} ncls: Number of target categories ( Include background )
- t i = Σ j n i j : t_{i}=\Sigma_{j}n_{ij}: ti=Σjnij: Target categories i Total number of pixels ( Real label )
Have a deep understanding of this evaluation index








mean acc = 1 5 ∑ ( c l a s s i a c c ) \ =\frac{1}{5}\sum(class_iacc) =51∑(classiacc)

mean IoU = 1 5 ∑ ( c l s i i o u ) \ =\frac{1}{5}\sum(cls_i iou) =51∑(clsiiou)
5、 ... and 、 Semantic segmentation annotation tool
Traditional annotation tools , such as :Labelme
Labelme
A semiautomatic annotation tool : Baidu EISeg
EISeg
6、 ... and 、 Reference material
PASCAL VOC2012 Data set introduction
EISeg Segmentation and annotation software use
边栏推荐
- Analysis ideas after discovering that the on duty equipment is attacked
- nmap工具介紹及常用命令
- 发现值守设备被攻击后分析思路
- Le chemin du poisson et des crevettes
- 日志特征选择汇总(基于天池比赛)
- #797div3 A---C
- MySQL查询为什么没走索引?这篇文章带你全面解析
- 《通信软件开发与应用》课程结业报告
- Why did MySQL query not go to the index? This article will give you a comprehensive analysis
- 咋吃都不胖的朋友,Nature告诉你原因:是基因突变了
猜你喜欢

Reading notes of Clickhouse principle analysis and Application Practice (7)

静态路由配置全面详解,静态路由快速入门指南

快手小程序担保支付php源码封装

MySQL查询为什么没走索引?这篇文章带你全面解析

LeetCode精选200道--链表篇

metasploit

Flutter 3.0框架下的小程序运行

JVM memory and garbage collection -4-string

PB9.0 insert OLE control error repair tool

Nacos microservice gateway component +swagger2 interface generation
随机推荐
C语言-Cmake-CMakeLists.txt教程
Introduction à l'outil nmap et aux commandes communes
Introduction to grpc for cloud native application development
ClickHouse原理解析与应用实践》读书笔记(8)
See how names are added to namespace STD from cmath file
Node JS maintains a long connection
力扣5_876. 链表的中间结点
力扣4_412. Fizz Buzz
Ml self realization /knn/ classification / weightlessness
力扣6_1342. 将数字变成 0 的操作次数
数据链路层及网络层协议要点
Vim 字符串替换
#797div3 A---C
leetcode 873. Length of Longest Fibonacci Subsequence | 873. 最长的斐波那契子序列的长度
The body has a mysterious margin of 8px
Ml self realization / linear regression / multivariable
Cross modal semantic association alignment retrieval - image text matching
Remote Sensing投稿經驗分享
Where to think
MySQL查询为什么没走索引?这篇文章带你全面解析