当前位置:网站首页>Target segmentation learning
Target segmentation learning
2022-07-28 22:36:00 【TwilightZrui】
object detection : Pay more attention to semantic level
Target segmentation : Focus on pixel level
Basic process of target detection algorithm


DPM: Additional strategies will be added , The pinnacle of traditional algorithms
Deep learning target detection method :
One-stage(YOLO SSD)
Two-stage(Faster RCNN)
At present, deep learning target detection method is the mainstream
Acquire features through learning methods , Features will be more robust
Proposal Or go straight back to Get the target
When deciding the target , Through deep network
The feature is not robust ,
End to end , The end-to-end step is more convenient to adapt to the model than the multi-step of the traditional target detection algorithm
The basic flow of traditional target detection methods
Many classification : Distinguish the background
After determining the candidate box
NMS: Not very much : Merge candidate boxes
Deep learning objectives
Turn the process of feature extraction into convolutional neural network
Viola-Jones( Face detection )
haar features :
Candidate box : Sliding window extraction
HOG+SVM
For pedestrian detection ( There are differences in pedestrian posture , And for sports )
extract HOG features
HOG features (
A kind of texture feature
Grayscale the features (Hog It is mainly used in gray image (
Conduct Gamma Transformation
Calculate the gradient map( The direction angle of the current pixel : Quantify to 0——360
dimension : Quantitative angle , Degree of ,CELL,
Training SVM classifier
Modeling pedestrians and backgrounds
Support vector
Candidate target extraction ;
Candidate box filtering :SVM
DPM: The pinnacle of traditional target detection algorithms
HOG An extension of
utilize SVM Train to get the gradient of the object
NMS:
Purpose : To eliminate redundant frames , Find the best location for object detection
thought : Select the window with the highest score in those fields , At the same time, suppress those windows with low scores
( fraction : Probability value )
Soft-NMS: There are too many rough judgments of the detection frame by the direct standby threshold . therefore soft
Adjust the score of the detection maniac in the adjacent area, and it is not completely suppressed , Thus, the accuracy rate under the condition of high detection rate is improved .
It can still significantly improve the object detection performance at low retrieval rate .

ROI_POOLINT: Cutout
Two-stage:
RCNN
Fast RCNN
Faster RCNN
Faster RCNN variant 
边栏推荐
- Sword finger offer II 056. Sum of two nodes in a binary search tree (simple binary search tree DFS hash table double pointer iterator)
- The blueprint of flask complements openpyxl
- Concise history of graphic technology
- 软考网络工程师
- Win11怎么打开软件通知
- Necessary for in-depth learning: split the data set, split the labels according to the split pictures, and check the interval of all marked labels
- winServer运维技术栈
- Leetcode question brushing series - sum of majority type
- 成立不到一年!MIT衍生量子计算公司完成900万美元融资
- Why doesn't the icon on the elment plus icon input display
猜你喜欢

98. Verify binary search tree (medium binary search tree DFS)

How about the actual use effect of common source oscilloscope

SQL injection less38 (Stack Injection)

SQL injection less42 (post stack injection)

成立不到一年!MIT衍生量子计算公司完成900万美元融资
![[Ruiji takeout project] Day5 - Chapter 6 mobile verification code login](/img/53/c578e0d1428ea569fb412a20019924.png)
[Ruiji takeout project] Day5 - Chapter 6 mobile verification code login

【CVPR 2021】Cylinder3D:用于LiDAR点云分割的圆柱体非对称3D卷积网络

6K6w5LiA5qyh5pS75Ye75YiG5p6Q
![Paddlenlp is based on ernir3.0 text classification. Take the crime prediction task of cail2018-small dataset as an example [multiple tags]](/img/87/d943cc1e8169bb670414fbf7a322c5.jpg)
Paddlenlp is based on ernir3.0 text classification. Take the crime prediction task of cail2018-small dataset as an example [multiple tags]

Less than a year after its establishment! MIT derivative quantum computing company completed financing of US $9million
随机推荐
[CS231N]Lecture_ 2:Image Classification pipelin
Overall introduction of Ruiji takeout project
Att & CK preliminary understanding
静态成员static详解
成立不到一年!MIT衍生量子计算公司完成900万美元融资
SSH password free login
[leetcode] maximum depth of binary tree
微信小程序里button点击的时候会边框有黑线
容器化配置启动redis集群 单机6节点 3主3从
[LiteratureReview]Object Detection and Mapping with Bounding Box Constraints
PaddleNLP基于ERNIR3.0文本分类以中医疗搜索检索词意图分类(KUAKE-QIC)为例【多分类(单标签)】
ATT&CK初步了解
ECMASript 5/6 笔记
MOV格式是不是静态图像文件格式
微信小程序使用canvas绘图,圆形头像,网络背景图,文字,虚线,直线
The blueprint of flask complements openpyxl
What is time complexity
Sword finger offer II 054. Sum of all values greater than or equal to nodes (medium binary search tree DFS)
Jmeter 安装第三方插件 Plugins Manager
【二叉树】二叉树中的伪回文路径