当前位置:网站首页>20220525 RCNN--->Faster RCNN
20220525 RCNN--->Faster RCNN
2022-06-12 07:44:00 【GAOSHIQI5322688】
1、RCNN (region CNN). A pioneer in the field of object detection .CNN The Internet
1> Candidate area generation
Use selective search The traditional way , First split the picture , Merge areas with a high probability of containing the same object , Normalize , Get a fixed size image .
2>cnn feature extraction
Characteristic drawing reeler
3> svm classifier .
Linear two classifiers for classification , Difficult sample mining to balance the imbalance of positive and negative samples .
4> Location refinement
The regressor returns to the target area
2、Fast RCNN End to end 、 be based on VGG16、 Fast
improvement :
1> Shared winder
Put the whole picture into the winder network , Or use it selective search The way , But the amount of calculation is reduced .
2>Roi Pooling
Feature pooling , Arbitrary scale transformation , Any size picture input .
3> Multitasking loss
Classification and regression are trained together , Use softmax The function classification
3、faster rcnn. Put forward rpn Extract candidate box network , utilize anchor
function :
1> Feature extraction network
vggnet
2>RPN
1>>anchor Generate
Each point of the feature map corresponds to 9 individual anchor, Corresponding to the original image, it can basically cover all objects
2>>RPN Winder network
Use 1*1 The winder gets each... In the characteristic graph anchor Prediction score and prediction offset value of
3>>RPN loss
Only during training , Will all anchors Match the label , A good match anchors Positive sample , The opposite is a negative sample , Get the classification and offset true values , And the predicted score in part II 、 Offset values do loss Calculation
4>> Generate proposal
The value predicted in the second part after using the loss calculation , Select the better proposal, Into the network
5>> Screening ROI ( Region of interest )
3> ROI Pooling
The essential , Accept the characteristic diagram and ROI, Output to RCNN. because ROI Different feature sizes , Different dimensions , Unable to send to the fully connected network , So use feature pooling , Fixed dimension .
4>RCNN
1>> take roi Connect to the fully connected network , Input rcnn Prediction score and prediction offset
2>> Calculation rcnn Truth value
3>>Rcnn Loss
Classification and regression input dimensions 21 and 84
Fasterrcnn It is a two order algorithm , namely RPN\RCNN , It is necessary to calculate the loss , The former needs to provide the latter with regions of interest .
RPN Output anchor It's the forecast , anchor With the label iou\ Offset For the truth .
RPN Loss calculation :
Predicted value and true value , Calculate the loss . Including classification and regression .
classification : Just distinguish the background 、 prospects , Two classification , Cross entropy loss . Incoming score .
Return to : The offset and truth values are large , Use 1 Order loss function , Easy to converge .
nms:
stay RPN Step four , You will get more than 10000 points anchors , But there will be multiple overlaps anchors, use nms Remove the overlapping box ( Just remove the overlapping boxes ), According to the score, select the front 2000 As the final proposal
Screening proposal obtain roi:
utilize proposal And label iou Calculation , elect 256 individual roi.
边栏推荐
- R语言使用epiDisplay包的summ函数计算dataframe中指定变量在不同分组变量下的描述性统计汇总信息并可视化有序点图、使用dot.col参数设置不同分组数据点的颜色
- Continuous local training for better initialization of Federated models
- Arrangement of statistical learning knowledge points -- maximum likelihood estimation (MLE) and maximum a posteriori probability (map)
- Question bank and answers of special operation certificate examination for safety management personnel of hazardous chemical business units in 2022
- In depth learning - overview of image classification related models
- Topic 1 Single_Cell_analysis(4)
- 鸿蒙os-第一次培训
- Golang quickly generates model and queryset of database tables
- tmux 和 vim 的快捷键修改
- AcWing——4269校庆
猜你喜欢

2022 G3 boiler water treatment recurrent training question bank and answers

Seeking for a new situation and promoting development, the head goose effect of Guilin's green digital economy

Voice assistant - potential skills and uncalled call technique mining

Summary of semantic segmentation learning (II) -- UNET network

鸿蒙os-第一次培训

经典论文回顾:Palette-based Photo Recoloring

Missing getting in online continuous learning with neuron calibration thesis analysis + code reading

最新hbuilderX编辑uni-app项目运行于夜神模拟器

Summary of semantic segmentation learning (I) -- basic concepts

Voice assistant - Multi round conversation (process implementation)
随机推荐
Federated meta learning with fast convergence and effective communication
Summary of machine learning + pattern recognition learning (VI) -- feature selection and feature extraction
Arrangement of statistical learning knowledge points -- maximum likelihood estimation (MLE) and maximum a posteriori probability (map)
Voice assistant - those classification models used in the assistant
二、八、十、十六进制相互转换
R语言glm函数构建泊松回归模型(possion)、epiDisplay包的poisgof函数对拟合的泊松回归模型进行拟合优度检验、即模型拟合的效果、验证模型是否有过度离散overdispersion
Chapter 4 - key management and distribution
Fcpx plug-in: simple line outgoing text title introduction animation call outs with photo placeholders for fcpx
The first demand in my life - batch uploading of Excel data to the database
The R language converts the data of the specified data column in the dataframe data from decimal to percentage representation, and the data to percentage
Personalized federated learning with Moreau envelopes
谋新局、促发展,桂林绿色数字经济的头雁效应
R语言将dataframe数据中指定数据列的数据从小数转化为百分比表示、数据转换为百分数
Meter Reading Instrument(MRI) Remote Terminal Unit electric gas water
ECMAScript6面试题
Voice assistant -- Qu -- query error correction and rewriting
20220526 损失函数
最新hbuilderX编辑uni-app项目运行于夜神模拟器
The Poisson regression model (posion) is constructed by GLM function of R language, and the poisgof function of epidisplay package is used to test the goodness of fit of the fitted Poisson regression
R语言dplyr包mutate_at函数和one_of函数将dataframe数据中指定数据列(通过向量指定)的数据类型转化为因子类型