当前位置:网站首页>20220525 RCNN--->Faster RCNN
20220525 RCNN--->Faster RCNN
2022-06-12 07:44:00 【GAOSHIQI5322688】
1、RCNN (region CNN). A pioneer in the field of object detection .CNN The Internet
1> Candidate area generation
Use selective search The traditional way , First split the picture , Merge areas with a high probability of containing the same object , Normalize , Get a fixed size image .
2>cnn feature extraction
Characteristic drawing reeler
3> svm classifier .
Linear two classifiers for classification , Difficult sample mining to balance the imbalance of positive and negative samples .
4> Location refinement
The regressor returns to the target area
2、Fast RCNN End to end 、 be based on VGG16、 Fast
improvement :
1> Shared winder
Put the whole picture into the winder network , Or use it selective search The way , But the amount of calculation is reduced .
2>Roi Pooling
Feature pooling , Arbitrary scale transformation , Any size picture input .
3> Multitasking loss
Classification and regression are trained together , Use softmax The function classification
3、faster rcnn. Put forward rpn Extract candidate box network , utilize anchor
function :
1> Feature extraction network
vggnet
2>RPN
1>>anchor Generate
Each point of the feature map corresponds to 9 individual anchor, Corresponding to the original image, it can basically cover all objects
2>>RPN Winder network
Use 1*1 The winder gets each... In the characteristic graph anchor Prediction score and prediction offset value of
3>>RPN loss
Only during training , Will all anchors Match the label , A good match anchors Positive sample , The opposite is a negative sample , Get the classification and offset true values , And the predicted score in part II 、 Offset values do loss Calculation
4>> Generate proposal
The value predicted in the second part after using the loss calculation , Select the better proposal, Into the network
5>> Screening ROI ( Region of interest )
3> ROI Pooling
The essential , Accept the characteristic diagram and ROI, Output to RCNN. because ROI Different feature sizes , Different dimensions , Unable to send to the fully connected network , So use feature pooling , Fixed dimension .
4>RCNN
1>> take roi Connect to the fully connected network , Input rcnn Prediction score and prediction offset
2>> Calculation rcnn Truth value
3>>Rcnn Loss
Classification and regression input dimensions 21 and 84
Fasterrcnn It is a two order algorithm , namely RPN\RCNN , It is necessary to calculate the loss , The former needs to provide the latter with regions of interest .
RPN Output anchor It's the forecast , anchor With the label iou\ Offset For the truth .
RPN Loss calculation :
Predicted value and true value , Calculate the loss . Including classification and regression .
classification : Just distinguish the background 、 prospects , Two classification , Cross entropy loss . Incoming score .
Return to : The offset and truth values are large , Use 1 Order loss function , Easy to converge .
nms:
stay RPN Step four , You will get more than 10000 points anchors , But there will be multiple overlaps anchors, use nms Remove the overlapping box ( Just remove the overlapping boxes ), According to the score, select the front 2000 As the final proposal
Screening proposal obtain roi:
utilize proposal And label iou Calculation , elect 256 individual roi.
边栏推荐
- Model deployment learning notes (I)
- Personalized federated learning with Moreau envelopes
- Golang quickly generates model and queryset of database tables
- Topic 1 Single_Cell_analysis(4)
- Fcpx plug-in: simple line outgoing text title introduction animation call outs with photo placeholders for fcpx
- [yolo-v5 learning notes]
- WEB页面性能优化面试题
- 2022 G3 boiler water treatment recurrent training question bank and answers
- 2022r2 mobile pressure vessel filling test question simulation test platform operation
- Voice assistant - overall architecture and design
猜你喜欢

Summary of machine learning + pattern recognition learning (II) -- perceptron and neural network

Some summaries of mathematical modeling competition in 2022

Golang 快速生成数据库表的 model 和 queryset
![[tutorial] deployment process of yolov5 based on tensorflow Lite](/img/d0/c38f27ad76b62b27cdeb68728e9c8c.jpg)
[tutorial] deployment process of yolov5 based on tensorflow Lite

Adaptive personalized federated learning paper interpretation + code analysis

Leverage contextual information

2022r2 mobile pressure vessel filling test question simulation test platform operation

謀新局、促發展,桂林綠色數字經濟的頭雁效應

MySQL index (easy to handle in one article)

vscode 1.68变化与关注点(整理导入语句/实验性新命令中心等)
随机推荐
Velocity autocorrelation function lammps v.s MATALB
Non IID data and continuous learning processes in federated learning: a long road ahead
2022 G3 boiler water treatment recurrent training question bank and answers
L'effet de l'oie sauvage sur l'économie numérique verte de Guilin
Leverage contextual information
Summary of semantic segmentation learning (II) -- UNET network
Seeking for a new situation and promoting development, the head goose effect of Guilin's green digital economy
Principle and application of PWM
ECMAScript6面试题
Topic 1 Single_Cell_analysis(4)
Vs2019 MFC IP address Control Control inherit cipaddressctrl class redessine
AI fanaticism | come to this conference and work together on the new tools of AI!
面试计算机网络-传输层
Summary of semantic segmentation learning (I) -- basic concepts
2D visualization of oil storage, transportation and production, configuration application enables intelligent development of industry
Vscode 1.68 changes and concerns (sorting and importing statements / experimental new command center, etc.)
R语言将dataframe数据中指定数据列的数据从小数转化为百分比表示、数据转换为百分数
VS 2019 MFC 通过ACE引擎连接并访问Access数据库类库封装
In depth learning - overview of image classification related models
最新hbuilderX编辑uni-app项目运行于夜神模拟器