当前位置:网站首页>Cs231n notes (bottom) - applicable to 0 Foundation
Cs231n notes (bottom) - applicable to 0 Foundation
2022-07-05 16:43:00 【Small margin, rush】
Catalog
Transfer learning and object location monitoring
Object segmentation & Semantic segmentation
Transfer learning and object location monitoring
Can target detection be regarded as a regression problem ? The bounding box of multiple objects output for different pictures , May output 4 A bounding box may also output 8 A bounding box , The number of outputs is not fixed , So it can't be regarded as a return problem .
The migration study
In practice , Because few data sets are large enough , So few people choose to train the network from scratch . The common way is : Pre train one on a very large data set CNN, Then use the pre trained network as the initialization network to fine tune 、 Or as a feature extractor .
- Convolutional neural network as a feature extractor . Use in ImageNet Pre trained CNN, Remove the last full connection layer ( namely : The last layer used for classification ), Then use the rest as a feature extractor .
- Fine-tuning CNN. Replace the input layer of the network ( data ), Continue training with new data .
Transfer learning scenarios :
- The new dataset is relatively small , And it is highly similar to the original data set . It is not recommended to CNN Conduct Fine-tune, It is recommended to use pre trained CNN As a feature extractor , Then train a linear classifier .
- The new data set is relatively large , And it is highly similar to the original data set . Because the new data set is large enough , Sure fine-tune The whole network
- The new dataset is smaller , And it is very different from the original data set . Because the data set is very small , So it is best to train a linear classifier . And because the data set is not similar to the original data set , The best way is to train a linear classifier from the shallow output of the pre training network as a feature .
- The new data set is relatively large , And it is very different from the original data set . Because the new data set is big enough , You can retrain the network .
Object positioning & testing
After labeling the picture , Also frame where the object is - In classification and positioning , The number of output boxes is known in advance , And object detection is uncertain , And classification + The difference between positioning tasks is , The number of objects to be detected in object detection is uncertain , Therefore, the regression framework cannot be used directly .
The sliding window -overfeat
Randomly select several windows of different sizes and positions
RCNN
- Pre train a CNN
- Build a training set : First apply Selective Search The algorithm selects 2000~3000 Candidate box .
- Each candidate region is preprocessed , Deliver to CNN Extract image features , Then send the image features to SVM In the classifier , Calculate the loss of label classification . At the same time, the image features are also sent to the regressor , Calculate the offset distance L2 Loss .
- Back propagation training
fast R-CNN
solve R-CNN The problem of slow training prediction , The whole image is CNN feature extraction , Then select the candidate area
- And R-CNN Same use Selective Search Method generation 2000 Multiple candidate boxes
- Input the whole picture directly CNN in , Feature extraction
- Put the 2000 Boxes map to just CNN The last layer extracted feature map On
faster R-CNN:
SSD
SSD The idea is to divide the image into many grids , Several can be derived from the center of each lattice base boxes. Use neural network to classify these grids at one time , For these baseboxes Regression .
Object segmentation & Semantic segmentation
Semantic segmentation is to classify each pixel in the image , Do not distinguish between objects , Only care about pixels , Often costly , May first The framework of undersampling and oversampling
Under sampling can use convolution layer and pooling , Over sampling adopts de pooling , Transposition convolution
There is another one called Max Unpooling Methods , This method records the previous use max pooling The index of the previous maximum values in the array , When de pooling, put the value in the index , Fill in other positions 0:
Object segmentation -Mask RCNN
What we need to do is to go further in object detection , Segment the objects from the pixel level .
Use images CNN Process as a feature , And then through a RPN Network generation candidate area , Project to the previous feature map. Here with faster RCNN equally . Then there are two branches , A branch and faster RCNN identical , Predict the classification and boundary value of the candidate box , Another branch is similar to semantic segmentation , Classify each pixel .
边栏推荐
- 【刷题篇】有效的数独
- Win11 prompt: what if the software cannot be downloaded safely? Win11 cannot download software safely
- sqlserver 做cdc 要对数据库性能有什么要求么
- Flet教程之 11 Row组件在水平数组中显示其子项的控件 基础入门(教程含源码)
- 漫画:什么是分布式事务?
- yarn 常用命令
- 详解SQL中Groupings Sets 语句的功能和底层实现逻辑
- Android 隐私沙盒开发者预览版 3: 隐私安全和个性化体验全都要
- [deep learning] [original] let yolov6-0.1.0 support the txt reading dataset mode of yolov5
- 养不起真猫,就用代码吸猫 -Unity 粒子实现画猫咪
猜你喜欢
解决CMakeList find_package找不到Qt5,找不到ECM
Pspnet | semantic segmentation and scene analysis
The new version of effect editor is online! 3D rendering, labeling, and animation, this time an editor is enough
Oneforall installation and use
Explain in detail the functions and underlying implementation logic of the groups sets statement in SQL
Win11 prompt: what if the software cannot be downloaded safely? Win11 cannot download software safely
2020-2022 two-year anniversary of creation
数据访问 - EntityFramework集成
If you can't afford a real cat, you can use code to suck cats -unity particles to draw cats
Basic introduction to the control of the row component displaying its children in the horizontal array (tutorial includes source code)
随机推荐
Oneforall installation and use
降本40%!Redis多租户集群的容器化实践
[echart] resize lodash 实现窗口缩放时图表自适应
详解SQL中Groupings Sets 语句的功能和底层实现逻辑
怎样在电脑上设置路由器的WiFi密码
今日睡眠质量记录79分
新春限定丨“牛年忘烦”礼包等你来领~
Migrate /home partition
公司自用的国产API管理神器
Jarvis OJ shell流量分析
How to install MySQL
Single merchant v4.4 has the same original intention and strength!
[echart] resize lodash to realize chart adaptation when window is zoomed
If you can't afford a real cat, you can use code to suck cats -unity particles to draw cats
How to uninstall MySQL cleanly
Flet教程之 11 Row组件在水平数组中显示其子项的控件 基础入门(教程含源码)
Global Data Center released DC brain system, enabling intelligent operation and management through science and technology
Apple has abandoned navigationview and used navigationstack and navigationsplitview to implement swiftui navigation
[js] 技巧 简化if 判空
[es6] add if judgment or ternary operator judgment in the template string