CS231n notes (part 2): suitable for readers with no background
2022-07-05 16:43:00 【Small margin, rush】
Catalog
Transfer learning and object localization & detection
Object segmentation & Semantic segmentation
Transfer learning and object localization & detection
Can object detection be treated as a regression problem? Different images contain different numbers of objects, so the model may have to output 4 bounding boxes for one image and 8 for another. Because the number of outputs is not fixed, detection cannot be treated directly as a regression problem.
Transfer learning
In practice, few datasets are large enough, so few people train a network from scratch. The common approach is to pretrain a CNN on a very large dataset, then use the pretrained network either as the initialization for fine-tuning or as a feature extractor.
- CNN as a feature extractor. Take a CNN pretrained on ImageNet, remove the last fully connected layer (the layer used for classification), and use the rest as a feature extractor.
- Fine-tuning the CNN. Replace the network's input data with the new dataset and continue training on it.
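The feature-extractor option can be sketched in NumPy as follows. This is only a toy illustration under stated assumptions: the fixed random projection `W_backbone` is a hypothetical stand-in for a pretrained CNN with its last layer removed, and `extract_features`, `train_linear_classifier`, and the toy dataset are made up for this example.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical stand-in for a pretrained CNN without its last FC layer:
# a fixed random projection followed by ReLU. These weights stay frozen.
W_backbone = rng.normal(size=(20, 64))

def extract_features(x):
    """Frozen feature extractor: maps raw inputs to 64-d features."""
    return np.maximum(x @ W_backbone, 0.0)

def train_linear_classifier(feats, labels, num_classes, lr=0.05, steps=200):
    """Train only a new softmax classifier on top of the frozen features."""
    W = np.zeros((feats.shape[1], num_classes))
    b = np.zeros(num_classes)
    onehot = np.eye(num_classes)[labels]
    for _ in range(steps):
        scores = feats @ W + b
        scores -= scores.max(axis=1, keepdims=True)   # numerical stability
        probs = np.exp(scores)
        probs /= probs.sum(axis=1, keepdims=True)
        grad = (probs - onehot) / len(feats)          # gradient of cross-entropy w.r.t. scores
        W -= lr * feats.T @ grad
        b -= lr * grad.sum(axis=0)
    return W, b

# Toy "new dataset": two well-separated clusters of 20-d points.
x = np.concatenate([rng.normal(-2.0, 1.0, size=(50, 20)),
                    rng.normal(+2.0, 1.0, size=(50, 20))])
y = np.array([0] * 50 + [1] * 50)

backbone_before = W_backbone.copy()
feats = extract_features(x)
feats = feats / feats.std()                           # simple global rescaling
W, b = train_linear_classifier(feats, y, num_classes=2)
accuracy = np.mean((feats @ W + b).argmax(axis=1) == y)
```

Only `W` and `b` are updated here. Fine-tuning would additionally backpropagate into the backbone weights.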
Transfer learning scenarios:
- The new dataset is small and highly similar to the original dataset. Fine-tuning the CNN is not recommended because of the risk of overfitting. Use the pretrained CNN as a feature extractor and train a linear classifier on top.
- The new dataset is large and highly similar to the original dataset. Because the new dataset is large enough, the whole network can be fine-tuned.
- The new dataset is small and very different from the original dataset. Because the dataset is small, it is best to train only a linear classifier. Because the dataset differs from the original one, that classifier is best trained on activations from an earlier (shallower) layer of the pretrained network.
- The new dataset is large and very different from the original dataset. Because the new dataset is large enough, the network can be retrained.
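The four scenarios amount to a small decision table. A minimal sketch, where the function name `transfer_strategy` and its two boolean parameters are hypothetical and "small" and "similar" remain judgment calls:

```python
def transfer_strategy(dataset_is_small: bool, similar_to_original: bool) -> str:
    """Return the strategy suggested by the four scenarios above."""
    if dataset_is_small and similar_to_original:
        return "train a linear classifier on features from the pretrained CNN"
    if not dataset_is_small and similar_to_original:
        return "fine-tune the whole network"
    if dataset_is_small and not similar_to_original:
        return "train a linear classifier on features from an earlier layer"
    return "retrain the network on the new data"

print(transfer_strategy(dataset_is_small=True, similar_to_original=False))
```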
Object localization & detection
Beyond labeling the image, the model must also draw a box around where each object is. In classification + localization, the number of output boxes is known in advance. In object detection, the number of objects is not known, so the regression framework cannot be applied directly.
Sliding window (OverFeat)
Select windows of several different sizes and positions across the image, and run the classifier on each window.
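Enumerating such windows can be sketched as below. This is an illustrative helper only, not OverFeat's actual implementation: the name `sliding_windows`, the window sizes, and the stride are all assumptions.

```python
def sliding_windows(image_w, image_h, window_sizes, stride):
    """Yield (x, y, w, h) for every window that fits inside the image."""
    for w, h in window_sizes:
        for y in range(0, image_h - h + 1, stride):
            for x in range(0, image_w - w + 1, stride):
                yield (x, y, w, h)

# An 8x8 image scanned with 4x4 and 8x8 windows at stride 2.
windows = list(sliding_windows(8, 8, [(4, 4), (8, 8)], stride=2))
print(len(windows))  # 9 windows of size 4x4 plus 1 of size 8x8 = 10
```

Each window would then be cropped and passed to the classifier, which is why the cost grows quickly with the number of sizes and positions.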
R-CNN
- Pretrain a CNN.
- Build a training set: first apply the Selective Search algorithm to select roughly 2000 to 3000 candidate boxes.
- Preprocess each candidate region and pass it to the CNN to extract image features. Send the features to an SVM classifier and compute the classification loss for the labels. Also send the features to a regressor and compute the L2 loss on the box offsets.
- Train with backpropagation.
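The L2 loss on box offsets in the third step can be illustrated like this. The sketch assumes boxes given as (center_x, center_y, width, height) and the offset parameterization commonly attributed to the R-CNN paper (center shifts normalized by the proposal size, log-scale width and height). The helper names are hypothetical.

```python
import math

def box_offsets(proposal, target):
    """Offsets (tx, ty, tw, th) that move a proposal box onto a target box."""
    px, py, pw, ph = proposal
    gx, gy, gw, gh = target
    return ((gx - px) / pw, (gy - py) / ph,
            math.log(gw / pw), math.log(gh / ph))

def l2_loss(predicted, target):
    """Sum of squared differences between predicted and target offsets."""
    return sum((p - t) ** 2 for p, t in zip(predicted, target))

proposal = (50.0, 50.0, 20.0, 40.0)       # a candidate box from Selective Search
ground_truth = (55.0, 50.0, 20.0, 40.0)   # the labeled box, shifted 5 px to the right
target_offsets = box_offsets(proposal, ground_truth)
loss = l2_loss((0.0, 0.0, 0.0, 0.0), target_offsets)  # regressor predicts "no shift"
```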
Fast R-CNN
Fast R-CNN addresses the slow training and prediction of R-CNN: the CNN extracts features from the whole image once, and the candidate regions are then taken from the resulting feature map.
- As in R-CNN, use Selective Search to generate more than 2000 candidate boxes.
- Feed the whole image directly into the CNN to extract features.
- Map the 2000 boxes onto the feature map produced by the CNN's last layer.
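Mapping a box onto the feature map comes down to dividing its coordinates by the network's total stride. The sketch below assumes a stride of 16 and a single-channel feature map, and adds a simplified RoI max pooling step, which is what Fast R-CNN applies next to get a fixed-size output per box. `project_box` and `roi_max_pool` are hypothetical names.

```python
import numpy as np

def project_box(box, stride):
    """Map an image-space box (x1, y1, x2, y2) onto feature-map coordinates."""
    x1, y1, x2, y2 = box
    fx1, fy1 = x1 // stride, y1 // stride
    # Keep the projected box at least one cell wide and tall.
    return (fx1, fy1, max(x2 // stride, fx1 + 1), max(y2 // stride, fy1 + 1))

def roi_max_pool(feature_map, roi, out_size=2):
    """Max-pool the RoI of an (H, W) feature map to out_size x out_size.

    Assumes the RoI spans at least out_size cells in each direction.
    """
    x1, y1, x2, y2 = roi
    region = feature_map[y1:y2, x1:x2]
    row_bins = np.array_split(np.arange(region.shape[0]), out_size)
    col_bins = np.array_split(np.arange(region.shape[1]), out_size)
    pooled = np.zeros((out_size, out_size), dtype=feature_map.dtype)
    for i, rows in enumerate(row_bins):
        for j, cols in enumerate(col_bins):
            pooled[i, j] = region[np.ix_(rows, cols)].max()
    return pooled

feature_map = np.arange(64).reshape(8, 8)         # toy 8x8 map for a 128x128 image
roi = project_box((32, 16, 96, 80), stride=16)    # -> (2, 1, 6, 5)
pooled = roi_max_pool(feature_map, roi)
```

Because the expensive CNN pass happens once per image instead of once per box, the per-box work shrinks to this cheap crop-and-pool step.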
Faster R-CNN:
SSD
The idea of SSD is to divide the image into a grid and derive several base boxes from the center of each grid cell. A neural network then classifies all of these cells in a single pass and regresses the base boxes.
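Generating base boxes from the grid-cell centers might look like the sketch below. The grid size, scale, and aspect ratios are illustrative assumptions rather than SSD's published settings, and `base_boxes` is a hypothetical helper.

```python
def base_boxes(image_size, grid_size, aspect_ratios, scale):
    """Return (cx, cy, w, h) base boxes centered on every grid cell."""
    cell = image_size / grid_size
    boxes = []
    for row in range(grid_size):
        for col in range(grid_size):
            cx, cy = (col + 0.5) * cell, (row + 0.5) * cell
            for ratio in aspect_ratios:               # ratio = width / height
                w = scale * image_size * ratio ** 0.5
                h = scale * image_size / ratio ** 0.5
                boxes.append((cx, cy, w, h))
    return boxes

# A 3x3 grid with 3 aspect ratios per cell gives 27 base boxes.
boxes = base_boxes(image_size=300, grid_size=3,
                   aspect_ratios=[1.0, 2.0, 0.5], scale=0.2)
```

The network would then output, for every base box, one set of class scores and four regression offsets, all in a single forward pass.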
Object segmentation & Semantic segmentation
Semantic segmentation classifies every pixel in the image. It does not distinguish between object instances and only cares about pixels. Doing this at full resolution is often costly, so a common framework first downsamples and then upsamples.
Downsampling can use convolution layers and pooling. Upsampling uses unpooling or transposed convolution.
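As a rough 1D illustration of transposed convolution: each input value scales a copy of the kernel, the copies are placed `stride` positions apart, and overlapping positions are summed. The function name and the example values are assumptions for this sketch.

```python
import numpy as np

def transposed_conv1d(x, kernel, stride):
    """1D transposed convolution without padding."""
    out = np.zeros((len(x) - 1) * stride + len(kernel))
    for i, value in enumerate(x):
        start = i * stride
        out[start:start + len(kernel)] += value * kernel   # overlaps are summed
    return out

# Two inputs are upsampled to five outputs; the middle position receives
# contributions from both copies of the kernel (1 + 2 = 3).
y = transposed_conv1d(np.array([1.0, 2.0]), np.array([1.0, 1.0, 1.0]), stride=2)
print(y)  # [1. 1. 3. 2. 2.]
```

In a real network the kernel values are learned, which is what distinguishes this from fixed upsampling schemes.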
Another method is max unpooling. It records the index of each maximum value selected by the earlier max pooling layer. When unpooling, it puts each value back at its recorded index and fills the other positions with 0.
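A minimal NumPy sketch of max pooling with recorded indices, followed by max unpooling. It assumes a single-channel 2D input and non-overlapping 2x2 windows, and the function names are made up for illustration.

```python
import numpy as np

def max_pool_2x2(x):
    """2x2 max pooling on an (H, W) array; also returns flat indices of the maxima."""
    h, w = x.shape
    pooled = np.zeros((h // 2, w // 2), dtype=x.dtype)
    indices = np.zeros((h // 2, w // 2), dtype=int)
    for i in range(h // 2):
        for j in range(w // 2):
            window = x[2 * i:2 * i + 2, 2 * j:2 * j + 2]
            k = int(window.argmax())                  # position inside the window, 0..3
            pooled[i, j] = window[k // 2, k % 2]
            indices[i, j] = (2 * i + k // 2) * w + (2 * j + k % 2)
    return pooled, indices

def max_unpool_2x2(pooled, indices, output_shape):
    """Put each pooled value back at its recorded index; everything else is 0."""
    out = np.zeros(output_shape, dtype=pooled.dtype)
    np.put(out, indices.ravel(), pooled.ravel())
    return out

x = np.array([[1, 2, 6, 3],
              [3, 5, 2, 1],
              [1, 2, 2, 1],
              [7, 3, 4, 8]])
pooled, indices = max_pool_2x2(x)                     # pooled = [[5, 6], [7, 8]]
restored = max_unpool_2x2(pooled, indices, x.shape)
```

In a segmentation network the unpooling layer usually receives values from a later layer rather than `pooled` itself, but it reuses the indices recorded by the matching pooling layer.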
Object segmentation: Mask R-CNN
The goal is to go one step beyond object detection and segment each object at the pixel level.
The image is passed through a CNN to extract features, and an RPN then generates candidate regions, which are projected onto the feature map. This part is the same as Faster R-CNN. Two branches follow: one is identical to Faster R-CNN and predicts the class and bounding box of each candidate box; the other resembles semantic segmentation and classifies each pixel.
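The output of the mask branch can be pictured with the simplified sketch below. It assumes the branch has already produced per-pixel logits at the same resolution as the box, whereas the real model predicts a small fixed-size mask that is resized to the box. `paste_instance_mask` is a hypothetical helper.

```python
import numpy as np

def paste_instance_mask(mask_logits, box, image_shape, threshold=0.5):
    """Turn per-pixel logits inside a box into a full-image boolean mask."""
    x1, y1, x2, y2 = box
    if mask_logits.shape != (y2 - y1, x2 - x1):
        raise ValueError("mask_logits must match the box size in this sketch")
    probs = 1.0 / (1.0 + np.exp(-mask_logits))        # per-pixel sigmoid
    full_mask = np.zeros(image_shape, dtype=bool)
    full_mask[y1:y2, x1:x2] = probs > threshold       # object vs. background per pixel
    return full_mask

logits = np.array([[2.0, -1.0],
                   [-3.0, 4.0]])
mask = paste_instance_mask(logits, box=(1, 1, 3, 3), image_shape=(4, 4))
```

Running this once per detected box gives one mask per object instance, which is the difference from semantic segmentation, where all pixels of a class share a single label map.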