当前位置:网站首页>Building lightweight target detection based on mobilenet-yolov4
Building lightweight target detection based on mobilenet-yolov4
2022-06-27 03:46:00 【@BangBang】
Network replacement implementation ideas
1、 Network structure analysis and replacement thinking analysis

about YoloV4 for , The whole network structure can be divided into three parts .
Namely :
- 1、 Backbone feature extraction network
Backbone, Corresponding to... On the imageCSPdarknet53 - 2、 Strengthen the feature extraction network , Corresponding to... On the image
SPPandPANet - 3、 Prediction network
YoloHead, Use the obtained features to predict
among :
- The first part is the function of the backbone feature extraction network , Extract network information using backbone features , We can obtain three preliminary effective feature layers .
- The second part is to enhance the function of feature extraction network , Using enhanced feature extraction network , We can perform feature fusion on three preliminary effective feature layers , Extract better features , Get three more effective feature layers .
- The function of the third part of the prediction network is to obtain the prediction results by using a more effective and effective entire layer .
In these three parts , The first 1 Section and section 2 Parts can be modified more easily . The first 3 Some modifiable parts are of little significance , After all, it's just 3x3 Convolution sum 1x1 The combination of convolutions .
mobilenet A series of networks can be used for classification , Its main part is used for feature extraction , We can use mobilenet Series network replaces yolov4 In the middle of CSPdarknet53 Feature extraction , Make the three preliminary effective feature layers the same shape Feature layer to enhance feature extraction , Can will mobilenet Series replacement into yolov4 In the middle .
3、 Integrate the feature extraction results into yolov4 In the Internet

about yolov4 Speaking of , We need to use the three effective features obtained from the backbone feature extraction network to strengthen the construction of the feature pyramid .
Use the... Defined in the previous step MobilenetV1、MobilenetV2、MobilenetV3 Three functions, we can get each Mobilenet Three effective feature layers corresponding to the network .
We can use these three effective feature layers to replace the original yolov4 Backbone network CSPdarknet53 Effective feature layer .
In order to further reduce the number of parameters , We can use deep separable convolution instead of yoloV4 The common convolution used in .
4、 Modify the enhanced feature extraction network PAnet, Reduce the amount of network parameters
about PAnet The parameters of are mainly concentrated in 3x3 In convolution of , If you can 3x3 Convolution is modified , The parameter quantity can be greatly reduced . utilize Deep separable structure is fast Replace 3x3 Convolution
5. Modify convolution kernel magnification factor alpha
Convolution kernel magnification factor alpha, Control the number of convolution kernels , Modifying this parameter can reduce the number of convolution kernels , Thus reducing the amount of network parameters
See blog :Tensorflow2 utilize mobilenet series (v1,v2,v3) build yolov4 Target detection platform
github Source code :https://github.com/bubbliiiing/mobilenet-yolov4-tf2
B Stop video :Tensorflow2 Build your own Mobilenet-YoloV4 Target detection platform
边栏推荐
- [Shangshui Shuo series] day 6
- Geometric distribution (a discrete distribution)
- MATLAB | 基于分块图布局的三纵坐标图绘制
- Anaconda3安装过程及安装后缺失大量文件,没有scripts等目录
- 超級詳細,2 萬字詳解,吃透 ES!
- MATLAB | 三个趣的圆相关的数理性质可视化
- 文旅夜游|以沉浸式视觉体验激发游客的热情
- 2021:Graphhopper: Multi-Hop Scene Graph Reasoning for Visual Question Answering
- Fplan powerplan instance
- USB DRIVER
猜你喜欢

2021:Beyond Question-Based Biases:Assessing Multimodal Shortcut Learning in Visual Question Answeri

I found a JSON visualization tool artifact. I love it!

mysql数据库基础:DQL数据查询语言

Static timing analysis OCV and time derive

超级详细,2 万字详解,吃透 ES!

Stack overflow vulnerability

Pat grade a 1020 tree Traversals

MATLAB | 三个趣的圆相关的数理性质可视化

电商产品如何在知乎上进行推广和打广告?

2016Analyzing the Behavior of Visual Question Answering Models
随机推荐
Pat class a 1024 palindromic number
Pat grade a 1026 table tennis
乐得瑞LDR6035 USB-C接口设备支持可充电可OTG传输数据方案。
Static timing analysis OCV and time derive
Pat grade a 1025 pat ranking
How do I simplify the development of interfaces in open source systems?
How to solve the problem of low applet utilization
Pat grade a 1021 deep root
How does source insight (SI) display the full path? (do not display omitted paths) (turn off trim long path names with ellipses)
通信中的机器学习最佳阅读资料列表
Fplan layout
Promise [II. Promise source code] [detailed code comments / complete test cases]
fplan-布局
Anaconda3 is missing a large number of files during and after installation, and there are no scripts and other directories
Nestjs environment variable configuration to solve the problem of how to inject services into interceptors
记录unity 自带读取excel的方法和遇到的一些坑的解决办法
MATLAB | 基于分块图布局的三纵坐标图绘制
电商产品如何在知乎上进行推广和打广告?
Common sense of Apple's unique map architecture
栈溢出漏洞