当前位置:网站首页>From repvgg to mobileone, including mobileone code
From repvgg to mobileone, including mobileone code
2022-07-04 21:52:00 【Roaring Ajie】
The idea of re parameterization is essentially to use the additivity of linear models . In industry, it is conv and bn Layer fusion is applied . In recent years, there have been re-parameter The job of .RepVGG It is a good application .
VGG It is a straight cylinder model , Because no skip connnection, Lead to deeper training vgg There will be degradation of the model . However, such as resnet etc. ,skip connnect It increases the running time for the end-to-end equipment , There is a lot of consumption in data access .
therefore RepVGG, Integrate the idea of heavy parameters , During the training , by vgg Introduced skip connection, During the test skip connection Merge with the main branch , Get one convbnrelu block, Then it becomes a straight cylinder model .
But it's different from resnet,RepVgg In order to ensure the additive principle of linear model , In two relu Between , Use residual Branch . Because any linear layer between nonlinear layers , Can be merged .
About how to integrate , You can see it in the reference link , It's very detailed
and MobileOne It is in RepVGG On the basis of , Realize that the straight cylinder model can bring a good acceleration to the end model . So it was decided that RepVgg Transformed into a lightweight model , Convolute depth separation , Use some well-designed training strategies , Make the straight tube model on the light-weight model , On a par mobilenet And other well-known lightweight models , And it is better in running speed .
I reappeared mobileone Of s0 edition , The effect is almost the same as that of the paper , Interest can be found in my github see
MobileOne code
Reference resources
边栏推荐
- 股票开户佣金最低多少,炒股开户佣金最低网上开户安全吗
- Jerry's ad series MIDI function description [chapter]
- 输入的查询SQL语句,是如何执行的?
- gtest从一无所知到熟练运用(1)gtest安装
- Operation of adding material schedule in SolidWorks drawing
- 时空预测3-graph transformer
- Solve the problem of data disorder caused by slow asynchronous interface
- Analysis of maker education technology in the Internet Era
- Use of redis publish subscription
- redis RDB AOF
猜你喜欢
WGCNA analysis basic tutorial summary
巅峰不止,继续奋斗!城链科技数字峰会于重庆隆重举行
ArcGIS 10.2.2 | solution to the failure of ArcGIS license server to start
[weekly translation go] how to code in go series articles are online!!
Interpreting the development of various intelligent organizations in maker Education
LambdaQueryWrapper用法
Jerry's ad series MIDI function description [chapter]
【C语言】符号的深度理解
Can be displayed in CAD but not displayed in print
Jerry's ad series MIDI function description [chapter]
随机推荐
更强的 JsonPath 兼容性及性能测试之2022版(Snack3,Fastjson2,jayway.jsonpath)
网上开户哪家证券公司佣金最低,我要开户,网上开户安全吗
Compréhension approfondie du symbole [langue C]
Redis cache
Billions of citizens' information has been leaked! Is there any "rescue" for data security on the public cloud?
Jerry's ad series MIDI function description [chapter]
How to remove the black dot in front of the title in word document
Application practice | Shuhai supply chain construction of data center based on Apache Doris
Le module minidom écrit et analyse XML
Golang面试整理 三 简历如何书写
开户哪家券商比较好?网上开户安全吗
Three or two things about the actual combat of OMS system
How to implement Devops with automatic tools
一文掌握数仓中auto analyze的使用
超详细教程,一文入门Istio架构原理及实战应用
Numpy vstack and column_ stack
Analysis of maker education technology in the Internet Era
Can be displayed in CAD but not displayed in print
Jerry's ad series MIDI function description [chapter]
Flink1.13 SQL basic syntax (I) DDL, DML