当前位置:网站首页>yolov7 innovation point
yolov7 innovation point
2022-08-02 09:57:00 【ffllxx123】
Innovation 1: Extended Efficient Layer Aggregation Network E-ELAN and Composite Model Reduction
In the middle of the picture, there is actually a convolution and a siluA bn layer.64 on the right is the number of output channels, 1 is the size of the convolution kernel, and 1 is the stride.
v7 proposes e-elan, in fact, e-elan istwo elan.
Model scaling
From (a) to (b), we observeAs a result, when performing depth scaling on cascade-based models, the output width of the computational block also increases.This phenomenon will cause the input width of subsequent transport layers to increase.Therefore, we propose (c) that when performing model scaling on a cascade-based model, only the depth in the computation block needs to be scaled, and the rest of the transport layer uses the corresponding width scaling.You can make the width unchanged (roughly this reason).
Reparameterized Network
As you can see from the leftmost figure, the residualThere are two types of structures, the middle one is the re-parameterized residual structure. It can be seen that each residual structure only spans a 3x3 convolution, and sometimes the two residual structures are used together, but there is no inference stage.Residual structure, which makes the training accuracy higher and the inference speed faster.
Parameter fusion, that is, 3x3 convolution kernel 1x1 volumeProduct and do nothing These 3 can be fused into a 3X3 convolution.
both left and right are doing nothing, is the identity all the way.Then, for example, the original road has an ordinary 3x3 convolution, then adding this ordinary 3x3 convolution and the convolution on the right side of the above picture one by one can achieve parameter fusion.You get the effect on the far right of the image below.Re-parameterization is achieved.
This repconv is a reparameterized convolution.The author found that the reparameterized convolution plus the residual is not good (d figure),
Innovation point 3 tag matching
a picture is a common pyramid model, introduced in v7The b structure is added and the auxiliary head is added. You can see it in the c picture, and calculate the loss of the guide head and the auxiliary head at the same time. As you can see in the d picture, the distributor in the guide head will assist in calculating the loss of the auxiliary head, and then at the same time for the guide head.The loss and the loss of the auxiliary head are optimized by gradient descent.
边栏推荐
- 理解JS的三座大山
- Re22:读论文 HetSANN An Attention-based Graph Neural Network for Heterogeneous Structural Learning
- 向量点积(Dot Product),向量叉积(Cross Product)
- ConvNeXt论文及实现
- Using the TCP protocol, will there be no packet loss?
- 用正向迭代器封装实现反向迭代器
- QT专题:组合会话框和文本编辑器
- 牛客网项目2.7开发注册功能 报错This application has no explicit mapping for /error......
- 读博一年后对机器学习工程的思考
- RPA助你玩转抖音,开启电商运营新引擎
猜你喜欢
Redis数据结构
【技术分享】OSPFv3基本原理
node制作一个视频帧长图生成器
【SeaTunnel】从一个数据集成组件演化成企业级的服务
net start mysql MySQL 服务正在启动 . MySQL 服务无法启动。 服务没有报告任何错误。
迭代器失效问题
用了TCP协议,就一定不会丢包嘛?
Nodejs3day(express简介,express创建基本Web服务器,托管静态资源,nodemon下载及出现的问题,中间件,编写GET,POST,JSONP接口)
Two-dimensional array piecemeal knowledge sorting
QT专题:事件机制event基础篇
随机推荐
【OpenCV】-霍夫变换
要长续航还是更安全?海豹与深蓝SL03对比导购
【New Edition】DeepFakes: Creation, Detection and Influence
The love-hate relationship between C language volatile keyword, inline assembly volatile and compiler
js防抖函数和函数节流的应用场景
The perceptron perceptron of Li Hang's "Statistical Learning Methods" notes
打印lua内部结构的函数调用
练习16-两道模拟题
转转反爬攻防战
HikariCP database connection pool, too fast!
食品安全 | 鱼肝油不是鱼油,家有宝宝的注意了
State Management in Jetpack Compose
matlab-day02
Navicat连接MySQL时弹出:1045:Access denied for user ‘root’@’localhost’
瑞吉外卖项目剩余功能补充
npm ERR! 400 Bad Request - PUT xxx - Cannot publish over previously published version “1.0.0“.
net start mysql MySQL 服务正在启动 . MySQL 服务无法启动。 服务没有报告任何错误。
【SeaTunnel】从一个数据集成组件演化成企业级的服务
斯皮尔曼相关系数
日元疲软令游戏机在日本变身“理财产品”:黄牛大赚