当前位置:网站首页>Shuffle Net_ v1-shuffle_ v2
Shuffle Net_ v1-shuffle_ v2
2022-07-28 06:22:00 【A tavern on the mountain】
Catalog
2. Computational complexity comparison :
3.Shuffle Net_v1 Model architecture and parameter setting
4.Shuffle Net_v1 Accuracy rate
5.Shuffle Net_v2 Design criteria
6.Shuffle Net_v2 Network architecture improvement
shuffleNet combination ResNeXt Group convolution (GConv) And MobileNet Of DW Convolution .ResNeXt in 1*1 Convolution takes up 94.3% Of Mdds, Introduce group convolution GConv Reduce computational complexity . When performing group convolution , Each group of convolution is performed independently , So introduce shuffle operation bring Group And Group Information exchange between .
1.shuffle operation

2. Computational complexity comparison :

Compare with ResNet,ShuffleNet_v1 Yes 1*1 The convolution of , And in the middle 3*3 Convolution uses DW Convolution , Group number g= The channel number m.
3.Shuffle Net_v1 Model architecture and parameter setting
Next, let's look at the model architecture and the parameters of each layer :

Every stage One of the first block Step length stride by 2, Next stage output channels Double ( Down sampling ) As shown in the structure diagram (c).bottleneck( middle 3*3DW Convolution ) The number of channels is... Of the output channel 1/4.
4.Shuffle Net_v1 Accuracy rate

shufflenet_v2
5.Shuffle Net_v2 Design criteria
shufflenet_v2 Four indicators for efficient network design are proposed ,
FLOPs Is an indirect indicator , And speed is the direct indicator of the speed of model calculation ,MAC( Memory footprint )、 platform 、 Parallelism is an index that affects the speed of the model , Put forward four guidelines for designing efficient networks :
(1) When the number of channels of the input characteristic matrix and the output characteristic matrix of the convolution layer is equal ,MAC Minimum .

(2) When GConv Of group When it increases (FLOPs unchanged ),MAC Also increase .

(3) The more fragmented the network is , The slower the network . Although the accuracy is improved , But it is not friendly to hardware devices with parallel computing .
(4)Element—wise(ReLU、shortcut etc. ) The impact of the crisis cannot be ignored .

6.Shuffle Net_v2 Network architecture improvement
According to the above four principles , shufflenet_v2 Yes shufflenet_v1 A series of improvements have been made , The network architecture is shown in the figure below :

1 Division is adopted at channle_split The way , Reduce access to bneck The number of channels , In half .
2 Change the group convolution back to normal 1*1 Convolution , Follow the second 2, Fewer groups g.
3 Adopted at shuffle Location change of .
4 Adopted at concat Splicing ensures the whole input_c=output_c.
5 It's about ReLU Location change of .
6bneck in 1*1—3*3—1*1 The number of input channels of three-layer convolution is equal to the number of output channels .
(d) The bottom sampling operation in the figure , Channel division channle_split, The number of output channels is doubled .
7.Shuffle Net_v2 Parameters

For each of these stage One of the first block,channel Double , such as stage2 Of block1, Each branch has 58 individual channels, Corresponding (d) chart . Others block use split Method will input_channels Divide equally .
边栏推荐
- 说说ESXi虚拟交换机和端口组的“混杂模式”
- set_false_path
- mixup_ratio
- 开关电源电路EMI设计在layout过程中注意事项
- (PHP graduation project) obtain the campus network repair application management system based on PHP
- 针对大量数据,MATLAB生成EXCEL文件并进行排版处理的源码
- 在Asp.net 中Cookie的用法
- AEM testpro K50 and south Guangdong survey
- Summary of Intranet Information Collection
- PLC的整体认识
猜你喜欢

ESXi 社区版网卡驱动

The short jumper dsx-8000 test is normal, but the dsx-5000 test has no length display?

2、 Openvino brief introduction and construction process

How to view the transfer function of the module directly built by Simulink

天线效应解决办法

硬件电路设计学习笔记1--温升设计

Apache log4j arbitrary code execution replication

Arduino reads the analog voltage_ How mq2 gas / smoke sensor works and its interface with Arduino

短跳线DSX-8000测试正常,但是DSX-5000测试无长度显示?

浪涌冲击抗扰度实验(SURGE)-EMC系列 硬件设计笔记6
随机推荐
ICC2使用report_placement检查floorplan
Agilent安捷伦 E5071测试阻抗、衰减均正常,惟独串扰NG?---修复方案
Convert data in grilview into datatable
Fluke fluke aircheck WiFi tester cannot configure file--- Ultimate solution experience
USB Network Native Driver for ESXi更新到支持ESXi7.0.1
clock tree分析实例
Random life-1
EfficientNET_V1
BERT基于transformer的双向编码器
Led selection - hardware learning notes 3
frameset 用法示例
AEM线上产品推介会---线缆认证测仪
福禄克DSX2-5000 网络线缆测试仪为什么每年都要校准一次?
Transformer 自注意力机制 及完整代码实现
In asp Usage of cookies in. Net
EMC实验实战案例-ESD静电实验
Reading experience of protecting against DNN model steaming attacks
1、 Amd - openvino environment configuration
Efficient Net_V2
Surge impact immunity experiment (surge) -emc series Hardware Design Notes 6
