当前位置:网站首页>The third week of postgraduate freshman training: resnet+resnext
The third week of postgraduate freshman training: resnet+resnext
2022-07-29 06:02:00 【Fanshoo】
PART 1: Video learning
One 、ResNet
1.ResNet Network innovation
2.ResNet The question of the outcome

3.ResNet-34 Network structure
4.ResNet Network structure

5.Residual structure

In the lower layers ResNet And deep ResNet The network structure used
Residual Calculation principle :

6. Network training results
In general neural networks , The deeper the network, the worse the effect

ResNet:

Two 、Batch Nomalization

BN Its function is to accelerate the convergence of the network and improve the accuracy , The purpose is to make every batch Of feature map The mean is 0, The variance of 1 Distribution law of
3、 ... and 、 The migration study
1. Advantages of transfer learning
2. Common transfer learning methods

Four 、ResNeXt
1. Group convolution

The main difference is ResNet The grouping convolution is added on the basis of , The parameter quantity is further reduced
2. Network structure

Convolution block structure :

PART 2: Code experiments
One 、 Experimental process
1. Introduce modules

2. Super parameter setting

3. Data preprocessing

4. Load data

5. Instantiation model

6. Training , test
Training :

test :

Learning rate adjustment :

Training :

Output csv file :

Two 、 Experimental questions
When doing neural network training, it is mainly built on this machine pytorch platform , Because the function of computer graphics card is not ideal ,batch_size and epoch And other parameters cannot be set too large , Large data sets and large parameter model training often explode .

The experimental effect is not ideal , It has not been used due to network problems colab, Next week, we will focus on solving the problem of the experimental platform , Retraining ResNet The Internet , Get better experimental results .
PART 3: I want to answer
1.Residual learning
Solve the gradient explosion in deep network / Gradient disappears and accuracy decreases ( Focus on training ) The problem of , Deepen the depth of neural network .
2.Batch Normalization
BN Its function is to accelerate the convergence of the network and improve the accuracy , The purpose is to make every batch Of feature map The mean is 0, The variance of 1 Distribution law of .
3. Why can grouping convolution improve accuracy ? That is, grouping convolution can improve the accuracy , At the same time, it can also reduce the amount of calculation , Can't you try to have as many scores as possible ?
(1) Reduce the number of parameters , Amount of computation , It can train deeper Networks
(2) Group as many as possible into DW Convolution , It is easy to split the feature relationship in each feature map , The result is not good
(3) Appropriate convolution mode and parameter setting should be selected according to the model and calculation amount
边栏推荐
- Gluster cluster management analysis
- How to PR an open source composer project
- ASM piling: after learning ASM tree API, you don't have to be afraid of hook anymore
- Ribbon learning notes 1
- Research and implementation of flash loan DAPP
- Semaphore (semaphore) for learning notes of concurrent programming
- Realize the scheduled backup of MySQL database in Linux environment through simple script (mysqldump command backup)
- [DL] build convolutional neural network for regression prediction (detailed tutorial of data + code)
- asyncawait和promise的区别
- ReportingService WebService Form身份验证
猜你喜欢

Android studio login registration - source code (connect to MySQL database)

【卷积核设计】Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs

anaconda中移除旧环境、增加新环境、查看环境、安装库、清理缓存等操作命令

File permissions of day02 operation

赓续新征程,共驭智存储

Show profiles of MySQL is used.

Research on the implementation principle of reentrantlock in concurrent programming learning notes
![[ml] PMML of machine learning model -- Overview](/img/a1/cd3eff044d903dbcfb880e854713e5.png)
[ml] PMML of machine learning model -- Overview

Exploration of flutter drawing skills: draw arrows together (skill development)

Reporting Services- Web Service
随机推荐
Super simple integration of HMS ml kit to realize parent control
【图像分类】如何使用 mmclassification 训练自己的分类模型
Training log III of "Shandong University mobile Internet development technology teaching website construction" project
【pycharm】pycharm远程连接服务器
Are you sure you know the interaction problem of activity?
手撕ORM 框架(泛型+注解+反射)
The difference between asyncawait and promise
【网络设计】ConvNeXt:A ConvNet for the 2020s
Performance comparison | FASS iSCSI vs nvme/tcp
个人学习网站
How to PR an open source composer project
Flink connector Oracle CDC 实时同步数据到MySQL(Oracle19c)
Operation commands in anaconda, such as removing old environment, adding new environment, viewing environment, installing library, cleaning cache, etc
isAccessible()方法:使用反射技巧让你的性能提升数倍
Nifi changed UTC time to CST time
与张小姐的春夏秋冬(2)
【卷积核设计】Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs
ASM插桩:学完ASM Tree api,再也不用怕hook了
Thinkphp6 pipeline mode pipeline use
Synchronous development with open source projects & codereview & pull request & Fork how to pull the original warehouse