当前位置:网站首页>Learning rate optimization strategy
Learning rate optimization strategy
2022-07-24 05:57:00 【Didi'cv】
Warmup Strategy
Deep learning , The initial weight of the model is randomly generated in the training start-up stage , choice Warmup The strategy can make the model use less learning at the beginning of training
Practice at a regular rate , After a set number of iterations , After the model tends to be stable , Then change to the preset learning rate , Achieve the effect of preheating the learning rate , It can prevent the model from shaking , Accelerate the convergence speed of the network , Improve the effect .
Used in experiments Warmup In strategy Gradual Warmup, That is, in the preheating stage of learning rate, the learning rate will gradually increase with the increase of iteration times , Until the end of the warm-up phase, the learning rate reaches the preset value , And then do the follow-up training , This can avoid the sudden increase of learning rate and the sharp increase of training error .


Poly Strategy
Learning rate is a super parameter that has a great influence on the weight update of the model . Only when the initial learning rate is set reasonably can the model be optimized , Too small will lead to slow convergence , Too large will lead to instability or convergence failure . The learning rate needs to change with the degree of online training , Its change strategy is very important , There are many strategies in deep learning , Such as Fixed Strategy 、Poly Strategy and sigmoid Strategy . In this paper, the experimental results are as follows SGD The optimization strategy adds Poly Learning rate decay strategy , The current learning rate is 

边栏推荐
- UDP通讯应用于多种环境的Demo
- 《统计学习方法(第2版)》李航 第十三章 无监督学习概论 思维导图笔记
- C语言链表(创建、遍历、释放、查找、删除、插入一个节点、排序,逆序)
- Qt 使用纯代码画图异常
- Test whether the label and data set correspond after data enhancement
- 《机器学习》(周志华)第一章 绪论 笔记 学习心得
- [MYCAT] MYCAT sub database and sub table
- 数组常用方法
- [activiti] activiti process engine configuration class
- Positional argument after keyword argument
猜你喜欢

Jupyter notebook选择conda环境

学习率优化策略

Delete the weight of the head part of the classification network pre training weight and modify the weight name

Machine learning (Zhou Zhihua) Chapter 3 Notes on learning linear models

Answers and analysis of some after-school exercises in signals and systems (Wujing)
![[activiti] activiti introduction](/img/17/bd8f6fd8dd8918a984ca0a3793ec0b.jpg)
[activiti] activiti introduction

《统计学习方法(第2版)》李航 第十三章 无监督学习概论 思维导图笔记
![[raspberry pie 4B] VII. Summary of remote login methods for raspberry pie xshell, putty, vncserver, xrdp](/img/dc/364fdc4c1748cc5522e4592bc47dc3.png)
[raspberry pie 4B] VII. Summary of remote login methods for raspberry pie xshell, putty, vncserver, xrdp

DeepSort 总结

Positional argument after keyword argument
随机推荐
Typora 安装包2021年11月最后一次免费版本的安装包下载V13.6.1
Jupyter notebook选择conda环境
Chapter III summary of linear model
GCC 中__attribute__((constructor)和__attribute__(((destructor))的注意事项。
systemctl + journalctl
Problems in SSM project configuration, various dependencies, etc. (for personal use)
Native JS magnifying glass effect
MySql与Qt连接、将数据输出到QT的窗口tableWidget详细过程。
[deep learning] teach you to write "handwritten digit recognition neural network" hand in hand, without using any framework, pure numpy
《剑指Offer》 二维数组的查找 C语言版本
MySql下载,及安装环境设置
Multi merchant mall system function disassembly Lecture 14 - platform side member level
删除分类网络预训练权重的的head部分的权重以及修改权重名称
HAL_Delay()延时误差约1ms的问题
KMP代码分布详解
[USB host] stm32h7 cubemx porting USB host with FreeRTOS to read USB disk, usbh_ Process_ The OS is stuck. There is a value of 0xa5a5a5
学习率优化策略
测试数据增强后标签和数据集是否对应
读取csv文件的满足条件的行并写入另一个csv中
systemctl + journalctl