A short walkthrough of the Torch optimizer
2022-07-07 05:17:00 【Fun of construction】
I wrote this against torch version 1.11.
Take SGD as an example:


In the code, maximize = group['maximize'] shows that the maximize parameter can be specified per parameter group. It can of course also be set globally. As far as I can tell, maximize is unique to SGD (as of version 1.11).
Note: for other optimizers such as Adam, I did not see a maximize parameter. To perform gradient ascent with them, my idea is to rewrite the optimizer.
Note that self.param_groups is a list defined in the base class; you will not find its definition in SGD. As the SGD source shows, SGD walks through self.param_groups and applies the gradient update (ascent or descent) to the params of each group.
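Besides rewriting the optimizer, one common alternative is to keep an ordinary minimizing update and feed it the negated objective. The sketch below is a plain-Python toy, not torch code: the function f, its gradient, the learning rate, and the step count are all assumptions chosen for illustration.

```python
# Toy illustration (plain Python, no torch): descending on -f is the same as
# ascending on f. Here f(x) = -(x - 3)^2 is a made-up objective whose maximum
# is at x = 3.

def grad_f(x):
    """Gradient of f(x) = -(x - 3)^2."""
    return -2.0 * (x - 3.0)

def descent_step(x, grad, lr):
    """An ordinary minimizing update, the only thing the 'optimizer' can do."""
    return x - lr * grad

x = 0.0
lr = 0.1  # assumed learning rate
for _ in range(100):
    # Minimize g = -f, so pass in grad g = -grad f.
    x = descent_step(x, -grad_f(x), lr)

print(round(x, 6))
```

With torch, the same idea would amount to calling backward on the negated objective before stepping an unmodified optimizer.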
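The per-group loop can be sketched in plain Python. This is a toy stand-in, not the torch source: parameters are dicts holding a float value and a float grad, and toy_sgd_step is a hypothetical name. It only shows how a maximize flag read from each group flips the direction of the update.

```python
# Toy stand-in for the loop over param_groups (plain floats, not tensors).

def toy_sgd_step(param_groups):
    for group in param_groups:
        lr = group["lr"]
        maximize = group["maximize"]  # read per group, as in the post
        for p in group["params"]:
            # Flip the gradient sign when this group asks for ascent.
            grad = -p["grad"] if maximize else p["grad"]
            p["value"] -= lr * grad

w_down = {"value": 1.0, "grad": 2.0}
w_up = {"value": 1.0, "grad": 2.0}
groups = [
    {"params": [w_down], "lr": 0.1, "maximize": False},  # gradient descent
    {"params": [w_up], "lr": 0.1, "maximize": True},     # gradient ascent
]
toy_sgd_step(groups)
print(w_down["value"], w_up["value"])
```

Both parameters start from the same value and gradient; the first moves against the gradient and the second moves along it.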
Note the defaults variable in SGD's __init__ method: it is passed on to the __init__ method of the base class. For more on super, there is a very detailed video on Bilibili.
Back in the base class, pay attention to the comment on defaults:

Further down, the dictionary method setdefault is used to apply the global defaults to each group:
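The interplay between the subclass's defaults, super().__init__, and setdefault can be sketched with a simplified model. ToyOptimizer and ToySGD are hypothetical classes written for this illustration and omit nearly everything the real base class does; only the merging of defaults into each group is modeled.

```python
# Simplified model of how global defaults reach each parameter group.

class ToyOptimizer:
    def __init__(self, params, defaults):
        self.defaults = defaults
        self.param_groups = []
        params = list(params)
        # A flat list of parameters becomes a single group.
        if not isinstance(params[0], dict):
            params = [{"params": params}]
        for group in params:
            self.add_param_group(group)

    def add_param_group(self, group):
        # setdefault only fills keys the group did not set itself,
        # so per-group values win over the global defaults.
        for name, default in self.defaults.items():
            group.setdefault(name, default)
        self.param_groups.append(group)

class ToySGD(ToyOptimizer):
    def __init__(self, params, lr=0.01, maximize=False):
        defaults = dict(lr=lr, maximize=maximize)
        super().__init__(params, defaults)  # hand defaults to the base class

opt = ToySGD(
    [{"params": ["w1"], "maximize": True}, {"params": ["w2"]}],
    lr=0.1,
)
for g in opt.param_groups:
    print(g)
```

The first group keeps its own maximize=True, while the second group inherits maximize=False; both pick up the global lr.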