当前位置:网站首页>Tsinghua University product: penalty gradient norm improves generalization of deep learning model
Tsinghua University product: penalty gradient norm improves generalization of deep learning model
2022-07-04 06:08:00 【User 9447256】
The structure of neural network is simple , Insufficient training sample size , It will lead to the low classification accuracy of the trained model ; The structure of neural network is complex , Training sample size is too large , It will lead to over fitting of the model , Therefore, how to train neural network to improve the generalization of model is a very core problem in the field of artificial intelligence . I recently read an article related to this problem , In this paper, the author improves the generalization of the deep learning model by adding the constraint of the gradient norm of the regularization term in the loss function . The author expounds and verifies the methods in this paper in detail from two aspects of principle and experiment .L i p s c h i t z \mathrm{Lipschitz}Lipschitz Continuous learning is a very important and common mathematical tool in the theoretical analysis of deep learning , The loss function of neural network is L i p s c h i t z yes \mathrm{Lipschitz} yes Lipschitz Mathematical derivation with continuity as the starting point . In order to facilitate readers to more smoothly appreciate the author's beautiful mathematical proof ideas and processes , This paper supplements the details of mathematical proof that is not carried out in the paper .
————————————————
Copyright notice : This paper is about CSDN Blogger 「 sorcery 2022」 The original article of , follow CC 4.0 BY-SA Copyright agreement , For reprint, please attach the original source link and this statement .
Link to the original text :https://blog.csdn.net/qq_38406029/article/details/122851202
边栏推荐
- JS扁平化数形结构的数组
- High performance parallel programming and optimization | lesson 02 homework at home
- 509. Fibonacci number, all paths of climbing stairs, minimum cost of climbing stairs
- 分布式CAP理论
- 19. Framebuffer application programming
- JS how to convert seconds into hours, minutes and seconds display
- My NVIDIA developer journey - optimizing graphics card performance
- Learn about the Internet of things protocol WiFi ZigBee Bluetooth, etc. --- WiFi and WiFi protocols start from WiFi. What do we need to know about WiFi protocol itself?
- Steady! Huawei micro certification Huawei cloud computing service practice is stable!
- Practical gadget instructions
猜你喜欢

Online shrimp music will be closed in January next year. Netizens call No

ES6 模块化

Actual cases and optimization solutions of cloud native architecture

746. Climb stairs with minimum cost

如何实现视频平台会员多账号登录

740. Delete and get points

ES6 modularization

buuctf-pwn write-ups (8)

Google Chrome browser will support the function of selecting text translation

C realize Snake games
随机推荐
Thinkphp6.0 middleware with limited access frequency think throttle
ANSYS command
剑指 Offer II 038. 每日温度
QT releases multilingual International Translation
Overview of relevant subclasses of beanfactorypostprocessor and beanpostprocessor
2022.7.3-----leetcode. five hundred and fifty-six
QT qtablewidget table column top requirements ideas and codes
Online shrimp music will be closed in January next year. Netizens call No
JSON Web Token----JWT和傳統session登錄認證對比
如何避免 JVM 内存泄漏?
[openvino+paddle] paddle detection / OCR / SEG export based on paddle2onnx
Distributed cap theory
Luogu deep foundation part 1 Introduction to language Chapter 5 array and data batch storage
Internet of things protocol ZigBee ZigBee module uses the concept of protocol stack
Vant --- detailed explanation and use of list component in vant
[Chongqing Guangdong education] electronic circuit homework question bank of RTVU secondary school
C language exercises (recursion)
Leetcode question brushing record | 206_ Reverse linked list
buuctf-pwn write-ups (8)
C语言练习题(递归)