From Tsinghua University: penalizing the gradient norm improves the generalization of deep learning models
2022-07-04 06:08:00 【User 9447256】
If a neural network's structure is too simple, or the training set is too small, the trained model will have low classification accuracy. If the structure is too complex relative to the available training data, the model will overfit. How to train a neural network so that the model generalizes well is therefore a core problem in artificial intelligence. I recently read a paper on this problem, in which the authors improve the generalization of deep learning models by adding a regularization term to the loss function that constrains the gradient norm. The authors explain and verify the method in detail, both in principle and through experiments. $\mathrm{Lipschitz}$ continuity is an important and widely used mathematical tool in the theoretical analysis of deep learning, and the paper's mathematical derivation takes the $\mathrm{Lipschitz}$ continuity of the neural network's loss function as its starting point. To help readers follow the authors' elegant proof ideas more smoothly, this article fills in the details of the mathematical proofs that the paper does not spell out.
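The idea of adding a gradient-norm term to the loss can be illustrated with a small sketch. This is not the authors' implementation: it assumes the penalized objective has the form L(θ) + λ·‖∇L(θ)‖₂, and it uses a hypothetical one-dimensional linear model with a mean squared error loss so that the gradient can be written by hand. The names `loss`, `grad`, `grad_norm`, `penalized_loss`, and the coefficient `lam` are illustrative only.

```python
import math

def loss(theta, data):
    """Mean squared error of a 1-D linear model y ≈ w*x + b (toy loss, assumed)."""
    w, b = theta
    return sum((w * x + b - y) ** 2 for x, y in data) / len(data)

def grad(theta, data):
    """Analytic gradient of the mean squared error with respect to (w, b)."""
    w, b = theta
    n = len(data)
    gw = sum(2 * (w * x + b - y) * x for x, y in data) / n
    gb = sum(2 * (w * x + b - y) for x, y in data) / n
    return (gw, gb)

def grad_norm(theta, data):
    """Euclidean (L2) norm of the loss gradient at theta."""
    return math.hypot(*grad(theta, data))

def penalized_loss(theta, data, lam):
    """Assumed objective: original loss plus lam times the gradient norm."""
    return loss(theta, data) + lam * grad_norm(theta, data)

if __name__ == "__main__":
    # Toy data generated exactly by y = 2x + 1.
    data = [(0.0, 1.0), (1.0, 3.0), (2.0, 5.0)]
    for theta in [(0.0, 0.0), (2.0, 1.0)]:
        print(theta, loss(theta, data), penalized_loss(theta, data, lam=0.1))
```

At the exact minimizer (2, 1) both the loss and its gradient vanish, so the penalty adds nothing there; away from it, the extra term grows with the steepness of the loss surface. In a real deep learning setting the gradient would come from automatic differentiation rather than a hand-written formula.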
————————————————
Copyright notice: This is an original article by CSDN blogger 「 sorcery 2022」, licensed under the CC 4.0 BY-SA agreement. When reprinting, please attach the original source link and this statement.
Original link: https://blog.csdn.net/qq_38406029/article/details/122851202