From Tsinghua University: penalizing the gradient norm improves the generalization of deep learning models
2022-07-04 06:08:00 【User 9447256】
When a neural network is too simple and the training set is too small, the trained model tends to have low classification accuracy; when the network is overly complex for the data it is trained on, the model tends to overfit. How to train a neural network so that the model generalizes well is therefore a core problem in artificial intelligence. I recently read a paper on this problem, in which the authors improve the generalization of deep learning models by adding a regularization term to the loss function that constrains the gradient norm. The authors explain and verify the method in detail from two angles, theory and experiment. Lipschitz continuity is an important and common mathematical tool in the theoretical analysis of deep learning, and the paper's derivation starts from the assumption that the neural network's loss function is Lipschitz continuous. To help readers follow the authors' elegant proof ideas more smoothly, this article fills in the details of the mathematical proofs that the paper does not spell out.
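The idea of adding a gradient-norm term to the loss can be illustrated with a small sketch. Everything here is an assumption made for illustration and is not taken from the paper: the toy quadratic loss, the choice of the L2 norm, the coefficient `lam`, the step size `r`, and all function names are hypothetical. The sketch forms the penalized objective `loss + lam * ||grad||` and approximates its gradient with a finite difference of gradients in place of an explicit Hessian.

```python
import math

def loss(theta, curv):
    """Toy quadratic loss: 0.5 * sum(c_i * theta_i^2) (illustrative only)."""
    return 0.5 * sum(c * t * t for c, t in zip(curv, theta))

def grad(theta, curv):
    """Analytic gradient of the toy loss."""
    return [c * t for c, t in zip(curv, theta)]

def l2_norm(v):
    return math.sqrt(sum(x * x for x in v))

def penalized_loss(theta, curv, lam):
    """Loss plus lam times the L2 norm of its gradient."""
    return loss(theta, curv) + lam * l2_norm(grad(theta, curv))

def penalized_grad(theta, curv, lam, r=1e-3):
    """Gradient of penalized_loss.

    The penalty's gradient is H @ (g / ||g||), where H is the Hessian.
    It is approximated here by a finite difference of gradients
    (an illustrative choice, not confirmed by the source text).
    """
    g = grad(theta, curv)
    n = l2_norm(g)
    if n == 0.0:
        return g  # penalty is not differentiable at a stationary point
    shifted = [t + r * gi / n for t, gi in zip(theta, g)]
    g_shift = grad(shifted, curv)
    hv = [(gs - gi) / r for gs, gi in zip(g_shift, g)]
    return [gi + lam * h for gi, h in zip(g, hv)]

theta = [1.0, 2.0]
curv = [3.0, 0.5]
lam = 0.1

# One plain gradient-descent step on the penalized objective.
step = penalized_grad(theta, curv, lam)
theta_new = [t - 0.01 * s for t, s in zip(theta, step)]
```

For a real network the gradient would come from automatic differentiation rather than a closed form; the finite-difference trick only needs two gradient evaluations per step, which is why it is used in this sketch.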
————————————————
Copyright notice: This is an original article by CSDN blogger 「 sorcery 2022」, licensed under CC 4.0 BY-SA. When reprinting, please include the original source link and this statement.
Link to the original text: https://blog.csdn.net/qq_38406029/article/details/122851202