当前位置:网站首页>Doctoral Dissertation of the University of Toronto - training efficiency and robustness in deep learning
Doctoral Dissertation of the University of Toronto - training efficiency and robustness in deep learning
2022-06-27 19:49:00 【Zhiyuan community】

Thesis link :https://arxiv.org/abs/2112.01423
The training efficiency of degree learning model is low ; They learn by processing millions of training data many times , And it needs powerful computing resources to process a large amount of data in parallel at the same time , Not sequential processing . The deep learning model also has unexpected failure modes ; They may be fooled , Make a wrong prediction .
In this paper , We study the methods to improve the training efficiency and robustness of the deep learning model . In the context of learning visual semantic embedding , We find that learning more information training data first can improve the convergence speed and the generalization performance of test data . We formalize a simple technique , It is called hard negative mining , Modification of the learning objective function , No computational overhead . Next , We seek to improve the optimization speed in the general optimization method of deep learning . We show that the redundant perceptual modification of training data sampling improves the training speed , An effective method for detecting the diversity of training signals is developed , Gradient clustering . Last , We study the antagonism robustness in deep learning , And the method of realizing the maximum antagonism robustness without using additional data training . For the linear model , We prove that the maximum robustness is guaranteed only by properly selecting the optimizer , Regularization , Or architecture .
边栏推荐
猜你喜欢

Core dynamic Lianke rushes to the scientific innovation board: with an annual revenue of 170million yuan, Beifang Electronics Institute and Zhongcheng venture capital are shareholders

New Zhongda chongci scientific and Technological Innovation Board: annual revenue of 284million and proposed fund-raising of 557million

指针和结构体

【登录界面】

What is ICMP? What is the relationship between Ping and ICMP?

多伦多大学博士论文 | 深度学习中的训练效率和鲁棒性

Function key input experiment based on stm32f103zet6 Library

External interrupt experiment based on stm32f103zet6 library function

华大单片机KEIL报错_WEAK的解决方案

Running lantern experiment based on stm32f103zet6 library function
随机推荐
什么是SSR/SSG/ISR?如何在AWS上托管它们?
使用logrotate对宝塔的网站日志进行自动切割
Cucumber自动化测试框架使用
Photoshop layer related concepts layercomp layers move rotate duplicate layer compound layer
shell脚本常用命令(三)
可靠的分布式锁 RedLock 与 redisson 的实现
Informatics Olympiad 1333: [example 2-2] blah data set | openjudge noi 3.4 2729:blah data set
429-二叉树(108. 将有序数组转换为二叉搜索树、538. 把二叉搜索树转换为累加树、 106.从中序与后序遍历序列构造二叉树、235. 二叉搜索树的最近公共祖先)
判断一个变量是数组还是对象?
labelimg使用指南
金鱼哥RHCA回忆录:DO447管理项目和开展作业--创建作业模板并启动作业
信息学奥赛一本通 1333:【例2-2】Blah数集 | OpenJudge NOI 3.4 2729:Blah数集
带你认识图数据库性能和场景测试利器LDBC SNB
Comment encapsuler un appel à une bibliothèque
聊聊毕业季
binder hwbinder vndbinder
Redis 原理 - String
Memoirs of actual combat: breaking the border from webshell
CMS 执行的七个阶段
DCC888 :Register Allocation