当前位置:网站首页>MIT doctoral dissertation optimization theory and machine learning practice
MIT doctoral dissertation optimization theory and machine learning practice
2022-06-30 22:37:00 【Zhiyuan community】

Machine learning is a method of extracting prediction models from data , Thus, the prediction can be generalized to the technology of unobserved data . The process of selecting a good model based on known data sets needs to be optimized . To be specific , The optimization process generates a variable in the constraint set to minimize the goal . This process includes many machine learning channels including neural network training , This will be the main testing ground for our theoretical analysis in this paper . In all kinds of optimization algorithms , Gradient method has become the dominant algorithm in deep learning because of its high dimensional scalability and the natural limitations of back propagation . However , Although gradient based algorithms are popular , But our theoretical understanding of this algorithm in machine learning environment seems to be far from enough . One side , Within the existing theoretical framework , Most of the upper and lower bounds are closed , The theoretical problem seems to have been solved . On the other hand , It is difficult for theoretical analysis to produce faster algorithms than the experience found by practitioners . This paper reviews the theoretical analysis of gradient method , It points out the difference between theory and practice . then , We explained why the mismatch occurred , And through the development of theoretical analysis driven by empirical observation , Some initial solutions are proposed .
Thesis link :https://dspace.mit.edu/bitstream/handle/1721.1/143318/Zhang-jzhzhang-PhD-EECS-2022.pdf?sequence=1&isAllowed=y

边栏推荐
- 电脑版微信文件存储在哪个文件夹可以找到
- 去中心化交易所系统开发技术原理丨数字货币去中心化交易所系统开发(说明案例)
- Failed to configure a DataSource: ‘url‘ attribute is not specified and no embedded datasource could
- During telecommuting, the project team punched in the wechat group | solicited papers from the community
- ESP8266 成为客户端和服务器
- "Team training competition" Shandong multi university training 3
- 公有云市场迈入深水区,冷静的亚马逊云还坐得住吗?
- KubeVela 1.4:让应用交付更安全、上手更简单、过程更透明
- Femas:云原生多运行时微服务框架
- Youfu network hybrid cloud accelerates enterprise digital transformation and upgrading
猜你喜欢

Win11如何优化服务?Win11优化服务的方法

KubeVela 1.4:让应用交付更安全、上手更简单、过程更透明

How does win11 optimize services? Win11 method of optimizing service

ESP8266 成为客户端和服务器

Tencent has been conducting advanced automated functional testing for 3 years. It is a gift to you who are confused in manual testing

How to upload binary pictures in uniapp

多線程經典案例

Nansen double disk encryption giant self rescue: how to prevent the collapse of billions of dominoes

Redis的缓存穿透、缓存击穿和缓存雪崩

Graduation project
随机推荐
pytorch 的Conv2d的详细解释
Apache server OpenSSL upgrade
latex左侧大括号 latex中大括号多行公式
[Android, kotlin, tflite] mobile device integration depth learning light model tflite (image classification)
How to use filters in jfinal to monitor Druid for SQL execution?
Architecture of IM integrated messaging system sharing 100000 TPS
KVM IO性能测试数据
图纸加密如何保障我们的核心图纸安全
Discuz forum speed up to delete XXX under data/log PHP file
Go language learning notes - Gorm usage - database configuration, table addition | web framework gin (VII)
In depth analysis of Apache bookkeeper series: Part 4 - back pressure
What does the &?
Cloud games | cloud computing drives the game industry into a "new era"
Is it difficult to get a certified equipment supervisor? What is the relationship with the supervising engineer?
KVM IO performance test data
Redis的事务和锁机制
How does win11 optimize services? Win11 method of optimizing service
在线客服聊天系统源码_美观强大golang内核开发_二进制运行傻瓜式安装_附搭建教程...
Uniapp third party network request
总结的一些内存问题