当前位置:网站首页>MIT doctoral dissertation optimization theory and machine learning practice
MIT doctoral dissertation optimization theory and machine learning practice
2022-06-30 22:37:00 【Zhiyuan community】

Machine learning is a method of extracting prediction models from data , Thus, the prediction can be generalized to the technology of unobserved data . The process of selecting a good model based on known data sets needs to be optimized . To be specific , The optimization process generates a variable in the constraint set to minimize the goal . This process includes many machine learning channels including neural network training , This will be the main testing ground for our theoretical analysis in this paper . In all kinds of optimization algorithms , Gradient method has become the dominant algorithm in deep learning because of its high dimensional scalability and the natural limitations of back propagation . However , Although gradient based algorithms are popular , But our theoretical understanding of this algorithm in machine learning environment seems to be far from enough . One side , Within the existing theoretical framework , Most of the upper and lower bounds are closed , The theoretical problem seems to have been solved . On the other hand , It is difficult for theoretical analysis to produce faster algorithms than the experience found by practitioners . This paper reviews the theoretical analysis of gradient method , It points out the difference between theory and practice . then , We explained why the mismatch occurred , And through the development of theoretical analysis driven by empirical observation , Some initial solutions are proposed .
Thesis link :https://dspace.mit.edu/bitstream/handle/1721.1/143318/Zhang-jzhzhang-PhD-EECS-2022.pdf?sequence=1&isAllowed=y

边栏推荐
- Starting from pg15 xid64 ticket skipping again
- Neo4j load CSV configuration and use
- Where can I find the computer version of wechat files
- [Android, kotlin, tflite] mobile device integration depth learning light model tflite (image classification)
- B_ QuRT_ User_ Guide(32)
- What are database OLAP and OLTP? Same and different? Applicable scenarios
- 手机上怎么开股票账户?另外,手机开户安全么?
- 将Nagios监控信息存入MySQL
- 远程办公期间,项目小组微信群打卡 | 社区征文
- 企业出海数字化转型解决方案介绍
猜你喜欢
![[450. delete nodes in binary search tree]](/img/fd/bab2f92edeadd16263f15de6cc4420.png)
[450. delete nodes in binary search tree]

MIT博士论文 | 优化理论与机器学习实践

Redis' transaction and locking mechanism

远程办公期间,项目小组微信群打卡 | 社区征文
![Flip the linked list ii[three ways to flip the linked list +dummyhead/ head insertion / tail insertion]](/img/a8/6472e2051a295f5e42a88d64199517.png)
Flip the linked list ii[three ways to flip the linked list +dummyhead/ head insertion / tail insertion]

Braces on the left of latex braces in latex multiline formula

Where can I find the computer device manager

What is the experience of pairing with AI? Pilot vs alphacode, Codex, gpt-3

Redis' cache penetration, cache breakdown and cache avalanche

ESP8266 成为客户端和服务器
随机推荐
Two dots on the top of the latex letter
B_ QuRT_ User_ Guide(33)
Failed to configure a DataSource: ‘url‘ attribute is not specified and no embedded datasource could
分享十万级TPS的IM即时通讯综合消息系统的架构
Neo4j load CSV configuration and use
Deployment of microservices based on kubernetes platform
How to judge whether the JS object is empty
latex字母头顶两个点
Cas classique multithreadé
Redis的事务和锁机制
Redis - 01 缓存:如何利用读缓存提高系统性能?
How to design test cases
【Android,Kotlin,TFLite】移动设备集成深度学习轻模型TFlite(物体检测篇)
Niubi | the tools I have treasured for many years have made me free to fish with pay
B_ QuRT_ User_ Guide(31)
软件确认测试的内容和流程有哪些?确认测试报告需要多少钱?
Is it difficult to get a certified equipment supervisor? What is the relationship with the supervising engineer?
将Nagios监控信息存入MySQL
软件测试报告包含哪些内容?如何获取高质量软件测试报告?
[450. delete nodes in binary search tree]