当前位置:网站首页>MIT doctoral dissertation optimization theory and machine learning practice
MIT doctoral dissertation optimization theory and machine learning practice
2022-06-30 22:37:00 【Zhiyuan community】

Machine learning is a method of extracting prediction models from data , Thus, the prediction can be generalized to the technology of unobserved data . The process of selecting a good model based on known data sets needs to be optimized . To be specific , The optimization process generates a variable in the constraint set to minimize the goal . This process includes many machine learning channels including neural network training , This will be the main testing ground for our theoretical analysis in this paper . In all kinds of optimization algorithms , Gradient method has become the dominant algorithm in deep learning because of its high dimensional scalability and the natural limitations of back propagation . However , Although gradient based algorithms are popular , But our theoretical understanding of this algorithm in machine learning environment seems to be far from enough . One side , Within the existing theoretical framework , Most of the upper and lower bounds are closed , The theoretical problem seems to have been solved . On the other hand , It is difficult for theoretical analysis to produce faster algorithms than the experience found by practitioners . This paper reviews the theoretical analysis of gradient method , It points out the difference between theory and practice . then , We explained why the mismatch occurred , And through the development of theoretical analysis driven by empirical observation , Some initial solutions are proposed .
Thesis link :https://dspace.mit.edu/bitstream/handle/1721.1/143318/Zhang-jzhzhang-PhD-EECS-2022.pdf?sequence=1&isAllowed=y

边栏推荐
- Online education program user login and registration
- 将Nagios监控信息存入MySQL
- [Android, kotlin, tflite] mobile device integration deep learning light model tflite (object detection)
- KVM IO performance test data
- 交易所系统开发如何开发?数字货币交易所系统开发成熟技术案例
- Where can I find the computer device manager
- Using Obsidian with Hugo, markdown's local editing software is seamlessly connected with online
- 微信小程序中的数据双向绑定
- Mysql:sql overview and database system introduction | dark horse programmer
- Is the stock account opened in qiniu safe? How to apply for a low commission stock account?
猜你喜欢

Some memory problems summarized

Redis' cache penetration, cache breakdown and cache avalanche

Tencent has been conducting advanced automated functional testing for 3 years. It is a gift to you who are confused in manual testing

Ten of the most heart piercing tests / programmer jokes, read the vast crowd, how to find?

Starting from pg15 xid64 ticket skipping again

电脑设备管理器在哪里可以找到
![[无线通信基础-13]:图解移动通信技术与应用发展-1-概述](/img/1d/62e55f1b5445d7349ec383879f4275.png)
[无线通信基础-13]:图解移动通信技术与应用发展-1-概述

理想中的接口自动化项目

How to realize the center progress bar in wechat applet

零样本和少样本学习
随机推荐
基于kubernetes平台微服务的部署
[micro service ~nacos] configuration center of Nacos
In depth analysis of Apache bookkeeper series: Part 4 - back pressure
Femas:云原生多运行时微服务框架
软件测试报告包含哪些内容?如何获取高质量软件测试报告?
Nansen double disk encryption giant self rescue: how to prevent the collapse of billions of dominoes
What is the experience of pairing with AI? Pilot vs alphacode, Codex, gpt-3
图纸加密如何保障我们的核心图纸安全
vncserver: Failed command ‘/etc/X11/Xvnc-session‘: 256!
严格次小生成树
dba
MIT博士论文 | 优化理论与机器学习实践
[Android, kotlin, tflite] mobile device integration deep learning light model tflite (object detection)
Jmeter跨线程参数关联无需脚本
How to develop the exchange system? Mature technology case of digital currency exchange system development
Using Obsidian with Hugo, markdown's local editing software is seamlessly connected with online
Doker's container data volume
Win11电脑名如何更改?Win11更改电脑名的方法
微信小程序通过点击事件传参(data-)
100 important knowledge points that SQL must master: creating and manipulating tables