当前位置:网站首页>The video of machine learning to learn [update]
The video of machine learning to learn [update]
2022-08-04 03:36:00 【terrific51】
Video Learning for Machine Learning
I. Overview of Machine Learning
1. What is machine learning
Machine learning is a computer program that learns from experience and gets better at a specific task.
Most important for machine learning:
- Data: Experience is ultimately transformed into data that the computer can understand, so that the computer can learn from the experience.
- Model: The algorithm.Once you have the data, you can design a model and use the data as input to train the model.The trained model eventually becomes the core of machine learning, making the model the hub that can generate decisions.
2. Supervised vs Unsupervised Learning
(1) Supervised Learning
Supervised learning (Supervised learning) allows the computer to learn the rules from a large amount of known input and output paired data, so that it can make a reasonable output prediction for a new input.
- House price prediction (regression problem)
Shown below is an example of a house price forecast.It is an example of supervised learning.
As in this housing price prediction example, supervised learning provides the algorithm with a data set (which contains the correct answers), that is, we give it a housing price data set, and each sample in this data set corresponds to a correct answer(i.e. the actual selling price of the house).The purpose of the algorithm is to give more correct answers.
The prediction of house price is a regression problem (regression), because house price is a real, continuous value.
- Malignant/benign cancer (classification problem)
Malignant/benign cancer (ie 0/1) is a classification problem because malignant/benign (ie 0/1) are discrete values.
(2) Unsupervised Learning
Unsupervised learning (Unsupervised learning) analyzes the inherent characteristics and structure of the data itself by learning a large amount of unlabeled data.
Unsupervised learning requires letting the algorithm discover everything from the data on its own.One of the common algorithms is clustering: using algorithms to group together news stories, market segmentation, etc.
3. Model description
Linear Regression Model
- Cost function
Squared error cost function
Model, parameters, cost function, objective (find a value of w, b that minimizes J(w,b))
when b=0
By simplifying the model, our goal is to find wA value that minimizes J(w)
When w=1, J(w)=0 is calculated:When w=0.5, J(w)=0.58 is calculated:
Take different values of w, calculate J(w), and draw a graph as shown:
when b!=0
House price forecast
4. Gradient descent algorithm
where a is the learning rate.If the learning rate is too small, then large drops are feasible, but will be slow.This will take a long time because you will be taking very small steps, many steps before it gets close to the minimum.But if the learning rate is too large, the steps will be very large, and it is likely that the minimum value will be skipped due to the large step size, and the minimum value will never be reached.
As shown in the picture:
If the parameter is reachedlocal minima, then further binning reduces steps to none at all.It doesn't change the process precisely because it keeps the solution at a local minimum.
As we approach a local minimum, the rank drop will beSmaller steps are automatically taken because the derivative automatically gets smaller as we approach a local minimum, which means that the steps also get smaller automatically, even if a remains at some fixed value.
Derivation:
边栏推荐
- pnpm 是凭什么对 npm 和 yarn 降维打击的
- Introduction to the memory model of the JVM
- 高效IO模型
- 安装postgis时报找不到“POSTGIS_VERSION”这个函数
- [Study Notes Dish Dog Learning C] Dynamic Memory Management
- if,case,for,while
- Mockito unit testing
- SSLHandshakeException: No appropriate protocol (protocol is disabled or cipher suites are inappropri
- [Medical Insurance Science] To maintain the safety of medical insurance funds, we can do this
- Polygon zkEVM network node
猜你喜欢
MySQL query optimization and tuning
从图文展示到以云为核,第五代验证码独有的策略情报能力
【观察】超聚变:首提“算网九阶”评估模型,共建开放繁荣的算力网络
仿牛客论坛项目梳理
This Thursday evening at 19:00, the fourth live broadcast of knowledge empowerment丨The realization of equipment control of OpenHarmony smart home project
基地址:环境变量
pnpm 是凭什么对 npm 和 yarn 降维打击的
Y86. Chapter iv Prometheus giant monitoring system and the actual combat, Prometheus storage (17)
LeetCode每日一题(2285. Maximum Total Importance of Roads)
数据安全峰会2022 | 美创DSM获颁“数据安全产品能力验证计划”评测证书
随机推荐
6口全千兆二层网管型工业以太网交换机千兆2光4电光纤自愈ERPS环网交换机
How to automatically export or capture abnormal login ip and logs in elastic to the database?
【医保科普】维护医保基金安全,我们可以这样做
FFmpeg —— 通过修改yuv,将视频转为黑白并输出(附源码)
Mockito单元测试
基地址:环境变量
Returns the maximum number of palindromes in a string
【MD5】采用MD5+盐的加密方式完成注册用户和登录账号
内网服务器访问远程服务器的端口映射
sqoop ETL tool
Pine Script | How to display and typeset a plot switch?
Metaverse "Drummer" Unity: Crazy expansion, suspense still exists
2022年最新海南建筑八大员(材料员)模拟考试试题及答案
C language -- ring buffer
元宇宙“吹鼓手”Unity:疯狂扩局,悬念犹存
Power button (LeetCode) 215. The first K largest elements in the array (2022.08.03)
MRS: Introduction to the use of Alluxio
docker+bridge+redis master-slave+sentry mode
复制带随机指针的链表
This Thursday evening at 19:00, the fourth live broadcast of knowledge empowerment丨The realization of equipment control of OpenHarmony smart home project