当前位置:网站首页>Wu Enda's machine learning mind mapping insists on clocking in for 23 days - building a knowledge context, reviewing, summarizing and replying
Wu Enda's machine learning mind mapping insists on clocking in for 23 days - building a knowledge context, reviewing, summarizing and replying
2022-07-02 20:04:00 【AXYZdong】
Author:AXYZdong Automation Engineering Male
A little bit of thinking , Have a little idea , A little bit rational !Set a small goal , Try to be a habit ! Meet a better self in the most beautiful years !
[email protected],CSDN First episode ,AXYZdong original
The only blog update address is : AXYZdong The blog of
B The homepage of the website is :AXYZdong My personal homepage
List of articles
- 0. Preface
- 1. Instructions for using mind map
- 2. The main content of mind mapping
- 3. Mind map text
- 0. introduction (Introduction)
- 1. Univariate linear regression (Linear Regression with One Variable)
- 2. Multivariate linear regression (Linear Regression with Multiple Variables)
- 3. Logical regression (Logistic Regression)
- 4. Regularization (Regularization)
- 5. neural network : describe (Neural Networks:Representation)
- 6. neural network : Study (Neural Networks:Learning)
- 7. Suggestions for applied machine learning (Advice for Applying Machine Learning)
- 8. The design of machine learning system (Machine Learning System Design)
- 9. Support vector machine (Support Vector Machines)
- 10. clustering (Clustering)
- 11. Dimension reduction (Dimensionality)
- 12. Anomaly detection (Anomaly Detection)
- 13. Recommendation system (Recommender Systems)
- 14. Large scale machine learning (Large Scale Machine Learning)
- 15. Application example : Picture text recognition (Application Example: Photo OCR)
- 16. summary (Conclusion)
- 4. About in the title “ Keep punching 23 God ”
- 5. reference
0. Preface
Machine learning is one of the most exciting directions in information technology . This paper takes teacher Wu Enda's machine learning course as the main line , Use Process On Online drawing constructs the mind map of machine learning .
1. Instructions for using mind map
Cooperate with teacher Wu Enda's machine learning video , Build knowledge context , Review and summarize the reply .
Browse all mind maps online : Wu Enda machine learning - Mind Mapping ProcessOn
Students who need to browse the full picture pay attention to AXYZdong official account , reply machine learning Get the password !
2. The main content of mind mapping
introduction (Introduction)
Supervised learning part :
- Univariate linear regression (Linear Regression with One Variable)
- Multivariate linear regression (Linear Regression with Multiple Variables)
- Logical regression (Logistic Regression)
- Regularization (Regularization)
- neural network : describe (Neural Networks:Representation)
- neural network : Study (Neural Networks:Learning)
- Support vector machine (Support Vector Machines)
Unsupervised learning part :
- clustering (Clustering)
- Dimension reduction (Dimensionality)
- Anomaly detection (Anomaly Detection)
Special applications :
- Recommendation system (Recommender Systems)
- Large scale machine learning (Large Scale Machine Learning)
Suggestions on establishing machine learning system :
- Suggestions for applied machine learning (Advice for Applying Machine Learning)
- The design of machine learning system (Machine Learning System Design)
- Application example : Picture text recognition (Application Example: Photo OCR)
3. Mind map text
0. introduction (Introduction)
The introduction mainly introduces the definition of machine learning 、 Related algorithms of machine learning 、 Supervised learning and unsupervised learning .
There is no uniform definition of machine learning , The following two are the two scholars' understanding of machine learning mentioned in the video .
- Arthur Samuel (1959). Machine Learning: Field of study that gives computers the ability to learn without being explicitly programmed.
- Tom Mitchell (1998). Well-posed Learning Problem: A computer program is said to learn from experience E with respect to some task Tand some performance measure P, if its performance on T, as measured by P, improveswith experience E.

▲ Part0 Introduction1. Univariate linear regression (Linear Regression with One Variable)
The main content of this part includes the model representation of univariate linear regression 、 Cost function 、 Gradient descent method and using gradient descent method to solve the minimum value of cost function .

▲ Part1 Linear Regression with One Variable2. Multivariate linear regression (Linear Regression with Multiple Variables)
Multivariable linear regression is equivalent to the expansion of univariate , Mainly according to the model assumptions 、 The idea of constructing cost function and studying the minimum value of cost function .
Unlike univariate linear regression , Multivariate linear regression may also involve the problem of feature scaling , The main reason is that there are characteristic variables with different scales , In order to make the gradient descent converge quickly , These characteristic variables need to be unified ( Similar to the idea of normalization )
Compared with univariate linear regression , Multivariable linear regression is used to solve the characteristic equation of cost function , In addition to the gradient descent method , You can also use regular equations . According to the number of characteristic variables , Choose these two methods flexibly .

▲ Part2 Linear Regression with Multiple Variables3. Logical regression (Logistic Regression)
there “ Return to ” Different from linear regression , It's a customary name . Its essence is classification , The variables to be predicted are discrete values .

▲ Part3 Logistic Regression4. Regularization (Regularization)
Regularization (Regularization) The proposed , It mainly solves the problem of fitting (over-fitting) The problem of . Including the regularization of linear regression and the regularization of logical regression , Its essence is to preserve all features by adding regularization terms , At the same time, reduce the parameters ( Coefficient before characteristic variable ) Size .
One hypothesis can be better fitted to the training data than others , But it can't fit the data well on the data set other than the training data , At this time, we think that there is a phenomenon of over fitting in this hypothesis . The main reasons for this are noise or too little training data .

▲ Part4 Regularization5. neural network : describe (Neural Networks:Representation)
neural network (Neural Networks) A brief statement of , Involving nonlinear assumptions 、 Model representation of neural networks 、 Intuitive understanding of neural networks and multiple classification .
When there are too many features , Ordinary logistic regression model , Can't handle so many features effectively , Now we need neural networks .

▲ Part5 Neural Networks:Representation6. neural network : Study (Neural Networks:Learning)
neural network (Neural Networks) The cost function of , Gradient descent seeks the minimum value of the cost function , Using back propagation algorithm (Backpropagation Algorithm) Calculate the direction of gradient descent .
Numerical tests using gradients (Numerical Gradient Checking) Method , The cost of prevention seems to be decreasing , But the final result may not be the problem of optimal solution .
If you let the initial parameters be 0, Then the activation unit of the second layer will have the same value . Therefore, you need to initialize the parameters , The method of random initialization is adopted ,Python The code is as follows :
Theta1 = rand(10,11) * (2*eps) - eps

▲ Part6 Neural Networks:Learning Learning7. Suggestions for applied machine learning (Advice for Applying Machine Learning)
When using the trained model to predict unknown data, it is found that there is a large error , What to do next ? Use the diagnostic method to judge which methods are effective for our algorithm .
The training set and test set are used to evaluate whether the hypothesis function is over fitted , The parameters obtained by minimizing the cost function of the training set are substituted into the cost function of the test set .
Cross validation sets to help select models . Diagnostic bias and variance , The performance of the algorithm is not ideal , Or the deviation is relatively large , Or the variance is bigger . let me put it another way , What happens is either an under fit , It's either an over fitting problem .
The learning curve takes the training set error and cross validation set error as the number of training set instances (m) The graph drawn by the function of .

▲ Part7 Advice for Applying Machine Learning
8. The design of machine learning system (Machine Learning System Design)
The main content of this part is error analysis 、 The error measure of class skew 、 The trade-off between precision and recall and machine learning data .

▲ Part8 Machine Learning System Design9. Support vector machine (Support Vector Machines)
Support vector machine (Support Vector Machines) In essence, it is to optimize the objective function in logistic regression , Will contain log Item usage cost Function instead of .
Support vector machine uses a maximum spacing to separate samples , Robust , It is sometimes called a large spacing classifier .
Kernel function (Kernel) Introduce support vector machine SVM in , Instead of the corresponding high-dimensional vector inner product .

▲ Part9 Support Vector Machines10. clustering (Clustering)
clustering (Clustering) One kind of unsupervised learning .
Key algorithms :K- Mean algorithm .K-Means Is the most popular clustering algorithm , The algorithm accepts an unmarked data set , Then clustering the data into different groups .

▲ Part10 Clustering11. Dimension reduction (Dimensionality)
Dimension reduction (Dimensionality) It is mainly used for data compression and data visualization , It is also a kind of unsupervised learning .
Important algorithm : Principal component analysis PAC(Principal Component Analysis) Algorithm .

▲ Part11 Dimensionality12. Anomaly detection (Anomaly Detection)
This part mainly includes Gaussian distribution (Gaussian Distribution), Gaussian algorithm is used for anomaly detection , Feature transformation transforms the original data into Gaussian distribution .
Important algorithm : gaussian (Gaussian ) Algorithm .

▲ Part12 Anomaly Detection13. Recommendation system (Recommender Systems)
This part includes : Content based recommendation system 、 Collaborative filtering (Collaborative Filtering)、 Vectorization : Low rank matrix decomposition 、 Implementation details : Mean normalization .
Important algorithm : Collaborative filtering (Collaborative Filtering) Algorithm .

▲ Part13 Recommender Systems14. Large scale machine learning (Large Scale Machine Learning)
primary coverage : Random gradient descent method (Stochastic Gradient Descent)、 Small batch gradient descent (Mini-Batch Gradient Descent) 、 Convergence of stochastic gradient descent algorithm 、 Online learning (Online Learning) and Mapping simplification and data parallelism (Map Reduce and Data Parallelism).

▲ Part14 Large Scale Machine Learning15. Application example : Picture text recognition (Application Example: Photo OCR)
Focus on Steps of image and character recognition and The sliding window (Sliding Windows) Use .
▲ Part15 Application Example: Photo OCR16. summary (Conclusion)

▲ Part16 Conclusion4. About in the title “ Keep punching 23 God ”
Blink Keep punching 23 God

▲ Blink Clock in 23 God 5. reference
[1]:[ Chinese and English subtitles ] Wu Enda machine learning series
[2]:fengdu78, Coursera-ML-AndrewNg-Notes, (2018), GitHub repository, https://github.com/fengdu78/Coursera-ML-AndrewNg-Notes
This sharing is here

If my blog helps you 、 If you like my blog content , please “ give the thumbs-up ” “ Collection ” “ Focus on ” One button, three links !
For more, please go to AXYZdong The blog of
If any of the above is inaccurate , Welcome below Leave a message . Or you have a better idea , Welcome to exchange and study together ~~~
边栏推荐
- C language linked list -- to be added
- Google Earth engine (GEE) - Landsat 9 image full band image download (Beijing as an example)
- Common problems and description of kt148a voice chip IC development
- After 65 days of closure and control of the epidemic, my home office experience sharing | community essay solicitation
- Postman interface test practice, these five questions you must know
- API文档工具knife4j使用详解
- 【Hot100】21. 合并两个有序链表
- 八年测开经验,面试28K公司后,吐血整理出高频面试题和答案
- Correspondence between pytoch version, CUDA version and graphics card driver version
- From 20s to 500ms, I used these three methods
猜你喜欢

SQLite 3.39.0 release supports right external connection and all external connection

测试人员如何做不漏测?这7点就够了

为什么我对流程情有独钟?

Istio1.12: installation and quick start

Second hand housing data analysis and prediction system

通信人的经典语录,第一条就扎心了……

After eight years of test experience and interview with 28K company, hematemesis sorted out high-frequency interview questions and answers

RPD出品:Superpower Squad 保姆级攻略

有时候只查询一行语句,执行也慢

CS5268完美代替AG9321MCQ Typec多合一扩展坞方案
随机推荐
Refactoring: improving the design of existing code (Part 2)
Shardingsphere jdbc5.1.2 about select last_ INSERT_ ID () I found that there was still a routing problem
Infix expression is converted to suffix expression (C language code + detailed explanation)
Cron expression (seven subexpressions)
Motivation! Big Liangshan boy a remporté le prix Zhibo! Un article de remerciement pour les internautes qui pleurent
基于SSM实现网上购物商城系统
Outsourcing for three years, abandoned
疫情封控65天,我的居家办公心得分享 | 社区征文
[source code analysis] model parallel distributed training Megatron (5) -- pipestream flush
Kt148a voice chip IC user end self replacement voice method, upper computer
Start practicing calligraphy
KT148A语音芯片ic的硬件设计注意事项
【871. 最低加油次数】
[JS] get the search parameters of URL in hash mode
Solution to blue screen after installing TIA botu V17 in notebook
How to avoid duplicate data in gaobingfa?
功能、作用、效能、功用、效用、功效
Share several map bed websites for everyone to share pictures
自动化制作视频
蓝牙芯片ble是什么,以及该如何选型,后续技术发展的路径是什么