当前位置:网站首页>Machine learning terminology
Machine learning terminology
2022-07-03 01:10:00 【The code family】
1) Model
The word model will run through the whole tutorial , It is the core concept of machine learning . You can think of it as a “ Magic box ”, You make a wish to it ( input data ), It will help you realize your wishes ( Output forecast results ). The whole process of machine learning will revolve around the model , Train the best “ Magic box ”, It can realize your promise as accurately as possible “ desire ”, This is the goal of machine learning .
2) Data sets
Data sets , It's easy to understand literally , It represents a collection of data , if “ Model ” yes “ Magic box ” Words , So the data set is responsible for charging it “ Energy battery ”, In short , If a dataset is missing , Then the model has no meaning to exist . The data set can be divided into “ Training set ” and “ Test set ”, They are in the of machine learning “ Training phase ” and “ Prediction output stage ” It plays an important role .
3) sample & features
Sample refers to the data in the data set , A piece of data is called “ A sample ”, Usually , The sample will contain multiple eigenvalues to describe the data , For example, there is now a set of data describing human form “180 70 25” If you just look at the data, you will be very confused , But with “ features ” After describing, it will become easy to understand , As shown below :
| height (cm) | weight (kg) | Age |
| 180 | 70 | 25 |
chart 1: sample & features
It can be seen from the above figure that the composition of the data set is “ One sample per line , A list of characteristics ”. Eigenvalues can also be understood as the correlation of data , The data of each column is related to the eigenvalue of this column .
4) vector
Any algorithm will involve many mathematical terms or formulas . Many mathematical formulas will also be involved in the process of writing this tutorial , And professional terminology , Here, let's briefly explain the common basic terms .
The first common term is “ vector ”, Vector is the key term of machine learning . Vectors are strictly defined in linear algebra . Vectors are also called Euclidean vectors 、 Geometric vectors 、 vector , A quantity with size and direction . You can visually understand it as a line segment with an arrow . The arrow points to : Represents the direction of the vector ; segment length : Represents the size of the vector . The quantity corresponding to a vector is called quantity ( It's called scalar in physics ), Quantity is only size , No direction . Every sample in the data set is a piece of data in vector form .
5) matrix
Matrix is also a commonly used mathematical term , You can think of a matrix as a two-dimensional array of vectors , Data sets store data in the form of two-dimensional matrices , You can think of it as a spreadsheet “ One sample per line , A list of characteristics ” The form of expression is as follows :

边栏推荐
- 安全运营四要素之资产、脆弱性、威胁和事件
- FPGA - 7系列 FPGA内部结构之Clocking -04- 多区域时钟
- Esp32 simple speed message test of ros2 (limit frequency)
- Several cases of recursive processing organization
- Machine learning: numpy version linear regression predicts Boston house prices
- Excel if formula determines whether the two columns are the same
- 1038 Recover the Smallest Number
- leetcode-849:到最近的人的最大距离
- 【FPGA教程案例6】基于vivado核的双口RAM设计与实现
- RISA rz/g2l processor introduction | frame diagram | power consumption | schematic diagram and hardware design guide
猜你喜欢

Strongly connected components of digraph

How to convert Quanzhi a40i/t3 to can through SPI

Lu Zhe, chief scientist of Shiping information: building data and personnel centered security capabilities

拥抱平台化交付的安全理念

数据分析思维分析犯法和业务知识——分析方法(一)

Correctly distinguish the similarities and differences among API, rest API, restful API and web service

信息熵的基础

How to systematically learn machine learning
![[C language] branch and loop statements (Part 1)](/img/47/6efcc59bd26e26f66c698635c26c8b.png)
[C language] branch and loop statements (Part 1)

电话网络问题
随机推荐
[AUTOSAR 11 communication related mechanism]
Leetcode-2115: find all the dishes that can be made from the given raw materials
Leetcode-871: minimum refueling times
leetcode-241:为运算表达式设计优先级
excel去除小数点后面的数据,将数字取整
[case sharing] let the development of education in the new era advance with "number"
有向图的强连通分量
465. 最优账单平衡 DFS 回溯
Machine learning: numpy version linear regression predicts Boston house prices
Rk3568 development board evaluation (II): development environment construction
Infrared thermography temperature detection system based on arm rk3568
What is needed to develop a domestic arm intelligent edge computing gateway
Compare version number
【无标题】
Every k nodes in the linked list are flipped
按键精灵打怪学习-多线程后台坐标识别
Key wizard play strange learning - front desk and Intranet send background verification code
[AUTOSAR twelve mode management]
R language ggplot2 visualization: use ggplot2 to display dataframe data that are all classified variables in the form of thermal diagram, and customize the legend color legend of factor
Leetcode-224: basic calculator