General process of machine learning training and parameter optimization (discussion)
2022-07-06 02:13:00 【Min fan】
Abstract: In practical machine learning applications, we need not only to train a model but also to control its input parameters. This post describes the general process and is for reference only.
1. Training machine learning models
For an input with $m$ features and a decision indicator as output, a machine learning model can be written as
$$f: \mathbb{R}^m \to \mathbb{R} \tag{1}$$
where $\mathbb{R}$ is the set of real numbers. If different features have their own value ranges, the machine learning model can be expressed as
$$f: \prod_{i=1}^m \mathbf{V}_i \to \mathbb{R} \tag{2}$$
where $\mathbf{V}_i$ is the value range of the $i$-th feature.
For simplicity, only the model corresponding to formula (1) is discussed below.
Given a feature matrix of $n$ instances $\mathbf{X} = [\mathbf{x}_1, \dots, \mathbf{x}_n]^{\mathrm{T}} \in \mathbb{R}^{n \times m}$ and the corresponding label vector $\mathbf{Y} \in \mathbb{R}^n$, the optimization objective of machine learning can generally be expressed as
$$\min_f \mathcal{L}(f(\mathbf{X}), \mathbf{Y}) + R(f) \tag{3}$$
where $f(\mathbf{X}) = [f(\mathbf{x}_1), \dots, f(\mathbf{x}_n)]$ is the vector of predicted labels, $\mathcal{L}$ is the loss function, and $R(f)$ is the regularization term on the parameters of $f$. If the objective is convex, gradient descent can be used to efficiently find the optimal solution. For the regularization term:
- If $f$ is a linear model, the regularizer can be the $\ell_1$ norm, the $\ell_2$ norm, the nuclear norm, and so on. Its purpose is to prevent overfitting.
- If $f$ is a neural network, techniques such as dropout can be used to prevent overfitting.
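As a rough illustration of objective (3), the sketch below runs plain gradient descent for a linear model. The squared loss, the $\ell_2$ regularizer, the missing bias term, the toy data, and the values of `lam`, `lr`, and `steps` are all assumptions chosen for the example and do not come from the discussion above.

```python
# Minimal sketch: gradient descent on an L2-regularized squared loss
# for a linear model f(x) = w . x (no bias term, for brevity).
# The data, learning rate, and lam below are illustrative assumptions.

def predict(w, x):
    """Linear model f(x) = sum_j w[j] * x[j]."""
    return sum(wj * xj for wj, xj in zip(w, x))

def objective(w, X, Y, lam):
    """Mean squared loss L(f(X), Y) plus the regularizer R(f) = lam * ||w||_2^2."""
    n = len(X)
    loss = sum((predict(w, x) - y) ** 2 for x, y in zip(X, Y)) / n
    return loss + lam * sum(wj * wj for wj in w)

def gradient(w, X, Y, lam):
    """Gradient of `objective` with respect to w."""
    n, m = len(X), len(w)
    grad = [2.0 * lam * wj for wj in w]       # gradient of the regularizer
    for x, y in zip(X, Y):
        err = predict(w, x) - y
        for j in range(m):
            grad[j] += 2.0 * err * x[j] / n   # gradient of the mean squared loss
    return grad

def train(X, Y, lam=0.01, lr=0.05, steps=2000):
    """Full-batch gradient descent starting from w = 0."""
    w = [0.0] * len(X[0])
    for _ in range(steps):
        g = gradient(w, X, Y, lam)
        w = [wj - lr * gj for wj, gj in zip(w, g)]
    return w

# Toy data generated by y = 2*x1 - x2 (n = 4 instances, m = 2 features).
X = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [2.0, 1.0]]
Y = [2.0, -1.0, 1.0, 3.0]
w = train(X, Y)
print(w, objective(w, X, Y, 0.01))
```

Because this objective is convex, the iterates approach the unique minimizer. The learned weights end up close to, but slightly shrunk from, the generating weights, which is the effect of the regularizer.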
2. Parameter optimization method
In some practical problems, some input features are objective (given) and some are controllable. Without loss of generality, let the first $m_1$ features be objective and the last $m_2$ features be controllable (so we also call them parameters), where $m_1 + m_2 = m$. Suppose a reliable machine learning model $f$ has been trained on a large amount of data, and we want to maximize the decision indicator. Given the objective feature vector $\mathbf{x}_b \in \mathbb{R}^{m_1}$, the objective of parameter optimization is
$$\argmax_{\mathbf{x}_u \in \mathbb{R}^{m_2}} f(\mathbf{x}_b \| \mathbf{x}_u) \tag{4}$$
where $\|$ denotes vector concatenation.
- If $f$ is concave in the controllable features (equivalently, $-f$ is convex), the optimal parameters can be obtained by gradient ascent on $f$, i.e., gradient descent on $-f$, and similar methods.
- If $f$ is not concave in the controllable features, bio-inspired algorithms can be used to optimize the parameters.
- If the controllable features take enumerable values and the cardinality of their domain is not large, the optimal parameters can be obtained directly by exhaustive search. Example: with 5 controllable features, each having 10 possible values, the optimal parameter vector is selected from $10^5$ parameter combinations, which takes only a few seconds to compute.
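The exhaustive case in the last bullet can be sketched as follows. The function `f` here is a hypothetical stand-in for a trained model (any callable from a feature vector to a real number would do), and the candidate grid, `x_b`, and the helper name `exhaustive_search` are assumptions made for illustration.

```python
import itertools

def f(x):
    """Hypothetical stand-in for a trained model: maps a feature list of
    length m = m_1 + m_2 to a real-valued decision indicator."""
    x_b, x_u = x[:2], x[2:]
    # Toy indicator: largest when every controllable feature is at `target`,
    # where `target` depends on the first objective feature.
    target = 0.3 + 0.1 * x_b[0]
    return sum(x_b) - sum((u - target) ** 2 for u in x_u)

def exhaustive_search(f, x_b, candidate_values):
    """Evaluate f(x_b || x_u) for every x_u in the Cartesian product of the
    candidate values and return the best (x_u, score) pair."""
    best_x_u, best_score = None, float("-inf")
    for x_u in itertools.product(*candidate_values):
        score = f(list(x_b) + list(x_u))   # concatenation x_b || x_u
        if score > best_score:
            best_x_u, best_score = x_u, score
    return best_x_u, best_score

x_b = [1.0, 2.0]                       # m_1 = 2 objective features (given)
grid = [i / 10 for i in range(10)]     # 10 candidate values: 0.0 .. 0.9
candidate_values = [grid] * 5          # m_2 = 5 controllable features
best_x_u, best_score = exhaustive_search(f, x_b, candidate_values)
print(best_x_u, best_score)
```

The loop visits all $10^5$ combinations once, so its cost is the number of combinations times the cost of one model evaluation. With a real trained model, batching the evaluations would likely be faster than calling it one vector at a time.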