
Support vector machine for machine learning

2022-07-03 06:10:00 【Master core technology】

1. Concept of the support vector machine

A support vector machine (SVM) is a generalized linear classifier that performs binary classification under supervised learning. Its decision boundary is the maximum-margin hyperplane fitted to the training samples.
SVM has three key ingredients: margin, duality, and the kernel trick.
There are three kinds of SVM: hard-margin SVM, soft-margin SVM, and kernel SVM.
Given a training set $D=\{(x_1,y_1),(x_2,y_2),...,(x_m,y_m)\},y_i\in\{-1,+1\}$, the goal is to find a hyperplane in the sample space that separates the samples of different classes and whose classification result is the most robust; see the thick line in the figure below.
[Figure: the separating hyperplane, drawn as a thick line]

The separating hyperplane can be described by the following linear equation:
$w^Tx+b=0$
where $w=(w_1;w_2;...;w_d)$ is the normal vector, which determines the direction of the hyperplane, and $b$ is the displacement term, which determines the distance between the hyperplane and the origin. The separating hyperplane is therefore fully determined by the normal vector $w$ and the displacement $b$.
The support vector machine $f(x)=sign(w^Tx+b)$ is a classical discriminative model.
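As a minimal sketch of the decision function $f(x)=sign(w^Tx+b)$, the snippet below evaluates it on plain Python lists. The function names, the example hyperplane, and the choice to assign points lying exactly on the hyperplane to the class +1 are assumptions made for illustration only.

```python
def decision_value(w, b, x):
    """Return w^T x + b for plain Python sequences w and x."""
    return sum(wi * xi for wi, xi in zip(w, x)) + b


def predict(w, b, x):
    """Return +1 or -1 according to sign(w^T x + b)."""
    # Assumption: a point exactly on the hyperplane is assigned to +1.
    return 1 if decision_value(w, b, x) >= 0 else -1


# Hypothetical 2-D hyperplane x1 + x2 - 2 = 0
w, b = [1.0, 1.0], -2.0
print(predict(w, b, [3.0, 2.0]))  # point on the positive side
print(predict(w, b, [0.0, 0.5]))  # point on the negative side
```

Only the sign of $w^Tx+b$ is used for classification; its magnitude becomes relevant in the distance formula of the next section.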

2. Derivation of the basic form of SVM

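SVM is also called the maximum-margin classifier. The margin is the distance from the hyperplane to its nearest training sample, so the model is defined as:

$\max\limits_{w,b}\ margin(w,b)=\max\limits_{w,b}\ \min\limits_{i}\ distance(w,b,x_i) \qquad (1) \\ s.t.\left\{ \begin{aligned} w^Tx_i+b> 0,y_i=+1 \\ w^Tx_i+b < 0,y_i=-1 \end{aligned} \right.,\ i=1,2,...,m\qquad (2)$
Multiplying $w$ and $b$ by a common positive factor does not change the hyperplane, so the constraints in (2) can be rescaled to:
$s.t.\left\{ \begin{aligned} w^Tx_i+b\ge 1,y_i=+1 \\ w^Tx_i+b \le -1,y_i=-1 \end{aligned} \right.\qquad (3)$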
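The rescaling from (2) to (3) can be spelled out as follows. This is a sketch that assumes the samples are strictly linearly separable; the symbols $c$, $w'$ and $b'$ are introduced here only for illustration.
$c=\min\limits_{i}\ |w^Tx_i+b|>0,\qquad w'=\frac{w}{c},\qquad b'=\frac{b}{c}$
The equation $w'^Tx+b'=0$ describes the same hyperplane as $w^Tx+b=0$, while every sample now satisfies $|w'^Tx_i+b'|=\frac{|w^Tx_i+b|}{c}\ge 1$ with the sign unchanged. Renaming $w'$, $b'$ back to $w$, $b$ gives exactly the constraints in (3).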
Let $x'$ be a point on the separating hyperplane; then $w^Tx'=-b\qquad (4)$
The distance from an arbitrary point $x$ to the hyperplane is:
$r=|\frac{w^T}{||w||}(x-x')|\ \text{(projection)}=\frac{1}{||w||}|w^Tx+b|\ \text{(substituting (4))}$
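The distance formula can be checked numerically with a small sketch. The function name and the example hyperplane are assumptions chosen so that $||w||=5$ and the arithmetic is easy to follow by hand.

```python
import math


def distance_to_hyperplane(w, b, x):
    """Distance r = |w^T x + b| / ||w|| from point x to the hyperplane w^T x + b = 0."""
    norm_w = math.sqrt(sum(wi * wi for wi in w))
    if norm_w == 0.0:
        raise ValueError("w must be a non-zero vector")
    return abs(sum(wi * xi for wi, xi in zip(w, x)) + b) / norm_w


# Hypothetical hyperplane 3*x1 + 4*x2 - 5 = 0, so ||w|| = 5
w, b = [3.0, 4.0], -5.0
print(distance_to_hyperplane(w, b, [3.0, 4.0]))  # |9 + 16 - 5| / 5 = 4.0
print(distance_to_hyperplane(w, b, [1.0, 0.5]))  # point on the hyperplane: 0.0
```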
[Figure: support vectors and the margin]
As shown in the figure, the training samples closest to the hyperplane make the equalities in (3) hold, so for these nearest points $|w^Tx+b|=1$. They are called "support vectors". According to the distance formula, the distance from a support vector to the hyperplane is:

$r=\frac1{||w||} \qquad(5)$
The sum of the distances from two support vectors of different classes to the hyperplane is:
$\gamma =\frac2{||w||} \qquad(6)$
which is called the margin.
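To make the support vectors and the margin concrete, the sketch below uses a hypothetical four-point data set and a hand-derived hyperplane that is already scaled as in (3). The data, the hyperplane, the function names, and the tolerance `tol` are all assumptions for illustration.

```python
import math


def margin(w):
    """Margin gamma = 2 / ||w|| of a hyperplane scaled as in (3)."""
    return 2.0 / math.sqrt(sum(wi * wi for wi in w))


def support_vectors(w, b, samples, tol=1e-9):
    """Return the (x, y) pairs with y * (w^T x + b) equal to 1 up to a tolerance."""
    found = []
    for x, y in samples:
        value = y * (sum(wi * xi for wi, xi in zip(w, x)) + b)
        if abs(value - 1.0) <= tol:
            found.append((x, y))
    return found


# Hypothetical toy data and a hand-derived hyperplane for it
samples = [([2.0, 2.0], 1), ([3.0, 3.0], 1), ([0.0, 0.0], -1), ([-1.0, 0.0], -1)]
w, b = [0.5, 0.5], -1.0

print(support_vectors(w, b, samples))  # the points (2, 2) and (0, 0)
print(margin(w))                       # equals the distance between those two points
```

In this example the margin equals $2\sqrt2$, the distance between the two support vectors, because the line joining them happens to be parallel to $w$.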
To find the separating hyperplane with the maximum margin, we look for the parameters $w$ and $b$ that satisfy the constraints in (3) and maximize $\gamma$. Because $y_i\in\{-1,+1\}$, the two cases in (3) combine into the single constraint $y_i(w^Tx_i+b)\ge1$, which gives
$\begin{aligned} &\max\limits_{w,b}\ \frac2{||w||}\\ &s.t. \ y_i(w^Tx_i+b)\ge1,i=1,2,...,m \end{aligned}\qquad(7)$
To maximize the margin we only need to maximize $||w||^{-1}$, which is equivalent to minimizing $||w||^{2}$. Formula (7) can therefore be rewritten as

$\begin{aligned} &\min\limits_{w,b}\ \frac1{2}{||w||}^2\\ &s.t. \ y_i(w^Tx_i+b)\ge1,i=1,2,...,m \end{aligned}\qquad(8)$
Formula (8) is the basic form of SVM.
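Problem (8) is a convex quadratic program and is normally handed to a dedicated solver. Purely as an illustration of what the objective and constraints mean, the sketch below replaces the solver with a brute-force grid search on a hypothetical 2-D toy data set. The data, the grid step, the search ranges, and all function names are assumptions, and the approach does not scale beyond tiny examples.

```python
def is_feasible(w, b, samples, tol=1e-12):
    """Check the constraints y_i * (w^T x_i + b) >= 1 of problem (8)."""
    return all(
        y * (sum(wi * xi for wi, xi in zip(w, x)) + b) >= 1.0 - tol
        for x, y in samples
    )


def objective(w):
    """Objective of problem (8): 0.5 * ||w||^2."""
    return 0.5 * sum(wi * wi for wi in w)


def grid_search_svm(samples, step=0.25, w_range=2.0, b_range=3.0):
    """Brute-force search over a regular grid of (w1, w2, b) for 2-D data.

    Returns the feasible (w, b) with the smallest objective, or None if no
    grid point satisfies the constraints.
    """
    n_w = int(round(w_range / step))
    n_b = int(round(b_range / step))
    best = None
    for i in range(-n_w, n_w + 1):
        for j in range(-n_w, n_w + 1):
            w = [i * step, j * step]
            for k in range(-n_b, n_b + 1):
                b = k * step
                if not is_feasible(w, b, samples):
                    continue
                if best is None or objective(w) < objective(best[0]):
                    best = (w, b)
    return best


# Hypothetical linearly separable toy data
samples = [([2.0, 2.0], 1), ([3.0, 3.0], 1), ([0.0, 0.0], -1), ([-1.0, 0.0], -1)]
w, b = grid_search_svm(samples)
print(w, b, objective(w))
```

For this data the search returns $w=(0.5;0.5)$ and $b=-1$, giving the margin $\gamma=2/||w||=2\sqrt2$. The grid only finds the exact optimum here because the optimum happens to lie on a grid point.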

Copyright notice
This article was written by [Master core technology]. Please include a link to the original when reposting. Thanks.
https://yzsam.com/2022/02/202202150616559148.html
