当前位置:网站首页>21. Support Vector Machine - Introduction to Kernel Functions
21. Support Vector Machine - Introduction to Kernel Functions
2022-07-31 21:11:00 【WuJiaYFN】
Main content
- The concept of kernel function
- Classification of kernel functions
I. The concept of kernel function
1.1 Kernel function definition
- In order to better classify, the support vector machine maps the input low-dimensional space X (a subset or discrete set of Euclidean space R^n) to a high-dimensional feature space H (Hilbert space), if there is K(x, y), x, y ∈ Χ in the low-dimensional space, such that K(x, y) = ϕ(x) ϕ(y), then K(x, y) is the kernel function, where ϕ(x)·ϕ(y) is the inner product mapped to the feature space by ϕ(x)·ϕ(y), and ϕ(x) is X→Hthe mapping function.
- Visual understanding: Think of a kernel function as a wrapper or interface that converts data from a hard-to-handle form to an easier-to-handle form
- Kernel tricks or nuclear transformation: One of the benefits of SVM optimization is that all operations can be written in the form of inner products. We can replace the inner product operations with kernel functions for processing.This process is called nuclear technology or nuclear transformation
The role of the 1.2 kernel function
- After spatial transformation through the kernel function, we can solve linear problems in high-dimensional space, which is equivalent to solving nonlinear problems in low-dimensional space
- The dimension of the target feature space H is generally high, and may even be infinite, so it is difficult to find the inner product. When using it, only the kernel function is defined, and the mapping function ϕ is not explicitly defined, so only the transformed inner product is involved.product without transforming the value.In this way, on the one hand, it can solve the linear inseparable problem, and on the other hand, it avoids the "curse of dimensionality" and reduces the amount of computation.
- Kernel functions are not only used in support vector machines, many other machine learning algorithms also use kernel functions
Second, the classification of kernel functions
- Introduction to common kernel functions in the collection
2.1 Linear Kernel
- Linear kernel function is the simplest kernel function, mainly used for linearly separable cases
- expression for linear kernel function:

where c is a optional constant
- The linear kernel function is the inner product of the original input space, that is, The dimension of the feature space and the input space is the same, with fewer parameters and faster operation speed
- In general, when the number of features is very large relative to the number of samples, a linear kernel function is suitable.
2.2 Polynomial Kernel
The expression of the polynomial kernel function is:

- α represents the adjustment parameter, d represents the highest order number, and c is an optional constant
The polynomial kernel function has many parameters. When the polynomial order is high, the complexity will be very high. For the orthogonally normalized data, the polynomial kernel function can be preferred
2.3 Radial Basis Function Kernel
The radial basis kernel function is also called Gaussian kernel function because it is similar to the Gaussian function
The expression of the Radial Basis Kernel Function is:

- The larger the α2 is, the smoother the Gaussian kernel function becomes, that is, a model that changes slowly with the input x, The model has large deviation and variance, poor generalization ability, and is easy to overfit
- The smaller the α2, the more severe the change of the Gaussian kernel function, the smaller the deviation and variance of the model, and the more sensitive the model is to noise samples.
Radial basis kernel functions are flexible and widely used
Compared with the polynomial kernel function, it has fewer parameters, so in most cases, it has better performance; When you are not sure which kernel function to use, it can be preferredVerify the Gaussian kernel function
2.4 Sigmoid Kernel
Sigmoid kernel function expression is as follows:

- α represents the adjustment parameter, c is an optional constant, generally, c is 1/n, and n is the data dimension
The Sigmoid kernel function is derived from the activation function in MLP, and SVM uses Sigmoid to be equivalent to a two-layer perceptron network
If you think the article is helpful to you, you can like and encourage me, welcome to collect the article and learn together
Follow me, we will learn and make progress together!!!
边栏推荐
- 【愚公系列】2022年07月 Go教学课程 025-递归函数
- PCB叠层设计
- 关注!海泰方圆加入《个人信息保护自律公约》
- grep command written test questions
- 嵌入式开发没有激情了,正常吗?
- The whole network is on the verge of triggering, and the all-round assistant for content distribution from media people - Rongmeibao
- A few permanent free network transmission, convenient and simple (Intranet through tutorials)
- Cache and Database Consistency Solutions
- 基于STM32 环形队列来实现串口接收数据
- C# 之 扑克游戏 -- 21点规则介绍和代码实现
猜你喜欢
随机推荐
【愚公系列】2022年07月 Go教学课程 025-递归函数
微信小程序的路由拦截
多线程之锁
npm 更改为淘宝镜像的方法[通俗易懂]
【AcWing】The 62nd Weekly Match 【2022.07.30】
c语言解析json字符串(json对象转化为字符串)
How to change npm to Taobao mirror [easy to understand]
Memblaze released the first enterprise-grade SSD based on long-lasting particles. What is the new value behind it?
NVIDIA has begun testing graphics products with AD106 and AD107 GPU cores
性能优化:记一次树的搜索接口优化思路
高通cDSP简单编程例子(实现查询高通cDSP使用率、签名),RK3588 npu使用率查询
rj45 to the connector Gigabit (Fast Ethernet interface definition)
MATLAB program design and application 2.4 Common internal functions of MATLAB
matplotlib ax bar color 设置ax bar的颜色、 透明度、label legend
Pytorch lstm time series prediction problem stepping on the pit "recommended collection"
Redis Overview: Talk to the interviewer all night long about Redis caching, persistence, elimination mechanism, sentinel, and the underlying principles of clusters!...
Douyin fetches video list based on keywords API
【Yugong Series】July 2022 Go Teaching Course 025-Recursive Function
Taobao/Tmall get Taobao password real url API
leetcode: 6135. The longest ring in the graph [inward base ring tree + longest ring board + timestamp]








