当前位置:网站首页>21. Support Vector Machine - Introduction to Kernel Functions
21. Support Vector Machine - Introduction to Kernel Functions
2022-07-31 21:11:00 【WuJiaYFN】
Main content
- The concept of kernel function
- Classification of kernel functions
I. The concept of kernel function
1.1 Kernel function definition
- In order to better classify, the support vector machine maps the input low-dimensional space X (a subset or discrete set of Euclidean space R^n) to a high-dimensional feature space H (Hilbert space), if there is K(x, y), x, y ∈ Χ in the low-dimensional space, such that K(x, y) = ϕ(x) ϕ(y), then K(x, y) is the kernel function, where ϕ(x)·ϕ(y) is the inner product mapped to the feature space by ϕ(x)·ϕ(y), and ϕ(x) is X→Hthe mapping function.
- Visual understanding: Think of a kernel function as a wrapper or interface that converts data from a hard-to-handle form to an easier-to-handle form
- Kernel tricks or nuclear transformation: One of the benefits of SVM optimization is that all operations can be written in the form of inner products. We can replace the inner product operations with kernel functions for processing.This process is called nuclear technology or nuclear transformation
The role of the 1.2 kernel function
- After spatial transformation through the kernel function, we can solve linear problems in high-dimensional space, which is equivalent to solving nonlinear problems in low-dimensional space
- The dimension of the target feature space H is generally high, and may even be infinite, so it is difficult to find the inner product. When using it, only the kernel function is defined, and the mapping function ϕ is not explicitly defined, so only the transformed inner product is involved.product without transforming the value.In this way, on the one hand, it can solve the linear inseparable problem, and on the other hand, it avoids the "curse of dimensionality" and reduces the amount of computation.
- Kernel functions are not only used in support vector machines, many other machine learning algorithms also use kernel functions
Second, the classification of kernel functions
- Introduction to common kernel functions in the collection
2.1 Linear Kernel
- Linear kernel function is the simplest kernel function, mainly used for linearly separable cases
- expression for linear kernel function:

where c is a optional constant
- The linear kernel function is the inner product of the original input space, that is, The dimension of the feature space and the input space is the same, with fewer parameters and faster operation speed
- In general, when the number of features is very large relative to the number of samples, a linear kernel function is suitable.
2.2 Polynomial Kernel
The expression of the polynomial kernel function is:

- α represents the adjustment parameter, d represents the highest order number, and c is an optional constant
The polynomial kernel function has many parameters. When the polynomial order is high, the complexity will be very high. For the orthogonally normalized data, the polynomial kernel function can be preferred
2.3 Radial Basis Function Kernel
The radial basis kernel function is also called Gaussian kernel function because it is similar to the Gaussian function
The expression of the Radial Basis Kernel Function is:

- The larger the α2 is, the smoother the Gaussian kernel function becomes, that is, a model that changes slowly with the input x, The model has large deviation and variance, poor generalization ability, and is easy to overfit
- The smaller the α2, the more severe the change of the Gaussian kernel function, the smaller the deviation and variance of the model, and the more sensitive the model is to noise samples.
Radial basis kernel functions are flexible and widely used
Compared with the polynomial kernel function, it has fewer parameters, so in most cases, it has better performance; When you are not sure which kernel function to use, it can be preferredVerify the Gaussian kernel function
2.4 Sigmoid Kernel
Sigmoid kernel function expression is as follows:

- α represents the adjustment parameter, c is an optional constant, generally, c is 1/n, and n is the data dimension
The Sigmoid kernel function is derived from the activation function in MLP, and SVM uses Sigmoid to be equivalent to a two-layer perceptron network
If you think the article is helpful to you, you can like and encourage me, welcome to collect the article and learn together
Follow me, we will learn and make progress together!!!
边栏推荐
- PCB叠层设计
- BM3 将链表中的节点每k个一组翻转
- GAC Honda Safety Experience Camp: "Danger" is the best teacher
- 【愚公系列】2022年07月 Go教学课程 023-Go容器之列表
- 【核心概念】图像分类和目标检测中的正负样本划分以及架构理解
- c语言解析json字符串(json对象转化为字符串)
- Apache EventMesh distributed event-driven multi-runtime
- 1161. Maximum Sum of Elements in Layer: Hierarchical Traversal Application Problems
- find prime numbers up to n
- Linux环境redis集群搭建「建议收藏」
猜你喜欢

统计UTF-8字符串中的字符函数

Chapter VII

Embedded development has no passion, is it normal?

ResNet的基础:残差块的原理

全网一触即发,自媒体人的内容分发全能助手——融媒宝

Shell 脚本 快速入门到实战 -02

Introduction to Audio Types and Encoding Formats in Unity

Memblaze released the first enterprise-grade SSD based on long-lasting particles. What is the new value behind it?
![[Intensive reading of the paper] iNeRF](/img/a7/910667911e1ce8996b9d22de63ea04.png)
[Intensive reading of the paper] iNeRF

Qualcomm cDSP simple programming example (to query Qualcomm cDSP usage, signature), RK3588 npu usage query
随机推荐
MATLAB program design and application 2.4 Common internal functions of MATLAB
移动web开发02
ThreadLocal
BM3 将链表中的节点每k个一组翻转
老牌音乐播放器 WinAmp 发布 5.9 RC1 版:迁移到 VS 2019 完全重建,兼容 Win11
Structure of the actual combat battalion module eight operations
idea中搜索具体的字符内容的快捷方式
MySQL - single function
Batch (batch size, full batch, mini batch, online learning), iterations and epochs in deep learning
Get Douyin Video Details API
Thymeleaf是什么?该如何使用。
The old music player WinAmp released version 5.9 RC1: migrated to VS 2019, completely rebuilt, compatible with Win11
【AcWing】第 62 场周赛 【2022.07.30】
顺序表的实现
leetcode:6135. 图中的最长环【内向基环树 + 最长环板子 + 时间戳】
A shortcut to search for specific character content in idea
How programmers learn open source projects, this article tells you
1161. 最大层内元素和 : 层序遍历运用题
"The core concept of" image classification and target detection in the positive and negative samples and understanding architecture
grep命令 笔试题