当前位置:网站首页>21. Support Vector Machine - Introduction to Kernel Functions
21. Support Vector Machine - Introduction to Kernel Functions
2022-07-31 21:11:00 【WuJiaYFN】
Main content
- The concept of kernel function
- Classification of kernel functions
I. The concept of kernel function
1.1 Kernel function definition
- In order to better classify, the support vector machine maps the input low-dimensional space X (a subset or discrete set of Euclidean space R^n) to a high-dimensional feature space H (Hilbert space), if there is K(x, y), x, y ∈ Χ in the low-dimensional space, such that K(x, y) = ϕ(x) ϕ(y), then K(x, y) is the kernel function, where ϕ(x)·ϕ(y) is the inner product mapped to the feature space by ϕ(x)·ϕ(y), and ϕ(x) is X→Hthe mapping function.
- Visual understanding: Think of a kernel function as a wrapper or interface that converts data from a hard-to-handle form to an easier-to-handle form
- Kernel tricks or nuclear transformation: One of the benefits of SVM optimization is that all operations can be written in the form of inner products. We can replace the inner product operations with kernel functions for processing.This process is called nuclear technology or nuclear transformation
The role of the 1.2 kernel function
- After spatial transformation through the kernel function, we can solve linear problems in high-dimensional space, which is equivalent to solving nonlinear problems in low-dimensional space
- The dimension of the target feature space H is generally high, and may even be infinite, so it is difficult to find the inner product. When using it, only the kernel function is defined, and the mapping function ϕ is not explicitly defined, so only the transformed inner product is involved.product without transforming the value.In this way, on the one hand, it can solve the linear inseparable problem, and on the other hand, it avoids the "curse of dimensionality" and reduces the amount of computation.
- Kernel functions are not only used in support vector machines, many other machine learning algorithms also use kernel functions
Second, the classification of kernel functions
- Introduction to common kernel functions in the collection
2.1 Linear Kernel
- Linear kernel function is the simplest kernel function, mainly used for linearly separable cases
- expression for linear kernel function:
where c is a optional constant
- The linear kernel function is the inner product of the original input space, that is, The dimension of the feature space and the input space is the same, with fewer parameters and faster operation speed
- In general, when the number of features is very large relative to the number of samples, a linear kernel function is suitable.
2.2 Polynomial Kernel
The expression of the polynomial kernel function is:
- α represents the adjustment parameter, d represents the highest order number, and c is an optional constant
The polynomial kernel function has many parameters. When the polynomial order is high, the complexity will be very high. For the orthogonally normalized data, the polynomial kernel function can be preferred
2.3 Radial Basis Function Kernel
The radial basis kernel function is also called Gaussian kernel function because it is similar to the Gaussian function
The expression of the Radial Basis Kernel Function is:
- The larger the α2 is, the smoother the Gaussian kernel function becomes, that is, a model that changes slowly with the input x, The model has large deviation and variance, poor generalization ability, and is easy to overfit
- The smaller the α2, the more severe the change of the Gaussian kernel function, the smaller the deviation and variance of the model, and the more sensitive the model is to noise samples.
Radial basis kernel functions are flexible and widely used
Compared with the polynomial kernel function, it has fewer parameters, so in most cases, it has better performance; When you are not sure which kernel function to use, it can be preferredVerify the Gaussian kernel function
2.4 Sigmoid Kernel
Sigmoid kernel function expression is as follows:
- α represents the adjustment parameter, c is an optional constant, generally, c is 1/n, and n is the data dimension
The Sigmoid kernel function is derived from the activation function in MLP, and SVM uses Sigmoid to be equivalent to a two-layer perceptron network
If you think the article is helpful to you, you can like and encourage me, welcome to collect the article and learn together
Follow me, we will learn and make progress together!!!
边栏推荐
- Count characters in UTF-8 string function
- SiC MOSFET的短路特性及保护
- How to change npm to Taobao mirror [easy to understand]
- Efficient Concurrency: A Detailed Explanation of Synchornized's Lock Optimization
- Architecture Battalion Module 8 Homework
- 全网一触即发,自媒体人的内容分发全能助手——融媒宝
- Structure of the actual combat battalion module eight operations
- Book of the Month (202207): The Definitive Guide to Swift Programming
- MySQL---Create and manage databases and data tables
- 手把手教你学会部署Nestjs项目
猜你喜欢
Poker Game in C# -- Introduction and Code Implementation of Blackjack Rules
Embedded development has no passion, is it normal?
嵌入式开发没有激情了,正常吗?
Chapter VII
NVIDIA已经开始测试AD106和AD107 GPU核心的显卡产品
ReentrantLock原理(未完待续)
关注!海泰方圆加入《个人信息保护自律公约》
顺序表的实现
Short-circuit characteristics and protection of SiC MOSFETs
手把手教你学会部署Nestjs项目
随机推荐
基于STM32 环形队列来实现串口接收数据
Thymeleaf是什么?该如何使用。
Pytorch lstm time series prediction problem stepping on the pit "recommended collection"
给定一个ip地址,子网掩码怎么算网络号(如何获取ip地址和子网掩码)
One thing to say, is outsourcing company worth it?
Write a database document management tool based on WPF repeating the wheel (1)
Book of the Month (202207): The Definitive Guide to Swift Programming
pytorch lstm时间序列预测问题踩坑「建议收藏」
第六章
API for JD.com to obtain historical price information of commodities
Arduino框架下STM32全系列开发固件安装指南
AI 自动写代码插件 Copilot(副驾驶员)
架构师04-应用服务间加密设计和实践
有一说一,外包公司到底值不值得去?
微信小程序的路由拦截
【AcWing】The 62nd Weekly Match 【2022.07.30】
1161. 最大层内元素和 : 层序遍历运用题
Carbon教程之 基本语法入门大全 (教程)
Shell 脚本 快速入门到实战 -02
BM3 flips the nodes in the linked list in groups of k