当前位置:网站首页>Data mining (data preprocessing) -- Notes
Data mining (data preprocessing) -- Notes
2022-07-28 20:25:00 【A hard-working Mengxin, come on】
data mining
Data cleaning
Anomaly detection 
1. Missing value fill
local Outlier Factor
distancek(A,B)=max(diatancek(B), d(A,B))
K a near neighbor .
You can't directly compare Lrd(A) Size
To do ratio LOF(A)
The following data is the displayed result :
The bigger the data , The more outlying .
2、Duplicate Data ( Duplicate data ) To weed out

Create Keys -> Sort -> Merge
keys key word -> Used to find two relatively close data .
Data Transformation
边栏推荐
- WFST decoding process
- Array out of bounds
- Raspberry pie 3b ffmpeg RTMP streaming
- Storage of C language data in memory (1)
- 3、 Are formal and actual parameters in a programming language variables?
- 9. Pointer of C language (1) what is pointer and how to define pointer variables
- mmo及时战斗游戏中的场景线程分配
- Linxu 【基本指令】
- Wust-ctf2021-re school match WP
- Advanced notes (Part 2)
猜你喜欢
![[C language] simulation implementation of strlen (recursive and non recursive)](/img/73/e92fe714515491f1ea366d6924c9ec.png)
[C language] simulation implementation of strlen (recursive and non recursive)
![[C language] simulation implementation of pow function (recursion)](/img/7b/ef8b3d97adc7810de249a37642c71f.png)
[C language] simulation implementation of pow function (recursion)
![[C language] function](/img/81/c185e2bb5eccc13433483f9558f52a.png)
[C language] function

Linxu 【基本指令】

CNN convolution neural network learning process (weight update)
![[C language] print pattern summary](/img/48/d8ff17453e810fcd9269f56eda4d47.png)
[C language] print pattern summary
![[C language] guessing numbers game [function]](/img/db/8ebdb02f137878224367503b730803.png)
[C language] guessing numbers game [function]

Reverse string

4. Const and difine and the problem of initializing arrays with const and define
![[C language] Pointer advanced knowledge points](/img/8f/0057243c603ddfe20381c9bd446f03.png)
[C language] Pointer advanced knowledge points
随机推荐
[C language] string reverse order implementation (recursion and iteration)
Array out of bounds
[experiment sharing] CCIE BGP reflector experiment
读取json配置文件,实现数据驱动测试
Solutions to the environment created by Anaconda that cannot be displayed in pycharm
WUST-CTF2021-re校赛wp
Anaconda creation environment
The privatized instant messaging platform protects the security of enterprise mobile business
83.(cesium之家)cesium示例如何运行
Implementation of strcat in C language
Solve the problem of adding the least number of parentheses (interval DP)
C language - control statement
1. C language variable type, global variable, local variable
5. Difference between break and continue (easy to understand version)
Linux Installation MySQL (pit filling version)
Regular symbol description
[C language] random number generation and `include < time. H > 'learning
[C language] comprehensively analyze the pointer and sort out the pointer knowledge points
[C language] 5000 word super detailed explanation of various operations of the sequence table
通配符 SSL/TLS 证书