当前位置:网站首页>Data mining (data preprocessing) -- Notes
Data mining (data preprocessing) -- Notes
2022-07-28 20:25:00 【A hard-working Mengxin, come on】
data mining
Data cleaning
Anomaly detection 
1. Missing value fill
local Outlier Factor
distancek(A,B)=max(diatancek(B), d(A,B))
K a near neighbor .
You can't directly compare Lrd(A) Size
To do ratio LOF(A)
The following data is the displayed result :
The bigger the data , The more outlying .
2、Duplicate Data ( Duplicate data ) To weed out

Create Keys -> Sort -> Merge
keys key word -> Used to find two relatively close data .
Data Transformation
边栏推荐
- Usage of const and assert
- Rand function generates pseudo-random numbers
- [C language] scanf format input and modifier summary
- 9. Pointer of C language (4) pointer and one-dimensional array, pointer operation
- Raspberry connects EC20 for PPP dialing
- How to use pycharm to quickly create a flask project
- 9. Pointer of C language (5) how many bytes does the pointer variable occupy
- Why is customer support important to SaaS?
- Introduction to seven kinds of polling (practice link attached)
- Linxu 【权限,粘滞位】
猜你喜欢

Item exception handling in SSM
![[C language] function](/img/81/c185e2bb5eccc13433483f9558f52a.png)
[C language] function

83.(cesium之家)cesium示例如何运行

为什么客户支持对SaaS公司很重要?

DSACTF7月re

JVM (24) -- performance monitoring and tuning (5) -- Analyzing GC logs
![[C language] Hanoi Tower problem [recursion]](/img/d8/ff66928c2bc2ad906e38a360a8cf94.png)
[C language] Hanoi Tower problem [recursion]

How to use pycharm to quickly create a flask project

CM4 development cross compilation tool chain production
![[C language] use function pointers to make a different calculator](/img/58/e6ba11e054d9e45ec979224ac3e4c6.png)
[C language] use function pointers to make a different calculator
随机推荐
Implementation of strcat in C language
C+ + core programming
读取json配置文件,实现数据驱动测试
Richpedia: A Large-Scale, Comprehensive Multi-Modal Knowledge Graph
ssm中项目异常处理
Commands related to obtaining administrator permissions
WPF -- implement websocket server
Maximum exchange [greedy thought & monotonic stack implementation]
robobrowser的简单使用
【CodeForces】Educational Codeforces Round 132 (Rated for Div. 2)
User, user group related operations
C language - pointer
Vivado designs PC and ram
Solve the cookie splitting problem (DP)
最大交换[贪心思想&单调栈实现]
Implementation of memmove in C language
Linxu 【基本指令】
Solve the brick stacking problem (DP)
HSETNX KEY_ Name field value usage
字符设备驱动结构