Why divide the training set and the test set before normalization?
2022-06-27 01:40:00 【Thinking and Practice】
What is the difference between splitting the data into a training set and a test set first and then normalizing, and normalizing first and then splitting?
In principle, the data set should be split first. Then preprocess the training data, save the preprocessing parameters, and process the test set with those same parameters.
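A minimal sketch of that order of operations, using only the Python standard library. The helper names (`train_test_split`, `fit_standardizer`, `apply_standardizer`) and the toy data are assumptions made for illustration, not part of the original answer, and standardization (zero mean, unit variance) stands in for whatever preprocessing is actually used.

```python
import random
import statistics


def train_test_split(rows, test_ratio=0.25, seed=0):
    """Shuffle a copy of rows and split it into (train, test)."""
    rng = random.Random(seed)
    shuffled = rows[:]
    rng.shuffle(shuffled)
    n_test = int(len(shuffled) * test_ratio)
    return shuffled[n_test:], shuffled[:n_test]


def fit_standardizer(values):
    """Learn the preprocessing parameters (mean, std) from the given values."""
    mean = statistics.fmean(values)
    std = statistics.pstdev(values)
    if std == 0:
        std = 1.0  # avoid dividing by zero for a constant feature
    return mean, std


def apply_standardizer(values, params):
    """Transform values with previously saved parameters."""
    mean, std = params
    return [(v - mean) / std for v in values]


# Hypothetical one-feature data set.
data = [float(x) for x in range(1, 21)]

# 1. Split first.
train, test = train_test_split(data)

# 2. Fit the preprocessing on the training set only and keep the parameters.
params = fit_standardizer(train)

# 3. Apply the same saved parameters to both sets.
train_scaled = apply_standardizer(train, params)
test_scaled = apply_standardizer(test, params)
```

The scaled training set has mean 0 and standard deviation 1 by construction. The scaled test set generally does not, which is expected: it is measured against the training set's parameters.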
This is because the train/test split rests on the assumption that only the training set's information is known, while the test data is treated as coming from the future and is therefore unknown. If the split is made after preprocessing, information from the test set has already been used.
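A small illustration of how test-set information ends up in the preprocessing when normalization comes first. Min-max scaling is chosen here only because the effect is easy to see. The numbers and the helper names (`fit_min_max`, `apply_min_max`) are made up for this sketch.

```python
def fit_min_max(values):
    """Learn the min-max scaling parameters from the given values."""
    return min(values), max(values)


def apply_min_max(values, params):
    """Scale values with previously learned (low, high) parameters."""
    low, high = params
    return [(v - low) / (high - low) for v in values]


# Hypothetical feature values; the test value is larger than anything in train.
train = [1.0, 2.0, 3.0, 4.0, 5.0]
test = [9.0]

# Normalizing before splitting: the maximum comes from the test sample.
leaky_params = fit_min_max(train + test)
leaky_train = apply_min_max(train, leaky_params)

# Splitting before normalizing: parameters depend on the training set only.
clean_params = fit_min_max(train)
clean_train = apply_min_max(train, clean_params)
clean_test = apply_min_max(test, clean_params)

print(leaky_params, leaky_train)
print(clean_params, clean_train, clean_test)
```

In the leaky variant the training features are compressed into [0, 0.5] because the "future" test value set the upper bound. In the clean variant the test value maps outside [0, 1], which is what would happen with genuinely unseen data.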
Either normalize both sets or normalize neither. The test is only valid when both sets follow the same distribution.
For some models, such as decision trees, the result is the same either way. For others, such as regression analysis, consistent normalization is a must.
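One way to see why a decision tree is unaffected: a threshold split depends only on the ordering of the feature values, and standardization preserves that ordering. The sketch below uses a hypothetical single-split "stump" (`best_stump_partition`, `min_errors`) on made-up data. It is a simplified stand-in for a real tree learner, not a full implementation.

```python
def min_errors(indices, ys):
    """Misclassifications if this side predicts its majority label (0 or 1)."""
    ones = sum(ys[i] for i in indices)
    return min(ones, len(indices) - ones)


def best_stump_partition(xs, ys):
    """Return the indices sent to the left branch by the best threshold split."""
    order = sorted(range(len(xs)), key=lambda i: xs[i])
    best_errors, best_left = None, None
    for k in range(1, len(order)):
        # No threshold can separate two equal feature values.
        if xs[order[k - 1]] == xs[order[k]]:
            continue
        left, right = order[:k], order[k:]
        errors = min_errors(left, ys) + min_errors(right, ys)
        if best_errors is None or errors < best_errors:
            best_errors, best_left = errors, frozenset(left)
    return best_left


# Hypothetical one-feature binary classification data.
xs = [2.0, 3.0, 10.0, 12.0, 4.0, 11.0]
ys = [0, 0, 1, 1, 0, 1]

# Standardize the feature (zero mean, unit variance).
mean = sum(xs) / len(xs)
std = (sum((x - mean) ** 2 for x in xs) / len(xs)) ** 0.5
xs_scaled = [(x - mean) / std for x in xs]

same_partition = best_stump_partition(xs, ys) == best_stump_partition(xs_scaled, ys)
print(same_partition)
```

The threshold value itself changes with the scale, but the samples that fall on each side of it do not, so the predictions are the same.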
Reference material
Why divide the training set and test set first and then normalize them? (CDA Q&A community)