Task 1 Titanic top 4% with ensemble modeling
2022-07-26 07:08:00 【Heihei_ study】
I. NaN in Python
NaN ("not a number") is a special floating-point value that stands for an undefined or unrepresentable result. It is often mentioned together with inf. The difference is that inf stands for a value whose magnitude exceeds the range a float can represent: it is still treated as a number, just an infinite one (for example, 1e308 * 10 overflows to inf). NaN is what an operation returns when it has no meaningful numeric result, such as inf - inf or 0 * inf. Irrational numbers are not NaN; they are simply stored as rounded floats. Note that 1/0 raises ZeroDivisionError in plain Python, while dividing a NumPy float by zero returns inf with a warning. In pandas, NaN is also the marker for missing values.
Reference: "What does nan mean in Python?" (Zhihu)
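The behaviour described above can be checked with a few lines of standard-library Python. This is only a minimal sketch; the variable names are illustrative and not part of the original post.

```python
import math

pos_inf = float("inf")
nan = float("nan")

# inf behaves like a number: it compares greater than every finite float.
inf_is_largest = pos_inf > 1e308

# Multiplying past the float range overflows to inf.
overflowed = 1e308 * 10

# An operation with no meaningful result produces NaN.
undefined = pos_inf - pos_inf

# NaN is not equal to anything, including itself, so test with math.isnan().
nan_equals_itself = nan == nan
nan_detected = math.isnan(undefined)

# Plain Python does not return inf for 1 / 0; it raises an exception.
try:
    1 / 0
    raised = False
except ZeroDivisionError:
    raised = True
```

Because NaN never compares equal to itself, checks such as `x == float("nan")` always fail; use `math.isnan`, `numpy.isnan`, or `pandas.isna` instead.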
II. Detecting outliers in the dataset with the Tukey method
If an observation is unusually large or small compared with the rest of the dataset, we call it a suspected outlier. Suspected outliers can distort later calculations, so it is important to detect them and handle them appropriately.
A classic way to find suspected outliers is the Tukey method. It first computes the first quartile (Q1) and the third quartile (Q3) of the data, then the interquartile range IQR = Q3 - Q1. Any data point smaller than Q1 - 1.5 * IQR or larger than Q3 + 1.5 * IQR is treated as a suspected outlier. We can apply this method to the columns of a DataFrame.
Reference: "Tukey's test for outliers", weixin_30466421 (CSDN blog)
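The rule above can be sketched as a small helper. This is a minimal sketch under stated assumptions, not necessarily the code used in the original notebook: the function name `detect_outliers`, its parameter `n`, and the toy data are all illustrative. A row is reported only if it falls outside the Tukey fences in more than `n` of the listed columns.

```python
from collections import Counter

import numpy as np
import pandas as pd


def detect_outliers(df, n, features):
    """Return index labels of rows that are Tukey outliers in more than n features."""
    outlier_indices = []
    for col in features:
        # nanpercentile ignores missing values when computing the quartiles.
        q1 = np.nanpercentile(df[col], 25)
        q3 = np.nanpercentile(df[col], 75)
        step = 1.5 * (q3 - q1)
        mask = (df[col] < q1 - step) | (df[col] > q3 + step)
        outlier_indices.extend(df[mask].index)
    # Count how many columns flagged each row.
    counts = Counter(outlier_indices)
    return [idx for idx, k in counts.items() if k > n]


# Hypothetical toy data, not the Titanic training set.
toy = pd.DataFrame({
    "Age": [22, 25, 24, 23, 26, 95],
    "Fare": [7.0, 8.0, 7.5, 8.5, 9.0, 500.0],
})
outliers = detect_outliers(toy, n=1, features=["Age", "Fare"])
```

Here the last row lies outside the fences for both columns, so it is the only index returned. Requiring more than one flagged column is a design choice that avoids discarding rows that are extreme in a single feature only.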
1. np.percentile() function
Reference: "np.percentile() explained in detail" (CSDN blog)
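As a quick illustration of how np.percentile() yields the quartiles needed for the Tukey fences, here is a small sketch on made-up values. With its default settings the function interpolates linearly between neighbouring data points.

```python
import numpy as np

values = np.array([1, 2, 3, 4, 5, 6, 7, 8, 9])

# Passing a list of percentiles returns one result per percentile.
q1, median, q3 = np.percentile(values, [25, 50, 75])
iqr = q3 - q1

# With an even number of points the median falls between two values.
interpolated = np.percentile([1, 2, 3, 4], 50)
```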
2. collections.Counter() usage
Reference: "Python collections.Counter() usage", pvop (CSDN blog)
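In the outlier step, Counter is handy for counting how often the same row index was flagged across several columns. A minimal sketch with made-up index values:

```python
from collections import Counter

# Hypothetical row indices flagged as outliers, one entry per flagged column.
flagged = [3, 7, 3, 12, 7, 3]

counts = Counter(flagged)

# Keep only the rows flagged in more than two columns.
repeat_offenders = [idx for idx, k in counts.items() if k > 2]

# most_common(1) returns the single most frequent (value, count) pair.
most_common = counts.most_common(1)
```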
3. loc and iloc indexers
Reference: "loc() and iloc()", Pepei. (CSDN blog)
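The difference between the two indexers is easiest to see when the index labels are not 0, 1, 2. A small sketch on made-up data: `loc` selects by label, `iloc` by integer position.

```python
import pandas as pd

df = pd.DataFrame(
    {"Age": [22, 38, 26], "Fare": [7.25, 71.28, 7.92]},
    index=[10, 20, 30],
)

by_label = df.loc[20, "Age"]     # row labelled 20, column "Age"
by_position = df.iloc[1, 0]      # second row, first column

label_slice = df.loc[10:20]      # label slices include both endpoints
position_slice = df.iloc[0:1]    # position slices exclude the stop
```

Both `by_label` and `by_position` refer to the same cell, but the two slices differ in length because only label-based slicing is inclusive.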
# Drop the outlier rows
train = train.drop(Outliers_to_drop, axis=0).reset_index(drop=True)  # axis=0 drops rows, not columns
4. reset_index(drop=True)
Reference: "reset_index(drop=True)", Yukyin (CSDN blog)
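To see why `reset_index(drop=True)` follows the drop, consider this sketch on a made-up frame; the values and the lowercase variable `outliers_to_drop` are illustrative only.

```python
import pandas as pd

train = pd.DataFrame({"Fare": [7.25, 512.33, 8.05, 263.0]})
outliers_to_drop = [1, 3]

# After dropping rows the index has gaps: 0, 2.
dropped = train.drop(outliers_to_drop, axis=0)

# drop=True renumbers the rows and discards the old index.
clean = dropped.reset_index(drop=True)

# Without drop=True the old index is kept as a new "index" column.
with_old_index = dropped.reset_index()
```

A gap-free index matters later if rows are accessed by position-like labels or if the frame is concatenated with the test set.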
5. fillna()
Reference: "fillna() function in detail", Denver_Liao (CSDN blog)
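A common pattern is to fill a numeric column with its median and a categorical column with a fixed value. The sketch below uses made-up data; the column names and the choice of fill values are assumptions for illustration.

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({
    "Fare": [7.25, np.nan, 8.05],
    "Embarked": ["S", None, "C"],
})

# median() skips NaN by default, so it can be used directly as the fill value.
df["Fare"] = df["Fare"].fillna(df["Fare"].median())

# Fill the missing category with a chosen label.
df["Embarked"] = df["Embarked"].fillna("S")

missing_left = int(df.isna().sum().sum())
```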
6. info() and describe() functions
Reference: "info() and describe() functions" (CSDN blog)
7. head() function
Reference: "The head() function in pandas" (CSDN blog)
III. Feature analysis
1. Numerical features
1.1. corr() function and heatmap
Reference: "corr() function usage" (CSDN blog)
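The correlation matrix behind such a heatmap can be sketched as follows. The data is made up and only shows the shape of the result; plotting is left as a comment because it assumes seaborn is installed.

```python
import pandas as pd

# Hypothetical numeric columns, not the real Titanic values.
df = pd.DataFrame({
    "Survived": [0, 1, 1, 0, 1],
    "Fare": [7.0, 70.0, 50.0, 8.0, 30.0],
    "Age": [40, 30, 25, 35, 20],
})

# Pairwise Pearson correlation between all numeric columns.
corr_matrix = df.corr()

fare_vs_survived = corr_matrix.loc["Survived", "Fare"]

# With seaborn available, the matrix could be drawn with:
#   import seaborn as sns
#   sns.heatmap(corr_matrix, annot=True)
```

The matrix is symmetric with ones on the diagonal, so reading a single row (for example the target column) is usually enough to spot the strongest linear relationships.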
1.2. catplot() function
1.3. despine() function
Reference: "Using seaborn's despine() function" (CSDN blog)
2. Categorical features
IV. Feature engineering