当前位置:网站首页>Machine learning -- census data analysis
Machine learning -- census data analysis
2022-07-06 11:10:00 【Pingguo stuffed with rice cakes】
machine learning -- Census data analysis
It is necessary to clean the data when conducting census analysis ; Clean the data by data cleaning ;
Download data Download the original data from the official website :UCI Machine Learning Repository



Will download okay adult.data File to adult.csv file

Data cleaning
Clean the data --- contrast https://archive.ics.uci.edu/ml/datasets/Adult Clean the data information in .

Alternative method









After replacing all strings , take <=50K Replace all with 0,>50K Replace all with 1.


The final will be ? perhaps NAN Replace with -1. notes : Be sure to pay attention to whether there are spaces .

Cleaning data is completed ( We must be careful that data cleaning errors will lead to the failure of decision tree analysis )

After cleaning the data, go to Alibaba cloud to create a project , To configure .
New project


Edit workflow
The first step is to create a COS Data sets Input -- data source --COS Data sets

To configure COS Data sets
The second step is to create a modified column name Algorithm -- Machine learning algorithm -- Data preprocessing -- Change column names

Configure and modify column names

The third step is data segmentation Algorithm -- Machine learning algorithm -- Data preprocessing -- Data segmentation

Data segmentation configuration

The fourth step is to classify the decision tree Algorithm -- Machine learning algorithm -- classification -- Decision tree classification

Then configure the decision tree to classify the previous


Connect

Finally, the second classification task evaluation Output -- Model to evaluate -- II. Classification task evaluation

Run

边栏推荐
- The virtual machine Ping is connected to the host, and the host Ping is not connected to the virtual machine
- MySQL19-Linux下MySQL的安装与使用
- QT creator design user interface
- CSDN blog summary (I) -- a simple first edition implementation
- JDBC principle
- [recommended by bloggers] C # generate a good-looking QR code (with source code)
- One click extraction of tables in PDF
- Idea import / export settings file
- 连接MySQL数据库出现错误:2059 - authentication plugin ‘caching_sha2_password‘的解决方法
- ++Implementation of I and i++
猜你喜欢

Mysql22 logical architecture

Why is MySQL still slow to query when indexing is used?

LeetCode #461 汉明距离

CSDN Q & a tag skill tree (V) -- cloud native skill tree
![[ahoi2009]chess Chinese chess - combination number optimization shape pressure DP](/img/7d/8cbbd2f328a10808319458a96fa5ec.jpg)
[ahoi2009]chess Chinese chess - combination number optimization shape pressure DP

Basic use of redis
![[recommended by bloggers] C # generate a good-looking QR code (with source code)](/img/5a/1dbafe5a28f016b815964b9b37c9f1.jpg)
[recommended by bloggers] C # generate a good-looking QR code (with source code)
![[recommended by bloggers] asp Net WebService background data API JSON (with source code)](/img/04/c721e6177b578b30cbbf334cb1b6c9.png)
[recommended by bloggers] asp Net WebService background data API JSON (with source code)

La table d'exportation Navicat génère un fichier PDM

One click extraction of tables in PDF
随机推荐
Swagger, Yapi interface management service_ SE
Introduction and use of automatic machine learning framework (flaml, H2O)
IDEA 导入导出 settings 设置文件
[recommended by bloggers] C # generate a good-looking QR code (with source code)
Postman uses scripts to modify the values of environment variables
JDBC原理
There are three iPhone se 2022 models in the Eurasian Economic Commission database
Esp8266 at+cipstart= "", "", 8080 error closed ultimate solution
MySQL19-Linux下MySQL的安装与使用
连接MySQL数据库出现错误:2059 - authentication plugin ‘caching_sha2_password‘的解决方法
Project practice - background employee information management (add, delete, modify, check, login and exit)
安全测试涉及的测试对象
SSM integrated notes easy to understand version
[recommended by bloggers] background management system of SSM framework (with source code)
A trip to Macao - > see the world from a non line city to Macao
MySQL other hosts cannot connect to the local database
Why is MySQL still slow to query when indexing is used?
CSDN问答标签技能树(五) —— 云原生技能树
JDBC principle
虚拟机Ping通主机,主机Ping不通虚拟机