当前位置:网站首页>Machine learning -- census data analysis
Machine learning -- census data analysis
2022-07-06 11:10:00 【Pingguo stuffed with rice cakes】
machine learning -- Census data analysis
It is necessary to clean the data when conducting census analysis ; Clean the data by data cleaning ;
Download data Download the original data from the official website :UCI Machine Learning Repository



Will download okay adult.data File to adult.csv file

Data cleaning
Clean the data --- contrast https://archive.ics.uci.edu/ml/datasets/Adult Clean the data information in .

Alternative method









After replacing all strings , take <=50K Replace all with 0,>50K Replace all with 1.


The final will be ? perhaps NAN Replace with -1. notes : Be sure to pay attention to whether there are spaces .

Cleaning data is completed ( We must be careful that data cleaning errors will lead to the failure of decision tree analysis )

After cleaning the data, go to Alibaba cloud to create a project , To configure .
New project


Edit workflow
The first step is to create a COS Data sets Input -- data source --COS Data sets

To configure COS Data sets
The second step is to create a modified column name Algorithm -- Machine learning algorithm -- Data preprocessing -- Change column names

Configure and modify column names

The third step is data segmentation Algorithm -- Machine learning algorithm -- Data preprocessing -- Data segmentation

Data segmentation configuration

The fourth step is to classify the decision tree Algorithm -- Machine learning algorithm -- classification -- Decision tree classification

Then configure the decision tree to classify the previous


Connect

Finally, the second classification task evaluation Output -- Model to evaluate -- II. Classification task evaluation

Run

边栏推荐
- Generate PDM file from Navicat export table
- CSDN blog summary (I) -- a simple first edition implementation
- npm一个错误 npm ERR code ENOENT npm ERR syscall open
- FRP intranet penetration
- Introduction to the easy copy module
- CSDN问答标签技能树(一) —— 基本框架的构建
- SSM整合笔记通俗易懂版
- 自动机器学习框架介绍与使用(flaml、h2o)
- MySQL19-Linux下MySQL的安装与使用
- Principes JDBC
猜你喜欢

Esp8266 at+cipstart= "", "", 8080 error closed ultimate solution

Win10: how to modify the priority of dual network cards?

CSDN question and answer tag skill tree (I) -- Construction of basic framework

一键提取pdf中的表格

neo4j安装教程

LeetCode #461 汉明距离

When you open the browser, you will also open mango TV, Tiktok and other websites outside the home page

MySQL 20 MySQL data directory

Use dapr to shorten software development cycle and improve production efficiency

自动机器学习框架介绍与使用(flaml、h2o)
随机推荐
Ansible实战系列一 _ 入门
CSDN问答标签技能树(一) —— 基本框架的构建
Some notes of MySQL
Some problems in the development of unity3d upgraded 2020 VR
【博主推荐】asp.net WebService 后台数据API JSON(附源码)
Windows cannot start the MySQL service (located on the local computer) error 1067 the process terminated unexpectedly
Idea import / export settings file
MySQL 20 MySQL data directory
Unable to call numpy in pycharm, with an error modulenotfounderror: no module named 'numpy‘
++Implementation of I and i++
Win10: how to modify the priority of dual network cards?
Windows下安装MongDB教程、Redis教程
MySQL主從複制、讀寫分離
Remember a company interview question: merge ordered arrays
Copy constructor template and copy assignment operator template
【博主推荐】C#生成好看的二维码(附源码)
Timestamp with implicit default value is deprecated error in MySQL 5.6
[C language foundation] 04 judgment and circulation
C language advanced pointer Full Version (array pointer, pointer array discrimination, function pointer)
QT creator shape