Machine learning -- census data analysis
2022-07-06 11:10:00 【Pingguo stuffed with rice cakes】
Census data must be cleaned before it can be analyzed.
Download data
Download the original data from the official website: UCI Machine Learning Repository
Rename the downloaded adult.data file to adult.csv.
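adult.data is already comma-separated text, so renaming it is enough. If you would rather script the step, the following is a minimal sketch that copies the file to a .csv while trimming the space that follows each comma. The function name data_to_csv is made up for this illustration, and no header row is written, on the assumption that column names are assigned later in the workflow.

```python
import csv
from pathlib import Path


def data_to_csv(src, dst):
    """Copy a comma-separated .data file to .csv.

    Trims whitespace around every value and drops blank lines.
    Returns the number of rows written. (Hypothetical helper, not part
    of any library.)
    """
    count = 0
    with open(src, newline="") as fin, open(dst, "w", newline="") as fout:
        writer = csv.writer(fout)
        for row in csv.reader(fin, skipinitialspace=True):
            cells = [cell.strip() for cell in row]
            if any(cells):  # skip empty lines
                writer.writerow(cells)
                count += 1
    return count


if __name__ == "__main__":
    # Only runs when the downloaded file sits in the current directory.
    if Path("adult.data").exists():
        print(data_to_csv("adult.data", "adult.csv"), "rows written")
```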
Data cleaning
Clean the data by checking it against the attribute information listed at https://archive.ics.uci.edu/ml/datasets/Adult.
Replacement method
After replacing all the string values, replace every <=50K with 0 and every >50K with 1.
Finally, replace ? or NaN with -1. Note: check whether the values carry spaces (in adult.data each comma is followed by a space), otherwise the replacement will miss them.
Data cleaning is now complete. (Be careful: errors in data cleaning will cause the decision tree analysis to fail.)
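The replacement rules above can also be sketched in Python instead of doing the replacements by hand. This is an illustration under assumptions, not the exact mapping used in this post: the name clean_rows is made up, and each string value simply gets an integer code in the order it first appears in its column.

```python
LABELS = {"<=50K": 0, ">50K": 1}
MISSING = {"?", "", "nan", "NaN", "NAN"}


def clean_rows(rows):
    """Turn every cell into an integer.

    - whitespace around values is stripped first
    - ? / NaN / empty cells become -1
    - <=50K becomes 0 and >50K becomes 1
    - any other string gets a per-column integer code in first-seen
      order (an assumption; the post does not say which codes it used)
    Cells are parsed with int(), which fits the Adult data; a decimal
    value would be treated as a string category.
    """
    codes = {}  # column index -> {string value: integer code}
    cleaned = []
    for row in rows:
        cells = [cell.strip() for cell in row]
        if not any(cells):  # skip blank lines
            continue
        out = []
        for col, cell in enumerate(cells):
            if cell in MISSING:
                out.append(-1)
            elif cell in LABELS:
                out.append(LABELS[cell])
            else:
                try:
                    out.append(int(cell))
                except ValueError:
                    col_codes = codes.setdefault(col, {})
                    out.append(col_codes.setdefault(cell, len(col_codes)))
        cleaned.append(out)
    return cleaned


if __name__ == "__main__":
    # Made-up three-column rows, only to show the shape of the result.
    sample = [
        ["39", " State-gov", " <=50K"],
        ["52", " ?", " >50K"],
        ["28", " Private", " >50K"],
    ]
    print(clean_rows(sample))  # [[39, 0, 0], [52, -1, 1], [28, 1, 1]]
```

Stripping whitespace before any comparison is what guards against the space problem mentioned in the note: " ?" and " >50K" would otherwise never match.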
After cleaning the data, go to Alibaba Cloud, create a project, and configure it.
New project
Edit workflow
Step 1: create a COS dataset (Input -- Data source -- COS dataset)
Configure the COS dataset
Step 2: add a change-column-names node (Algorithm -- Machine learning algorithm -- Data preprocessing -- Change column names)
Configure the column name changes
Step 3: data segmentation (Algorithm -- Machine learning algorithm -- Data preprocessing -- Data segmentation)
Configure data segmentation
Step 4: decision tree classification (Algorithm -- Machine learning algorithm -- Classification -- Decision tree classification)
Then configure the decision tree classification and connect it to the previous step
Finally, add the binary classification task evaluation (Output -- Model evaluation -- Binary classification task evaluation)
Run
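To get a feel for what the workflow computes, the split, classify, and evaluate stages can be imitated locally. The sketch below uses only the Python standard library and makes loud assumptions: all function names are made up, the 80/20 split ratio is an arbitrary choice, and a one-threshold "decision stump" stands in for the platform's decision tree component, which is a much simpler model than a real decision tree.

```python
import random


def split_rows(rows, train_fraction=0.8, seed=0):
    """Data segmentation: shuffle a copy of rows and cut it into (train, test)."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    cut = int(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]


def fit_stump(rows, feature, label=-1):
    """Pick the threshold on one numeric column with the fewest training errors.

    The stump predicts 1 when row[feature] > threshold, else 0.
    This is a stand-in for a decision tree, not a full tree.
    """
    best_threshold, best_errors = None, None
    for threshold in sorted({row[feature] for row in rows}):
        errors = sum(int(row[feature] > threshold) != row[label] for row in rows)
        if best_errors is None or errors < best_errors:
            best_threshold, best_errors = threshold, errors
    return best_threshold


def predict(rows, feature, threshold):
    """Apply the stump to every row."""
    return [int(row[feature] > threshold) for row in rows]


def binary_metrics(y_true, y_pred):
    """Binary classification evaluation: accuracy, precision, recall, F1."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == 1 and p == 1 for t, p in pairs)
    fp = sum(t == 0 and p == 1 for t, p in pairs)
    fn = sum(t == 1 and p == 0 for t, p in pairs)
    tn = sum(t == 0 and p == 0 for t, p in pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    accuracy = (tp + tn) / len(pairs) if pairs else 0.0
    return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}


if __name__ == "__main__":
    # Synthetic rows [value, label], invented for the demo; not census data.
    rows = [[v, int(v > 45)] for v in range(20, 70)]
    train, test = split_rows(rows)
    threshold = fit_stump(train, feature=0)
    y_true = [row[-1] for row in test]
    print(binary_metrics(y_true, predict(test, 0, threshold)))
```

With cleaned census rows, the last column would be the 0/1 income label produced during cleaning, and the evaluation numbers correspond to what the binary classification evaluation node reports.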