当前位置:网站首页>Machine learning -- census data analysis
Machine learning -- census data analysis
2022-07-06 11:10:00 【Pingguo stuffed with rice cakes】
machine learning -- Census data analysis
It is necessary to clean the data when conducting census analysis ; Clean the data by data cleaning ;
Download data Download the original data from the official website :UCI Machine Learning Repository
Will download okay adult.data File to adult.csv file
Data cleaning
Clean the data --- contrast https://archive.ics.uci.edu/ml/datasets/Adult Clean the data information in .
Alternative method
After replacing all strings , take <=50K Replace all with 0,>50K Replace all with 1.
The final will be ? perhaps NAN Replace with -1. notes : Be sure to pay attention to whether there are spaces .
Cleaning data is completed ( We must be careful that data cleaning errors will lead to the failure of decision tree analysis )
After cleaning the data, go to Alibaba cloud to create a project , To configure .
New project
Edit workflow
The first step is to create a COS Data sets Input -- data source --COS Data sets
To configure COS Data sets
The second step is to create a modified column name Algorithm -- Machine learning algorithm -- Data preprocessing -- Change column names
Configure and modify column names
The third step is data segmentation Algorithm -- Machine learning algorithm -- Data preprocessing -- Data segmentation
Data segmentation configuration
The fourth step is to classify the decision tree Algorithm -- Machine learning algorithm -- classification -- Decision tree classification
Then configure the decision tree to classify the previous
Connect
Finally, the second classification task evaluation Output -- Model to evaluate -- II. Classification task evaluation
Run
边栏推荐
- Did you forget to register or load this tag 报错解决方法
- 机器学习--人口普查数据分析
- Django running error: error loading mysqldb module solution
- MySQL主从复制、读写分离
- 02-项目实战之后台员工信息管理
- Detailed reading of stereo r-cnn paper -- Experiment: detailed explanation and result analysis
- CSDN blog summary (I) -- a simple first edition implementation
- QT creator specifies dependencies
- Ansible实战系列二 _ Playbook入门
- Development of C language standard
猜你喜欢
[recommended by bloggers] background management system of SSM framework (with source code)
CSDN问答模块标题推荐任务(一) —— 基本框架的搭建
[ahoi2009]chess Chinese chess - combination number optimization shape pressure DP
QT creator create button
Redis的基础使用
Image recognition - pyteseract TesseractNotFoundError: tesseract is not installed or it‘s not in your path
CSDN blog summary (I) -- a simple first edition implementation
Generate PDM file from Navicat export table
Summary of numpy installation problems
MySQL主從複制、讀寫分離
随机推荐
Are you monitored by the company for sending resumes and logging in to job search websites? Deeply convinced that the product of "behavior awareness system ba" has not been retrieved on the official w
Ubuntu 20.04 安装 MySQL
MySQL主從複制、讀寫分離
MySQL master-slave replication, read-write separation
Summary of numpy installation problems
【博主推荐】C#MVC列表实现增删改查导入导出曲线功能(附源码)
Solution: log4j:warn please initialize the log4j system properly
There are three iPhone se 2022 models in the Eurasian Economic Commission database
记某公司面试算法题:查找一个有序数组某个数字出现的次数
一键提取pdf中的表格
LeetCode #461 汉明距离
Postman uses scripts to modify the values of environment variables
解决扫描不到xml、yml、properties文件配置
Installation and use of MySQL under MySQL 19 Linux
MySQL 20 MySQL data directory
[recommended by bloggers] C MVC list realizes the function of adding, deleting, modifying, checking, importing and exporting curves (with source code)
[ahoi2009]chess Chinese chess - combination number optimization shape pressure DP
QT creator create button
Ansible实战系列三 _ task常用命令
Postman Interface Association