当前位置:网站首页>机器学习--人口普查数据分析
机器学习--人口普查数据分析
2022-07-06 09:13:00 【萍果馅是年糕】
机器学习--人口普查数据分析
在进行人口普查分析的时候需要对数据进行清洗;通过数据清洗的方法对数据进行清洗;
下载数据 从官方网站下载原始数据:UCI Machine Learning Repository



将下载好的adult.data文件转化成adult.csv文件

清洗数据
对数据进行清洗---对照https://archive.ics.uci.edu/ml/datasets/Adult中的数据信息进行清洗。

替换方法









将所有字符串替换完成后,将<=50K全部替换成0,>50K全部替换成1。


最后将?或者NAN替换成-1。注:一定要注意是否有空格。

清洗数据完成(一定要仔细数据清洗失误会导致决策树分析失败)

清洗完数据之后到阿里云创建工程,进行配置。
新建工程


编辑工作流
第一步创建一个COS数据集 输入--数据源--COS数据集

配置COS数据集
第二步创建一个修改列名 算法--机器学习算法--数据预处理--修改列名

配置修改列名

第三步进行数据切分 算法--机器学习算法--数据预处理--数据切分

数据切分配置

第四步进行决策树分类 算法--机器学习算法--分类--决策树分类

再配置决策树分类前面这个


进行连接

最后进行二分类任务评估 输出--模型评估--二分类任务评估

进行运行

边栏推荐
- C language advanced pointer Full Version (array pointer, pointer array discrimination, function pointer)
- Valentine's Day is coming, are you still worried about eating dog food? Teach you to make a confession wall hand in hand. Express your love to the person you want
- 【博主推荐】C# Winform定时发送邮箱(附源码)
- MySQL23-存储引擎
- CSDN问答标签技能树(五) —— 云原生技能树
- Solution: log4j:warn please initialize the log4j system properly
- frp内网穿透那些事
- Copy constructor template and copy assignment operator template
- API learning of OpenGL (2002) smooth flat of glsl
- API learning of OpenGL (2001) gltexgen
猜你喜欢

CSDN问答标签技能树(五) —— 云原生技能树

Other new features of mysql18-mysql8

1. Mx6u learning notes (VII): bare metal development (4) -- master frequency and clock configuration

Install mysql5.5 and mysql8.0 under windows at the same time

Mysql21 user and permission management

连接MySQL数据库出现错误:2059 - authentication plugin ‘caching_sha2_password‘的解决方法

API learning of OpenGL (2003) gl_ TEXTURE_ WRAP_ S GL_ TEXTURE_ WRAP_ T

Deoldify项目问题——OMP:Error#15:Initializing libiomp5md.dll,but found libiomp5md.dll already initialized.

Just remember Balabala
![[recommended by bloggers] C WinForm regularly sends email (with source code)](/img/5d/57f8599a4f02c569c6c3f4bcb8b739.png)
[recommended by bloggers] C WinForm regularly sends email (with source code)
随机推荐
Global and Chinese market of transfer switches 2022-2028: Research Report on technology, participants, trends, market size and share
[reading notes] rewards efficient and privacy preserving federated deep learning
记一次某公司面试题:合并有序数组
February 13, 2022-2-climbing stairs
csdn-Markdown编辑器
API learning of OpenGL (2002) smooth flat of glsl
Idea import / export settings file
[recommended by bloggers] C MVC list realizes the function of adding, deleting, modifying, checking, importing and exporting curves (with source code)
02-项目实战之后台员工信息管理
【博主推荐】C#生成好看的二维码(附源码)
February 13, 2022 - Maximum subarray and
The virtual machine Ping is connected to the host, and the host Ping is not connected to the virtual machine
windows下同时安装mysql5.5和mysql8.0
IDEA 导入导出 settings 设置文件
API learning of OpenGL (2003) gl_ TEXTURE_ WRAP_ S GL_ TEXTURE_ WRAP_ T
Copie maître - esclave MySQL, séparation lecture - écriture
Asp access Shaoxing tourism graduation design website
Ansible practical Series III_ Task common commands
MySQL20-MySQL的数据目录
Valentine's Day is coming, are you still worried about eating dog food? Teach you to make a confession wall hand in hand. Express your love to the person you want