当前位置:网站首页>机器学习--人口普查数据分析
机器学习--人口普查数据分析
2022-07-06 09:13:00 【萍果馅是年糕】
机器学习--人口普查数据分析
在进行人口普查分析的时候需要对数据进行清洗;通过数据清洗的方法对数据进行清洗;
下载数据 从官方网站下载原始数据:UCI Machine Learning Repository
将下载好的adult.data文件转化成adult.csv文件
清洗数据
对数据进行清洗---对照https://archive.ics.uci.edu/ml/datasets/Adult中的数据信息进行清洗。
替换方法
将所有字符串替换完成后,将<=50K全部替换成0,>50K全部替换成1。
最后将?或者NAN替换成-1。注:一定要注意是否有空格。
清洗数据完成(一定要仔细数据清洗失误会导致决策树分析失败)
清洗完数据之后到阿里云创建工程,进行配置。
新建工程
编辑工作流
第一步创建一个COS数据集 输入--数据源--COS数据集
配置COS数据集
第二步创建一个修改列名 算法--机器学习算法--数据预处理--修改列名
配置修改列名
第三步进行数据切分 算法--机器学习算法--数据预处理--数据切分
数据切分配置
第四步进行决策树分类 算法--机器学习算法--分类--决策树分类
再配置决策树分类前面这个
进行连接
最后进行二分类任务评估 输出--模型评估--二分类任务评估
进行运行
边栏推荐
- Windows cannot start the MySQL service (located on the local computer) error 1067 the process terminated unexpectedly
- 报错解决 —— io.UnsupportedOperation: can‘t do nonzero end-relative seeks
- CSDN blog summary (I) -- a simple first edition implementation
- 【博主推荐】SSM框架的后台管理系统(附源码)
- csdn-Markdown编辑器
- 【博主推荐】C# Winform定时发送邮箱(附源码)
- Remember the interview algorithm of a company: find the number of times a number appears in an ordered array
- [untitled]
- 【博主推荐】asp.net WebService 后台数据API JSON(附源码)
- CSDN question and answer tag skill tree (I) -- Construction of basic framework
猜你喜欢
Installation and use of MySQL under MySQL 19 Linux
Some problems in the development of unity3d upgraded 2020 VR
A brief introduction to the microservice technology stack, the introduction and use of Eureka and ribbon
MySQL20-MySQL的数据目录
CSDN markdown editor
Valentine's Day is coming, are you still worried about eating dog food? Teach you to make a confession wall hand in hand. Express your love to the person you want
CSDN question and answer module Title Recommendation task (I) -- Construction of basic framework
Postman environment variable settings
[recommended by bloggers] background management system of SSM framework (with source code)
API learning of OpenGL (2002) smooth flat of glsl
随机推荐
Global and Chinese market of operational amplifier 2022-2028: Research Report on technology, participants, trends, market size and share
Some notes of MySQL
Solve the problem that XML, YML and properties file configurations cannot be scanned
Mysql21 user and permission management
Some problems in the development of unity3d upgraded 2020 VR
Global and Chinese market of wafer processing robots 2022-2028: Research Report on technology, participants, trends, market size and share
Install mysql5.5 and mysql8.0 under windows at the same time
MySQL21-用戶與權限管理
Postman environment variable settings
Remember the interview algorithm of a company: find the number of times a number appears in an ordered array
Install MySQL for Ubuntu 20.04
Mysql 其他主机无法连接本地数据库
The virtual machine Ping is connected to the host, and the host Ping is not connected to the virtual machine
Invalid default value for 'create appears when importing SQL_ Time 'error reporting solution
Csdn-nlp: difficulty level classification of blog posts based on skill tree and weak supervised learning (I)
February 13, 2022-3-middle order traversal of binary tree
Ansible实战系列二 _ Playbook入门
++Implementation of I and i++
连接MySQL数据库出现错误:2059 - authentication plugin ‘caching_sha2_password‘的解决方法
项目实战-后台员工信息管理(增删改查登录与退出)