当前位置:网站首页>[new book recommendation] cleaning data for effective data science
[new book recommendation] cleaning data for effective data science
2022-06-30 09:42:00 【Librarian】
Hello everyone , The purpose of this account is to share some of the world's latest technical books and information for programmers who want to improve themselves , Today brings with it 2021 year 3 Month by month Packt The latest book on big data published by the publishing house , The language involved is Python and R Language .
Cleaning Data for Effective Data Science

author :David Mertz
Press. :Packt
Publication date :2021-03-31
ISBN:9781801071291
Book Introduction
In Data Science , Data analysis or machine learning , Most of the work required to achieve practical purposes is to clean up the data , This is self-evident . This book uses David The iconic friendly humorous style , The basic steps performed in each production data science or data analysis pipeline are discussed in detail , And it is ready for data visualization and modeling results .
This book delves into data extraction , Anomaly detection , Practical application of tools and technologies required for value estimation and functional engineering . Long exercises are provided at the end of each chapter , To practice the skills acquired .
You will first view such as JSON,CSV,SQL RDBMSes,HDF5,NoSQL database , Image format file and data capture of data format such as binary serialized data structure . Besides , This book provides many sample datasets and data files , Available for download and independent exploration .
Continue from format , You will estimate the missing value , Detect unreliable data and statistical anomalies , And generate the comprehensive functions necessary for successful data analysis and visualization .
By the end of the book , You will have an in-depth understanding of the data cleansing process required to perform actual data science and machine learning tasks .
What will you learn
How to carefully consider your data and ask the right questions
Identify problem data related to a single data point
Based on the data of the system “ shape ” Detect problem data
Remediate data integrity and health issues
Prepare data for analysis and machine learning tasks
Interpolate values into missing or unreliable data
Generation is more suitable for data science , A comprehensive function of data analysis or visualization objectives .
Who is this book for
This book is designed to make software developers interested in data analysis or scientific computing , Data scientist , Aspiring data scientists and students benefit .
Familiar with statistical knowledge , General concepts of machine learning , programing language (Python or R) Knowledge of data science and some understanding of data science will be very helpful . glossary , Reference materials and friendly help should help all readers quickly master .
This text will also be helpful for intermediate and advanced data scientists who want to improve their data hygiene rigor and review data preparation issues .
This is today's sharing , I wonder if it will help you , If you think it's good, please give me a compliment , It would be better if you could pay attention to me . If you want to get the of this book pdf You can click on the book's hyperlink , You are also welcome to leave messages and private letters in the comment area , I'll keep updating . I wish everyone can grow rapidly , Get rid of 996~ Refueling workers !!
边栏推荐
- 3.集成eslint、prettier
- Reading notes of "Introduction to deep learning: pytoch"
- utils 协程
- Alibaba billion concurrent projects in architecture
- Idea setting automatic package Guide
- Cronexpression expression explanation and cases
- Review the old and know the new
- Row column (vertical and horizontal table) conversion of SQL
- Pipe pipe --namedpipe and anonymouspipe
- Dart 开发技巧
猜你喜欢

JVM garbage collector G1 & ZGC details

Comparison problems encountered in recent study

Express file upload

Notes on masking and padding in tensorflow keras

桂林 穩健醫療收購桂林乳膠100%股權 填補乳膠產品線空白

【Ubuntu-redis安装】

Flutter 中的 ValueNotifier 和 ValueListenableBuilder

MySQL-- Entity Framework Code First(EF Code First)

Tutorial for beginners of small programs day01

Abstract classes and interfaces
随机推荐
Pytorch graduate warm LR installation
utils session&rpc
ReturnJson,让返回数据多一些自定义数据或类名
MCU firmware packaging Script Software
Applet learning path 2 - event binding
Net framework system requirements
MySQL knowledge summary (useful for thieves)
Deep Learning with Pytorch- A 60 Minute Blitz
Redis docker 主从模式与哨兵sentinel
Cb/s Architecture - Implementation Based on cef3+mfc
八大排序(一)
机器学习笔记 九:预测模型优化(防止欠拟合和过拟合问题发生)
9.JNI_ Necessary optimization design
MySQL internal component structure
1, 基本配置
Common query and aggregation of ES
Tclistener server and tcpclient client use -- socket listening server and socketclient use
Cronexpression expression explanation and cases
Simple redis lock
【Ubuntu-redis安装】