✨ Real-life Data Analysis and Model Training Workshop by Global AI Hub.

Overview

🎓 Data Analysis and Model Training Course by Global AI Hub

Syllabus:

Day 1

  • What is Data?

  • Multimedia

  • Structured and Unstructured Data

  • Data Types

  • Data Visualization

    • What is Visualization?
    • Tufte's 6 Principles
    • Visualization Types
      • Line Plot
      • Scatter Plot
      • Bar Plot
      • Histogram
      • Pie Charts
      • Heatmap
      • Box Plot
      • What is a Quartile? How is it Calculated?
      • Joint Plot
      • KDE (Kernel Density Estimate)
  • Statistics

    • Descriptive Statistics Concepts
    • The Concept of Skewness
    • Correlation and Correlation Matrix
    • Simpson's Paradox
    • Anscombe's Quartet
    • Data Distribution and Hypothesis Testing
  • Data Distribution

    • Data and Distribution
    • Gaussian (Normal) Distribution
    • t-Distribution
    • Degrees of Freedom
    • Bernoulli Distribution
    • Exponential Distribution
  • Application

    • Pandas Revision
    • Introduction to Data Preprocessing with Pandas
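
As a preview of the quartile and box plot topics from Day 1, here is a minimal sketch. It is not taken from the course materials: the sample values are hypothetical, and it uses Python's standard `statistics` module instead of Pandas so that it runs without extra installs.

```python
import statistics

# Hypothetical sample; not taken from the course materials.
data = [2, 4, 4, 5, 7, 9, 10, 12, 13, 40]

# Quartiles split the sorted data into four equal parts.
# method="inclusive" interpolates linearly between the sample's own
# minimum and maximum.
q1, q2, q3 = statistics.quantiles(data, n=4, method="inclusive")
iqr = q3 - q1  # interquartile range: the height of the box in a box plot

# Tukey's fences: points further than 1.5 * IQR from the box are
# usually drawn as individual outlier markers.
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr
outliers = [x for x in data if x < lower_fence or x > upper_fence]

print(f"Q1={q1}, median={q2}, Q3={q3}, IQR={iqr}")
print("outliers:", outliers)
```

With Pandas, the corresponding call would be `Series.quantile([0.25, 0.5, 0.75])`.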

Day 2

  • Hypothesis Tests

    • Basic Hypothesis Testing
    • P-value
    • T-test
    • Z-test
    • Chi-Square Test
    • Errors in Hypothesis Testing
  • Data Cleaning

    • The 68-95-99.7 Rule and 3 Sigma
    • Outlier, Missing and Duplicate Data and their Detection
    • Z-Score
    • Handling Missing Values
    • Null vs NaN
    • Pandas Functions for Missing Values
    • Dimensionality Reduction
    • PCA (Principal Component Analysis)
    • Collinearity (Multicollinearity)
  • Data Transformation

    • Data Conversion Techniques
      • round
      • Scaling
      • Label Encoding
      • One Hot Encoding
      • Stack
      • melt
      • Shorts
      • Feature Engineering
  • Data Augmentation

    • Aggregation Functions
  • Application

    • Data Visualization with Seaborn
    • Data Preprocessing with Pandas
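
The 68-95-99.7 rule and Z-score outlier detection from the Data Cleaning topics can be illustrated with a short sketch. The measurements are hypothetical and the approach is an assumption about how the topics fit together, not the course's own code. It uses only the standard library.

```python
import statistics

# Share of a normal distribution within 1, 2 and 3 standard deviations
std_normal = statistics.NormalDist(mu=0.0, sigma=1.0)
coverage = {k: std_normal.cdf(k) - std_normal.cdf(-k) for k in (1, 2, 3)}

# Hypothetical measurements with one suspicious value (120)
values = [48, 50, 52, 49, 51, 50, 47, 53, 50, 50,
          49, 51, 50, 52, 48, 50, 51, 49, 50, 120]

mean = statistics.fmean(values)
std = statistics.pstdev(values)  # population standard deviation

# Z-score: distance from the mean in units of standard deviation
z_scores = [(x - mean) / std for x in values]

# 3-sigma rule: flag points more than 3 standard deviations from the mean
outliers = [x for x, z in zip(values, z_scores) if abs(z) > 3]

for k, share in coverage.items():
    print(f"within {k} sigma: {share:.4f}")
print("outliers:", outliers)
```

Note that the outlier itself inflates the mean and standard deviation, so in very small samples a Z-score above 3 may be impossible to reach; the quartile-based fences are often more robust there.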

Day 3

  • ML Review

    • What is Machine Learning?
    • Supervised Learning
    • Unsupervised Learning
    • Errors That May Be Encountered in Model Training
    • Tools Used in Data Analysis and Machine Learning
    • End-to-End Machine Learning Project Steps
  • Application

    • Training An End-to-End ML Model with a Real Dataset
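
The syllabus does not list the individual end-to-end steps, so the following sketch assumes a common minimal sequence: split the data, train a model, evaluate on held-out points. The data is synthetic and the single-feature least-squares model is only a stand-in; a real project would likely use a dedicated library and a real dataset.

```python
# Hypothetical data: y is roughly 3*x + 2 with small alternating noise
xs = [float(i) for i in range(20)]
ys = [3.0 * x + 2.0 + (0.5 if i % 2 == 0 else -0.5) for i, x in enumerate(xs)]

# 1. Split: hold out every fourth point for testing
train = [(x, y) for i, (x, y) in enumerate(zip(xs, ys)) if i % 4 != 3]
test = [(x, y) for i, (x, y) in enumerate(zip(xs, ys)) if i % 4 == 3]

# 2. Train: ordinary least squares for a single feature
n = len(train)
mean_x = sum(x for x, _ in train) / n
mean_y = sum(y for _, y in train) / n
slope = (sum((x - mean_x) * (y - mean_y) for x, y in train)
         / sum((x - mean_x) ** 2 for x, _ in train))
intercept = mean_y - slope * mean_x

# 3. Evaluate: mean absolute error on the held-out points
mae = sum(abs(y - (slope * x + intercept)) for x, y in test) / len(test)

print(f"slope={slope:.3f} intercept={intercept:.3f} test MAE={mae:.3f}")
```

Evaluating on points the model never saw during training is what exposes overfitting, one of the training errors the ML Review covers.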

Certification

Course completion is certified.

Owner
Global AI Hub
Where AI Talent Meets Opportunity