This is a Text Data Analysis Project Involving (YouTube Case Study).

Overview

Text_Data_Analysis

This is a Text Data Analysis Project Involving (YouTube Case Study).

Problem Statement => Sentiment Analysis.

Package1:

There are many Sentiment Packages such as Vader, Pacy. In this project i am using TextBlob which is a NLP library.

When considering Texblob Sentiment Analysis, there are two keys involved which are Polarity and Subjectivity.

-- Polarity: Which ranges from [-1 to +1] for negative and positive sentiments.

-- Subjectivity: When there are no Sentiments in a sentence.

Error_Handling

Use try,except to handle error in your code.

Package2:

The second package I'll be using to perform visualization on the sentiment Analysis is Wordcloud.

-- wordcloud:analysis give regard to the keyword with the bigger Font, therfore any keyword with this attribute has the higher priority.

-- wordcloud : data must also be stored in String nature before being passed.

Instaling wordcloud.

wordcloud can be tricky when installing . irrespective of command prompt or conda prompt.. open and run as administrator and excute the following line of codes below.

-- for conda. conda install -c https://conda.anaconda.org/conda-forge wordcloud

-- For command prompt.

git clone https://github.com/amueller/word_cloud.git
cd word_cloud
pip install .

stopword.

This are words that donot make any sense in Analysis. such as He, Him, Is, The.

-- wordcloud has parameters that removes this stopwords . ie (stopwords = reset(STOPWORDS).

Negative comment Visuation.

Negative_comment

Positive comment Visuation.

Positive_comment

Problem Statement => Emoji Analysis.

-- !pip install emoji on Jupyter notebook.

-- pip install emoji on conda prompt or Cmd prompt open and run as administrator

Note:

After iterating the emoji_list, you need to compute it into frequncies, which means you need to come up with data in the form of Dict. this can be implemented completely from Scratch, or using count. But in this analysis we shall be using Collections models.

Emoji Visualization.

emoji

Problem Statement => Collecting the Entire data of Youtube.

-- The first way to do this by using the OS, interating with the OS and using a path to access all the files.

-- The second way is Glob, which is consider to be the best way.

Most Common econdings for reading data.

-- latin, UTF-8, iso-8859-1 (essential in reading complex data example is japanese data), cp-1252

Problem Statement => Which Category has the Maximum Likes.

The category file was clean and coverted to dict, then,the Category_name in the dict was map with category_id in the full_df and store in a column created in the full_df.

-- finding the maximum likes in the category can be archived with groupby as well but we used boxplot to visualized.

Visualized most liked category.

most_like_category

Problem Statement => Find out weather Audience are Engaged or not.

The three most important keys to consider when solving this kind of problem are Like_rate, Dislike-rate, Comment_rate.

Like rate:

likes_rate

Analysis weather your Viewes will affect your likes or not.

In this case you can use Scatter plot or Regplot to check and also correlation and visualized it using heatmap

Note.

when using seaborn regplot in jupyter notebook always set ci=None, for it to execute.

Problem Statement => Analyse trending videos

In this project the data frame had only channel_tile and video_id, so we used a groupby function to group the video_id's according to there channel_title. and renamed the video_id's column to total_video according to there channel_title respectfully, the channel with the highest count of video_id had the trending vidoes.

Visualized Trending videos.

trending videos

Problem Statement => Does Punctuation in a title and tags have any relations with views,likes,dislikes and comments?.

We have to extracts all the punctuations marks from title, channel_title or tags, and this can be done completely from scratch or using python build in modules like Regular expression, Strings. in this project we shall use the string module.

--- To check if the punctuatons will affect the views,likes,dislikes or comments. use correlatoin to check the count punctuations from title, channel_title or tags on the views,likes,dislikes or comments.

title punctuation count plot.

title_punc_count

Owner
Data Science || Data Analyst.
Itchio Downloader Tool with python

Itchio Downloader Tool Install pip install git+https://github.com/emersont1/itchio Download All Games in library from account python -m itchio.downloa

Peter Taylor 69 Dec 05, 2022
Heroic-gogdl - GOG Downloading module for Heroic Games Launcher

heroic-gogdl GOG download module for Heroic Games Launcher Purpose This will tak

Paweł Lidwin 36 Dec 23, 2022
YTPY Youtube Downloader Made by: Ferreira, Amarau and Rodric

YTPY Youtube Downloader Made by: Ferreira, Amarau and Rodric How to Install on Linux: sudo apt install python3 python3-pip git pip install pytube git

7 Nov 24, 2022
Python module to donwload all Pixiv artworks of a user using it's user ID.

Python module to donwload all Pixiv artworks of a user using it's user ID. You need a PHPSESSID token to export NSFW.

Quatrecentquatre 1 Jan 27, 2022
Download Youtube videos in mp4 format in a fast, easy, convenient way made with Python!

yt_downloader Download Youtube videos in mp4 format in a fast, easy, convenient way made with Python! Required Modules pytube os time colorama Errors

3 Jul 02, 2022
Scripts to download files and folders programmatically from Google Drive

Google Drive Downloader Scripts Every time I need to download a lot of files from Google Drive (e.g. a dataset), it's always incredibly frustrating an

Ivan Evtimov 6 Jul 22, 2021
Downloads yiffer.xyz comics as images

yiffer-dl Downloads comics as images from yiffer.xyz.

Maxim 2 Mar 20, 2022
The free and open-source Download Manager written in pure Python

The free and open-source Download Manager written in pure Python

pyLoad 2.7k Dec 31, 2022
Stremio addon for fetching videos from your google drive.

stremio-gdrive Instructions: There are two ways to go about: Method 1 is hard and long but might give you better performance and you need to make your

72 Dec 31, 2022
Downloads data from OSM API and uploads it to the mapping sandbox.

OpenStreetMap To Sandbox This is a script to download data from OSM API and upload it to the mapping sandbox. Note that it clears all data in the sand

Ilya Zverev 5 Nov 27, 2022
A simple GUI video downloader built off of the python module 'yt-dlp'

Simple-Youtube-DL-Gui Supported Operating Systems Windows 7 (x64), Windows 8 (x64), and Windows 10 (x64) How to use Main Gui Extract program from arch

12 Dec 30, 2022
This is a tool to allow downloading any links from r/Roms Magethread

ILYFPR This is a tool to allow downloading any links from r/Roms Magethread Limitations: It downloads ALL roms of the system you choose. This will be

Erase 4 Nov 04, 2021
In this repository you will find the test carried out to enter, as a python developer, the company Keeper Solutions.

Bookmarks In this repository you will find the test carried out to enter, as a python developer, the company Keeper Solutions. First it is necessary t

0 Jan 12, 2022
A Celery application to collect data, download media and extract information from social media APIs

Project IBEX A Celery application to collect data, download media and extract information from social media APIs. Requirements You must have a Redis D

ibex 4 Dec 15, 2022
Bulk Downloader for Reddit

saveddit is a bulk media downloader for reddit pip3 install saveddit Setting up authorization Register an application with Reddit Write down your clie

Pranav 136 Jan 03, 2023
Download India Stocks Historical Data

Kite Helper - Download Stock Market Data 🌎 Website Simple Application to Download any stock market data in .csv format using Kite 🏃‍♂️ Running Serve

Pishang Ujeniya 12 Dec 06, 2022
Tool to download Netflix in 4k

Netflix-4K-Script Tool to download Netflix in 4k You will need to get a L1 CDM that is whitelsited with Netflix CDM In this script are downgraded

9 Dec 23, 2021
Quickly, simply, and asynchronously download NFT's from an Opensea collection

iRightClick Quickly, simply, and asynchronously download NFT's from an Opensea collection. NOTICE This tool is not developed to encourage or facilitat

Setro 34 Dec 30, 2022
Make YouTube videos tasks in Todoist faster and time efficient!

Youtubist Basically fork of yt-dlp python module to my needs. You can paste playlist or channel link on the YouTube. It will automatically format to s

Konrad Konieczny 1 Dec 04, 2022
Download your Spotify playlists and songs along with album art and metadata

spotDL Download your Spotify playlists and songs along with album art and metadata The fastest, easiest, and most accurate command-line music download

10.6k Jan 03, 2023