This project performs ETL and data analysis on the given dataset to derive key metrics and patterns. In addition, several visualizations are developed to depict meaningful relationships in the data.
Problem Statement
Health is real wealth. During the pandemic, we all saw the brutal effects of Covid-19 on everyone, irrespective of status. You are required to analyze this health and medical data for better future preparedness.
As is rightly said, "Health is Wealth". We realized this during the pandemic after witnessing the brutal effects of Covid-19 on people of all age groups. Apart from this, another major contributor to the death rate is heart-related disease.
Heart diseases are known to take a major toll on people's lives. A layperson may assume that heart-related disease comes down to cardiac arrest or blockages. However, the dataset under analysis describes multiple medical parameters associated with the heart and their typical values. We will analyze the relationships between them and study the implications of changes in those parameters. In this project, we use Tableau, a popular and powerful BI tool.
Tools
1. Jupyter Notebook
2. Pandas
3. NumPy
4. Matplotlib
5. MS Excel
6. Tableau
Approach For Data Analysis
Data Extraction
Data Preprocessing
Data Exporting
Dataset Loading and Modification
Data Analysis
Deployment
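The first three steps (extraction, preprocessing, exporting) can be sketched in pandas roughly as follows. This is a minimal illustration, not the project's actual notebook. The sample rows are made up, the use of "?" as a missing-value marker is an assumption, and the helper names `extract`, `preprocess`, and `export` are hypothetical.

```python
import io
import pandas as pd

# Hypothetical raw extract: a few made-up rows using column names from this README.
RAW_CSV = """age,sex,cp,trestbps,chol,fbs,num
63,1,1,145,233,1,0
67,1,4,160,286,0,2
67,1,4,160,286,0,2
41,0,2,130,?,0,0
56,1,4,,236,0,1
"""

def extract(source):
    """Data extraction: read the raw file, treating '?' as missing (assumption)."""
    return pd.read_csv(source, na_values="?")

def preprocess(df):
    """Data preprocessing: drop duplicate rows, then fill numeric gaps with the column median."""
    df = df.drop_duplicates().reset_index(drop=True)
    return df.fillna(df.median(numeric_only=True))

def export(df, target):
    """Data exporting: write a clean CSV that Excel or Tableau can load."""
    df.to_csv(target, index=False)

raw = extract(io.StringIO(RAW_CSV))
clean = preprocess(raw)

# An in-memory buffer stands in for the output file in this sketch.
out = io.StringIO()
export(clean, out)
print(clean)
```

In a real run, `extract` would receive the path of the source file and `export` a path such as a `.csv` that Tableau then loads. Median filling is only one possible choice for handling missing values.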
Key Performance Indicators (KPIs)
Key indicators summarizing heart disease and its relationship with different metrics:
Percentage of People Having Heart Disease
Variation of 'thal' (Thalassemia type) with 'sex'
Variation of 'chol' (Cholesterol) and 'trestbps' (Resting blood pressure) with 'fbs' (Fasting blood sugar)
Variation of 'exang' (Exercise-induced angina) with 'cp' (Chest pain type)
Variation of 'num' (Angiographic disease status) with 'sex'
Variation of 'age' with 'chol' (Cholesterol) and 'sex'
Variation of 'cp' (Chest pain type) with 'sex'
Variation of 'thalach' (Maximum heart rate) with 'age'
Variation of 'restecg' (Resting electrocardiograph results) with 'sex'
Variation of 'slope' (Slope of the peak exercise ST segment), 'restecg' (Resting electrocardiograph results) and 'oldpeak' (ST depression induced by exercise relative to rest)
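As a rough illustration of how KPIs like these might be computed in pandas before building the Tableau views, the sketch below uses a small made-up sample. It assumes that `num > 0` marks a patient with heart disease. The `has_disease` column and the sample values are hypothetical and not taken from the project's dataset.

```python
import pandas as pd

# Hypothetical sample rows (made-up values) using column names from the KPI list.
df = pd.DataFrame({
    "sex":  [1, 1, 0, 0, 1, 0, 1, 1],
    "cp":   [4, 4, 2, 3, 1, 4, 3, 4],
    "fbs":  [0, 1, 0, 0, 1, 0, 0, 1],
    "chol": [286, 240, 204, 250, 250, 268, 192, 260],
    "num":  [2, 1, 0, 0, 0, 3, 0, 1],
})

# Assumption: num > 0 marks a patient with heart disease.
df["has_disease"] = df["num"] > 0

# KPI: percentage of people having heart disease.
pct_disease = round(df["has_disease"].mean() * 100, 2)
print(pct_disease)  # 50.0 for this made-up sample

# Variation of one categorical column with another, e.g. 'cp' with 'sex':
# a count of patients for every (sex, cp) combination.
cp_by_sex = pd.crosstab(df["sex"], df["cp"])
print(cp_by_sex)

# Variation of a numeric column with a categorical one,
# e.g. mean 'chol' for each 'fbs' value.
chol_by_fbs = df.groupby("fbs")["chol"].mean()
print(chol_by_fbs)
```

The same two patterns (`pd.crosstab` for two categorical columns, `groupby` with an aggregate for a numeric column) cover most of the "Variation of X with Y" indicators listed above.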
Conclusion
45.87% of people in the dataset have heart disease.
Men are most numerous in the 50 to 60 age range, and women in the 55 to 65 range.
Males are more prone to heart disease.
Elderly people are more prone to heart disease.
People with asymptomatic chest pain have a higher chance of heart disease.
People with heart disease tend to have high cholesterol levels.
Blood pressure increases between the ages of 50 and 60, and the trend continues up to age 70.
Cholesterol and maximum heart rate increase in the 50-60 age group.
ST depression mostly increases in the 30-40 age group.
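Age-group findings like the ones above can be checked by binning 'age' and aggregating per bin. The following is a minimal sketch on made-up rows. The ten-year bin edges, the `age_group` and `has_disease` columns, and the assumption that `num > 0` means heart disease are all illustrative choices, not the project's confirmed method.

```python
import pandas as pd

# Hypothetical sample rows (made-up values) for illustration only.
df = pd.DataFrame({
    "age":      [34, 45, 52, 58, 61, 67, 55, 39],
    "trestbps": [118, 130, 140, 150, 145, 160, 138, 120],
    "num":      [0, 0, 1, 2, 1, 3, 0, 0],
})

# Ten-year age groups, closed on the left: [30, 40), [40, 50), ...
bins = [30, 40, 50, 60, 70]
labels = ["30-39", "40-49", "50-59", "60-69"]
df["age_group"] = pd.cut(df["age"], bins=bins, labels=labels, right=False)

# Assumption: num > 0 marks a patient with heart disease (1 = yes, 0 = no).
df["has_disease"] = (df["num"] > 0).astype(int)

# Per age group: patient count, mean resting blood pressure, share with disease.
summary = df.groupby("age_group", observed=False).agg(
    patients=("age", "count"),
    mean_trestbps=("trestbps", "mean"),
    disease_rate=("has_disease", "mean"),
)
print(summary)
```

Swapping 'trestbps' for 'chol', 'thalach', or 'oldpeak' gives the corresponding per-age-group view for the other conclusions.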