pallabee/Demographic-Data-Analysis

Demographic-Data-Analysis

Overview

This project demonstrates exploratory data analysis (EDA) skills, using visualizations and statistical analysis for data mining.

Dataset

The raw dataset comprises information downloaded from the following Wikipedia pages, together with the state dataset from the R datasets package:

  1. https://en.wikipedia.org/wiki/List_of_U.S._states_and_territories_by_life_expectancy
  2. https://en.wikipedia.org/wiki/List_of_U.S._states_and_territories_by_educational_attainment
  3. https://en.wikipedia.org/wiki/Household_income_in_the_United_States
  4. https://en.wikipedia.org/wiki/List_of_U.S._states_and_territories_by_area
  5. https://en.wikipedia.org/wiki/Gun_violence_in_the_United_States_by_state
  6. https://stat.ethz.ch/R-manual/R-devel/library/datasets/html/state.html

Tasks

  1. Loading raw data from .csv, .xls, and .txt files
  2. Cleaning dirty data to account for missing values and duplicate data
  3. Preprocessing cleaned data for exploration
  4. Answering data mining questions with the help of:
    • Visualization of distributions for single and multiple variables
    • Statistical analysis
    • Analysis of correlations among variables
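
The tasks above can be sketched in a few lines of Python. This is a minimal, dependency-free illustration and not the project's actual code: the sample data, the column names (state, life_expectancy, median_income), and the helper functions are all hypothetical, and a real analysis would likely use a library such as pandas instead of the standard library.

```python
import csv
import io
import math

# Hypothetical sample standing in for a raw .csv file. The values are
# made up for illustration and are not taken from the project's data.
RAW_CSV = """state,life_expectancy,median_income
A,80.0,70000
B,78.5,62000
B,78.5,62000
C,,55000
D,76.0,50000
"""


def load_rows(text):
    """Task 1: load raw CSV text into a list of dicts."""
    return list(csv.DictReader(io.StringIO(text)))


def clean_rows(rows):
    """Task 2: drop exact duplicates and rows with missing values."""
    seen = set()
    cleaned = []
    for row in rows:
        key = tuple(row.items())
        if key in seen or any(value == "" for value in row.values()):
            continue
        seen.add(key)
        cleaned.append(row)
    return cleaned


def preprocess(rows, numeric_columns):
    """Task 3: convert the named columns from strings to floats."""
    return [
        {k: float(v) if k in numeric_columns else v for k, v in row.items()}
        for row in rows
    ]


def pearson(xs, ys):
    """Task 4: Pearson correlation between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


rows = preprocess(
    clean_rows(load_rows(RAW_CSV)),
    {"life_expectancy", "median_income"},
)
r = pearson(
    [row["life_expectancy"] for row in rows],
    [row["median_income"] for row in rows],
)
print(f"{len(rows)} clean rows, r = {r:.3f}")
```

Dropping incomplete rows is only one way to handle missing values. Imputing them (for example with a column mean) is a common alternative when the dataset is small.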

About

Exploratory Data Analysis using Python
