Exploratory Data Analysis of the 2019 Indian General Elections using a dataset from Kaggle.

Last update: Oct 10, 2022

Overview

2019-indian-election-eda

Exploratory Data Analysis of the 2019 Indian General Elections using a dataset from Kaggle.

This project is a part of the Course - Data Analysis using Python: Zero to Pandas offered by Jovian.ai.

We perform Exploratory Data Analyis on the 2019 Indian General Elections dataset. Here we use various Python libraries to perform Data Cleaning and Visualization. The Dataset which is used in this project is from Kaggle, authored by the user Prakrut Chauhan.

Link to the Dataset used - https://www.kaggle.com/prakrutchauhan/indian-candidates-for-general-election-2019

The dataset contains information of all the candidates who contested the elections from various Constituencies. Data includes personal information like Assets, Education, Criminal Record, etc. as well as electoral information such as Contesting Constituency, Political Party, Total Votes received, etc.

The Libraries used in the Project are:

Matplotlib (for visualization of data),
Seaborn (used alongside Matplotlib for visualization),
Numpy (used for operations on numeric data),
Pandas (used for utilising DataFrames and organising the data),
Jovian (used for downloading dataset and to run, save and upload the Notebook).

Apart from the above mentioned libraries, we use the opendatasets package to directly download the files from Kaggle and parse the data. Link to the package - https://github.com/JovianML/opendatasets

To view the Jupyter Notebook containing the EDA, click on the .ipynb file to open it. Scroll down to see the analysis. Some contents might not be visible in Dark Theme, so I recommend viewing the notebook in Light Theme.

The Notebook can also be viewed in Google Colab and Binder or can be downloaded and viewed locally.

Link to a Blog Post will be added soon.

Hope you like my work !!!

Exploratory Data Analysis of the 2019 Indian General Elections using a dataset from Kaggle.

Related tags

Overview

2019-indian-election-eda

Owner

Souradeep Banerjee

Single-Cell Analysis in Python. Scales to >1M cells.

[CVPR2022] This repository contains code for the paper "Nested Collaborative Learning for Long-Tailed Visual Recognition", published at CVPR 2022

Maximum Covariance Analysis in Python

My solution to the book A Collection of Data Science Take-Home Challenges

Retail-Sim is python package to easily create synthetic dataset of retaile store.

💬 Python scripts to parse Messenger, Hangouts, WhatsApp and Telegram chat logs into DataFrames.

Dbt-core - dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.

VHub - An API that permits uploading of vulnerability datasets and return of the serialized data

Data imputations library to preprocess datasets with missing data

PipeChain is a utility library for creating functional pipelines.

The Master's in Data Science Program run by the Faculty of Mathematics and Information Science

Statistical & Probabilistic Analysis of Store Sales, University Survey, & Manufacturing data

Exploratory Data Analysis of the 2019 Indian General Elections using a dataset from Kaggle.

Binance Kline Data With Python

Methylation/modified base calling separated from basecalling.

Extract data from a wide range of Internet sources into a pandas DataFrame.

CSV database for chihuahua (HUAHUA) blockchain transactions

Pyspark project that able to do joins on the spark data frames.

PySpark bindings for H3, a hierarchical hexagonal geospatial indexing system

Numerical Analysis toolkit centred around PDEs, for demonstration and understanding purposes not production