Search This Blog

Tuesday, June 29, 2021

Exploratory data analysis using Python

Open-sourcing Python module for Exploratory Data Analysis, which can be used for any data set

This module has the following sub-functions for the data analysis

  1. Getting to know the data
  2. Data pre-processing / missing data
  3. Crosstable and data validation and visualization
  4. Logistic Regression on the data set
  5. KNN analysis
How to use it?

  1. Please install Anaconda https://docs.anaconda.com/anaconda/navigator/
  2. Please install Spider IDE https://docs.spyder-ide.org/current/index.html
  3. Download the eda.py from this project repo and a few sample data sets 
  4. Run the eda.py in Spider IDE, and when prompted, provide the data file
  5. Graphs will be populated in the Plots area in Spider
  6. At any point in time, you can exit a particular loop or sub-function by typing 'exit'

1 comment:

Harsh Vardhan said...

Great Content. Thanks for sharing this valuable information.
Hybrid Cloud Services
Hybrid Cloud Hosting