An open-source Python module for exploratory data analysis (EDA) that can be used with any data set.
The module provides the following sub-functions for data analysis:
- Getting to know the data
- Data pre-processing / missing data
- Cross-tabulation, data validation, and visualization
- Logistic regression on the data set
- K-nearest neighbors (KNN) analysis
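
As a rough illustration of the first three sub-functions listed above, here is a minimal pandas sketch. It is not taken from eda.py: the toy DataFrame, the column names (`age`, `segment`, `churned`), and the choice of median imputation are all assumptions made for this example. The modeling steps (logistic regression and KNN) are left out to keep it short.

```python
import pandas as pd

# Hypothetical toy data; the real module prompts for a data file instead.
df = pd.DataFrame({
    "age": [25, 32, None, 41, 38, 29],
    "segment": ["a", "b", "a", "b", "a", "b"],
    "churned": [0, 1, 0, 1, 0, 1],
})

# Getting to know the data: shape, column types, summary statistics.
shape = df.shape
dtypes = df.dtypes
summary = df.describe()

# Missing data: count gaps per column, then fill the numeric gap
# with the column median (one possible strategy, assumed here).
missing_before = df.isna().sum()
df["age"] = df["age"].fillna(df["age"].median())

# Cross-table: frequency of one categorical column against another.
table = pd.crosstab(df["segment"], df["churned"])

print(shape)
print(missing_before)
print(table)
```

With a real data set, `pd.read_csv` on the file you supply would replace the hand-built DataFrame.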

To run the module:
- Install Anaconda: https://docs.anaconda.com/anaconda/navigator/
- Install the Spyder IDE: https://docs.spyder-ide.org/current/index.html
- Download eda.py and a few sample data sets from this project repo
- Run eda.py in Spyder and, when prompted, provide the data file
- Graphs appear in the Plots pane in Spyder
- At any point, you can leave the current loop or sub-function by typing 'exit'
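
The 'exit' behavior described above can be sketched as a small prompt loop. This is only an assumption about how such a loop might look, not the code in eda.py: `run_menu`, `read_line`, and `actions` are hypothetical names, and the scripted answers stand in for keyboard input so the example runs without a prompt.

```python
def run_menu(read_line, actions):
    """Dispatch commands until the user types 'exit'.

    read_line and actions are hypothetical names used for this sketch.
    """
    handled = []
    while True:
        choice = read_line("Choose a step (or 'exit'): ").strip().lower()
        if choice == "exit":
            break  # leave the loop as soon as 'exit' is typed
        action = actions.get(choice)
        if action is None:
            print(f"Unknown option: {choice!r}")
            continue
        action()
        handled.append(choice)
    return handled


# Scripted answers replace keyboard input for this demonstration.
answers = iter(["summary", "bogus", "EXIT"])
done = run_menu(
    lambda prompt: next(answers),
    {"summary": lambda: print("summary step")},
)
print(done)
```

For interactive use, pass the built-in `input` as `read_line`. Lowercasing the answer means 'exit', 'Exit', and 'EXIT' are all accepted, which is a design choice of this sketch.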