Skip to content

Latest commit

 

History

History
16 lines (14 loc) · 775 Bytes

File metadata and controls

16 lines (14 loc) · 775 Bytes

Automated-EDA

Automated Exploratory Data Analysis tool that can automat the pre-processing steps and providing a comprehensive visualization dashboard for each column type.

Components and Functionalities:

  • autoEDA.py: The main entry point of the tool.
  • visualization.py: Utility functions for generating visualizations using Matplotlib and Seaborn.
  • test.ipynb: Example of using the tool
  • International_Report_Departures.csv : test dataset.

Main Features:

  • Load data from CSV, Excel, and SQL databases.
  • Automated preprocessing based on column types.
  • Handle missing values and scale numerical features.
  • Data visualization using Matplotlib and Seaborn
  • Visualization dashboard with histograms, box plots, scatter plots, etc.
  • Feature selection options.