Skip to content

Latest commit

 

History

History
30 lines (19 loc) · 708 Bytes

File metadata and controls

30 lines (19 loc) · 708 Bytes

#Data Cleaning Repository!

Each folder contains:

  • Raw Data: Stored in .xlsx file;
  • Data Cleaning: Utilizes Python scripts to clean the data.
  • Cleaned Data: Outputs are converted to .csv file.

Repository Structure

  • Raw Data: Contains the original, unprocessed Excel files (.xlsx).

    • Example: sales_data_2023.xlsx
  • scripts/: Python scripts used for data cleaning.

    • Example: dataclean1.py
  • clean_data/: Cleaned data saved as CSV files after processing.

    • Example: data_cleaned1.csv

Dependencies

  • Python 3.x
  • pandas
  • openpyxl (for reading Excel files)

Please install these dependencies using pip:

pip install pandas openpyxl