Skip to content

Excel-based data cleaning project focused on standardizing and formatting historical data of U.S. Presidents, including name formatting, party corrections, and date normalization.

Notifications You must be signed in to change notification settings

AzimNahin/Excel-Presidential-Data-Cleaning

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

3 Commits
Β 
Β 
Β 
Β 

Repository files navigation

🧹 President Information – Data Cleaning (Excel Project)

This repository showcases a data cleaning project performed entirely in Microsoft Excel, using a dataset of U.S. Presidents. The work includes identifying and correcting inconsistencies, formatting data for readability, and preparing the dataset for future data analysis or visualization.

πŸ“Š Dataset Overview

The dataset contains historical information about U.S. Presidents, including:

  • President Name
  • Political Party
  • Vice President
  • Salary
  • Date Created
  • Date Updated
  • (Originally included more columns like β€œprior role” and row indexes)

βœ… Data Cleaning Process

All cleaning was done manually using Excel formulas, formatting tools, and filters. Below are the key changes made:

πŸ”§ Removed Redundancies & Irrelevant Data

  • Deleted the index column (Unnamed: 0) and unnecessary metadata.
  • Removed the β€œprior” column due to inconsistent formatting and encoding errors.

πŸ‘€ Standardized Names

  • Fixed inconsistent casing (e.g., john adams, JAMES MONROE) by converting all names to title case.
  • Trimmed extra spaces within names (e.g., George Clinton β†’ George Clinton).

πŸ›οΈ Party Standardization

  • Standardized inconsistent party names like:
    • Democratic- Republican β†’ Democratic-Republican

πŸ’Ό Cleaned Salary Data

  • Verified numeric consistency and formatting for the salary column.

πŸ“… Date Format Fixes

  • Ensured all date_created and date_updated fields follow the standard ISO format: YYYY-MM-DD.

πŸ“ Repository Contents

  • President Information - Data Cleaning.xlsx: The Excel file containing:
    • US_Presidents Data – original raw dataset
    • US_Presidents Data Fixed – cleaned and formatted version

πŸ› οΈ Tools Used

  • Microsoft Excel (no code required!)
    • Find & Replace
    • Text functions (e.g., PROPER(), TRIM())
    • Filter and Sort
    • Manual inspection and correction

πŸš€ Potential Next Steps

  • Export cleaned dataset to CSV for public data analysis.
  • Visualize trends in U.S. Presidential data (e.g., salaries, party changes).
  • Augment dataset with additional fields like education, birthplace, or term years.

πŸ‘₯ Contributor

About

Excel-based data cleaning project focused on standardizing and formatting historical data of U.S. Presidents, including name formatting, party corrections, and date normalization.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published