The following steps were taken to ensure the data is clean and ready for analysis:
- Step 1
Converting the SalesDate column to the desired data type.
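
A minimal sketch of this conversion in MySQL. The table name `housing`, the new column `SaleDateConverted`, and the text format `'%M %d, %Y'` (for example `April 9, 2013`) are assumptions made for illustration; the format string has to match how the dates are actually stored.

```sql
-- Hypothetical table (housing) and helper column (SaleDateConverted).
-- Add a DATE column next to the original text column.
ALTER TABLE housing ADD COLUMN SaleDateConverted DATE;

-- Parse the text into a DATE; the format string is an assumption.
UPDATE housing
SET SaleDateConverted = STR_TO_DATE(SalesDate, '%M %d, %Y');
```

Writing into a separate column keeps the original text available until the converted values have been checked.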
- Step 2
Replacing the null values in Property Address, using either a CASE statement or COALESCE, and updating the table.
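
One possible way to fill the missing addresses is sketched below. It assumes that rows sharing a parcel identifier share an address, so a self-join can borrow the address from a sibling row. The table name `housing` and the columns `ParcelID` and `UniqueID` are hypothetical and do not come from the notes above.

```sql
-- Hypothetical names: housing, ParcelID, UniqueID.
-- Join the table to itself to find another row for the same parcel
-- that does have an address, then copy it over with COALESCE.
UPDATE housing AS a
JOIN housing AS b
  ON a.ParcelID = b.ParcelID
 AND a.UniqueID <> b.UniqueID
SET a.PropertyAddress = COALESCE(a.PropertyAddress, b.PropertyAddress)
WHERE a.PropertyAddress IS NULL
  AND b.PropertyAddress IS NOT NULL;
```

COALESCE returns its first non-null argument, so rows that already have an address would be left unchanged even without the WHERE filter.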
- Step 3
Breaking Property Address into individual columns (address, city, state). This was done using SUBSTRING, LOCATE, LENGTH and SUBSTRING_INDEX.
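
As an illustration, the query below splits the address at its first comma with SUBSTRING and LOCATE. The comma delimiter, the table name `housing`, and the output aliases are assumptions; a different delimiter would only change the search string.

```sql
-- Hypothetical table name (housing); assumes ',' separates street and city.
SELECT
  PropertyAddress,
  -- Everything before the first comma.
  TRIM(SUBSTRING(PropertyAddress, 1,
                 LOCATE(',', PropertyAddress) - 1)) AS street_address,
  -- Everything after the first comma.
  TRIM(SUBSTRING(PropertyAddress,
                 LOCATE(',', PropertyAddress) + 1,
                 LENGTH(PropertyAddress))) AS city
FROM housing;
```

Running this as a SELECT first makes it easy to inspect the split before storing the results in new columns.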
- Step 4
Changing Y and N to YES and NO, using a CASE statement to map Y to YES and N to NO.
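
A sketch of the mapping with a CASE expression. The column name `SoldAsVacant` and the table name `housing` are hypothetical, since the notes do not name the column that holds the Y and N values.

```sql
-- Hypothetical names: housing, SoldAsVacant.
UPDATE housing
SET SoldAsVacant = CASE
    WHEN SoldAsVacant = 'Y' THEN 'YES'
    WHEN SoldAsVacant = 'N' THEN 'NO'
    ELSE SoldAsVacant  -- leave any other value untouched
END;
```

The column needs to be wide enough to hold the three-character value `YES`.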
- Step 5
Identifying duplicate rows. A window function was used to number the rows within each group of matching values; rows with a number higher than 1 are duplicates.
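
A window function such as ROW_NUMBER() numbers the rows inside each group of identical values, so any row numbered above 1 repeats an earlier one. The sketch below assumes MySQL 8.0 or later; the table name `housing`, the columns `UniqueID` and `ParcelID`, and the choice of partition columns are assumptions.

```sql
-- Hypothetical names: housing, UniqueID, ParcelID.
-- The partition columns define what counts as "the same record".
WITH ranked AS (
  SELECT
    UniqueID,
    ROW_NUMBER() OVER (
      PARTITION BY ParcelID, PropertyAddress, SalesDate
      ORDER BY UniqueID
    ) AS row_num
  FROM housing
)
SELECT *
FROM ranked
WHERE row_num > 1;  -- repeats of an earlier row in the same group
```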
- Step 6
Deleting the rows identified in Step 5 from the table and removing redundant columns. DELETE was used to remove the flagged rows, and the SalesDate, TaxDistrict, PropertyAddress and OwnerAddress columns were dropped.
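
The clean-up in this step could look roughly like the following in MySQL 8.0 or later. The table name `housing`, the columns `UniqueID` and `ParcelID`, and the partition columns are assumptions; the dropped column names are taken from the step above.

```sql
-- Hypothetical names: housing, UniqueID, ParcelID.
-- Remove rows that the window function numbers above 1.
DELETE FROM housing
WHERE UniqueID IN (
  SELECT UniqueID
  FROM (
    SELECT
      UniqueID,
      ROW_NUMBER() OVER (
        PARTITION BY ParcelID, PropertyAddress, SalesDate
        ORDER BY UniqueID
      ) AS row_num
    FROM housing
  ) AS ranked
  WHERE row_num > 1
);

-- Columns are removed with ALTER TABLE ... DROP COLUMN, not DELETE.
ALTER TABLE housing
  DROP COLUMN SalesDate,
  DROP COLUMN TaxDistrict,
  DROP COLUMN PropertyAddress,
  DROP COLUMN OwnerAddress;
```

Both statements are destructive, so it is worth running them against a copy of the table first.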
The ALTER and DROP commands were used to define and remove columns in the dataset.
The INSERT, UPDATE and DELETE commands were used to modify data in the dataset.
SELECT queries were used for:
- Replacing the null values in the Property Address column
- Breaking out Property Address into individual columns
- Identifying the delimiter in the Owner Address column and breaking it into individual columns
- The table was joined with itself while performing data manipulation
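
For the Owner Address split mentioned in the list above, SUBSTRING_INDEX is a convenient fit once the delimiter is known. The sketch assumes a comma delimiter and three parts (street, city, state); the table name `housing` and the output aliases are hypothetical.

```sql
-- Hypothetical table name (housing); assumes 'street, city, state'.
SELECT
  OwnerAddress,
  -- Text before the first comma.
  TRIM(SUBSTRING_INDEX(OwnerAddress, ',', 1)) AS owner_street,
  -- Take the first two parts, then keep the last of those.
  TRIM(SUBSTRING_INDEX(SUBSTRING_INDEX(OwnerAddress, ',', 2), ',', -1)) AS owner_city,
  -- Text after the last comma.
  TRIM(SUBSTRING_INDEX(OwnerAddress, ',', -1)) AS owner_state
FROM housing;
```

A positive count in SUBSTRING_INDEX keeps text from the left of the string and a negative count keeps text from the right, which is why nesting the two calls isolates the middle part.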