Skip to content

This was a project that was part of my Data Science with AI Master's degree. As part of this project I had to follow the CRISP-DM methodology and use the ProjectTemplate library to carry out my EDA on a never seen before collection of datasets. The process included working to set the business objectives, understand the provided data, preparing i…

Notifications You must be signed in to change notification settings

kasiotis/Data-Management-and-Exploratory-Data-Analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

49 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Instruction Manual

To load this project, you'll first need to setwd() into the directory where this README file is located. Then you need to run the following two lines of R code:

library('ProjectTemplate')
load.project()

After you enter the second line of code, you'll see a series of automated messages as ProjectTemplate goes about doing its work. This work involves:

  • Reading in the global configuration file contained in config.
  • Loading any R packages you listed in the configuration file.
  • Reading in any datasets stored in data or cache.
  • Preprocessing your data using the files in the munge directory.

Once that's done, you can execute any of the analysis filese in the src directory. Before running any of the analysis files please make sure to call the function called load.project().

Additionally, in order for the pre-processing to work you may need to install some packages that were used. To do that, you can go into these files in the munge folder from were you can see all the libraries that get imported. To install these libraries please use the install.packages('LibraryName') function along with the name of the libraries you do not currently have already installed on your device.

To run the analysis you would simply go to the src folder and call each of the functions I created for the analysis. They all have comments so you can easily find what you are looking for.

To view the report detailing the process and findings of this project, please go into the reports folder and open the PDF document there.

About

This was a project that was part of my Data Science with AI Master's degree. As part of this project I had to follow the CRISP-DM methodology and use the ProjectTemplate library to carry out my EDA on a never seen before collection of datasets. The process included working to set the business objectives, understand the provided data, preparing i…

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages