Machine-Learning-and-statistical-Analysis-Of-Titanic

The mean goal of this repository is to predict if a passenger survived the sinking of the Titanic or not, based on multiple features such as sex, age ,...etc.

Steps :

STEP 1: Exploratory Data Analysis
STEP 2: Feature Engineering
STEP 3: Pre-Modeling Tasks
STEP 4: Modeling
STEP 5: Evaluating the performance of the model
STEP 6: Predictions and submission

Requirements:

DataSet :

Kaggle (https://www.kaggle.com/c/titanic/data).

Step 1 : Exploratory Data Analysis

In this phase we will extract the dataset and explore it, and we will do some descriptive statistics, and visualize our data.

- Data Extraction

- Viewing the data

- Descriptive statistics

Correlation Map

- Data Visualization

- Distribution of Age

- Sex feature vs Survived feature

- Embarked vs Survived

Step 2: Feature Engineering

Feature Engineering is a process of transforming the data into data which is easier to interept and also, to increase the predictive power of learning algorithm.
In this part we will create a new features that could improve predictions such as if the passenger is alone or not, and combining existing features to produce a more useful one, and dropping the columns doesn't improve predictions.

Step 3 : Pre-Modeling Tasks

Separating the independant and the dependant variable.
Splitting the training data.

Step 4 : Modeling the Random Forest Model.

In this part we'll try to build a Random Forest Model and then tunning the hyperparameters using the GridSearcCV.

Step 5 : Evaluating the performance of Random Forest using the performance metrics.

Evaluating the machine learning model is a crucial part in any data science project. There are many metrics that helps us to evaluate our model accuracy.
Classification Accuracy
Classification Report
Precision Score
Recall Score
Confusion matrix

AUC & ROC Curve

Step 6 : Predictions and submit the results

Name		Name	Last commit message	Last commit date
Latest commit History 41 Commits
README.md		README.md
STEP 2 : Feature Engineering.ipynb		STEP 2 : Feature Engineering.ipynb
STEP 3 : Pre-Modeling Tasks.ipynb		STEP 3 : Pre-Modeling Tasks.ipynb
STEP 4 :Modeling the Random Forest Model.ipynb		STEP 4 :Modeling the Random Forest Model.ipynb
STEP 5 : Evaluating the performance of the model.ipynb		STEP 5 : Evaluating the performance of the model.ipynb
STEP 6 Predictions and submission.ipynb		STEP 6 Predictions and submission.ipynb
Step 1 Exploratory Data Analysis.ipynb		Step 1 Exploratory Data Analysis.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Machine-Learning-and-statistical-Analysis-Of-Titanic

Steps :

Requirements:

DataSet :

Step 1 : Exploratory Data Analysis

- Data Extraction

- Viewing the data

- Descriptive statistics

Correlation Map

- Data Visualization

- Distribution of Age

- Sex feature vs Survived feature

- Embarked vs Survived

Step 2: Feature Engineering

Step 3 : Pre-Modeling Tasks

Step 4 : Modeling the Random Forest Model.

Step 5 : Evaluating the performance of Random Forest using the performance metrics.

Step 6 : Predictions and submit the results

Useful Resources :

About

Uh oh!

Releases

Packages

Languages

LailaMahmoudi/Machine-Learning-and-statistical-Analysis-Of-Titanic

Folders and files

Latest commit

History

Repository files navigation

Machine-Learning-and-statistical-Analysis-Of-Titanic

Steps :

Requirements:

DataSet :

Step 1 : Exploratory Data Analysis

- Data Extraction

- Viewing the data

- Descriptive statistics

Correlation Map

- Data Visualization

- Distribution of Age

- Sex feature vs Survived feature

- Embarked vs Survived

Step 2: Feature Engineering

Step 3 : Pre-Modeling Tasks

Step 4 : Modeling the Random Forest Model.

Step 5 : Evaluating the performance of Random Forest using the performance metrics.

Step 6 : Predictions and submit the results

Useful Resources :

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages