A Kaggle notebook solution using Logistic Regression to predict survival on the Titanic. This project covers data exploration, preprocessing, model training, evaluation, and submission generation.
- ๐ Introduction
- ๐ฆ Importing Required Libraries
- ๐ Data Loading and Exploration
- ๐ Train Dataset
- ๐ Test Dataset
- ๐ Data Information
- ๐ Exploratory Data Analysis (EDA)
- ๐งฉ Checking Missing Values
- ๐จ Data Visualization
- ๐งฎ Countplots
- ๐ Histograms
- ๐ฆ Boxplots
- ๐ ๏ธ Feature Engineering
- ๐งช Model Building
- โ๏ธ Splitting the Data
- ๐งฎ Logistic Regression Model
- ๐ Model Evaluation
- โ Accuracy Score
- ๐ Confusion Matrix
- ๐งพ Classification Report
- ๐ง ROC Curve and AUC Score
- ๐งพ Prediction on Test Data
- ๐ค Submission File Creation
- ๐ Conclusion
Feel free to explore each section above to understand the approach in detail!