This project is a classification task aimed at predicting the price category of a mobile phone based on its specifications using various machine learning algorithms.
The dataset is clean, balanced, and contains various features such as battery power, RAM, screen size, and more.
The dataset used is from Kaggle - Mobile Price Classification.
It includes 2000 samples with 20 numerical features, and the target is a price range from 0 (lowest) to 3 (highest).
-
Data Preprocessing
- Feature inspection and EDA
- Checked distributions and correlations
- Feature scaling using
StandardScaler
-
Model Training
- Trained multiple models:
- Logistic Regression
- K-Nearest Neighbors (KNN)
- Support Vector Machine (SVM)
- Decision Tree
- Random Forest
- Trained multiple models:
-
Hyperparameter Tuning
- Used GridSearchCV to tune Logistic Regression (but accuracy dropped)
-
Final Model
- Best performance achieved by Logistic Regression with 97% accuracy
- GridSearch tuning result: 88%, so default configuration was used
- Final model:
LogisticRegression() - Accuracy: 97%
- No further feature engineering was needed
- Project completed using pure Scikit-learn