A machine learning project for predicting the success of Himalayan expeditions using historical data.
This project analyzes Himalayan expedition data to predict expedition success using various machine learning models including Logistic Regression, KNN, Decision Trees, Neural Networks, Balanced Random Forest, and Easy Ensemble AdaBoost.
├── dataset/ # Raw CSV datasets
│ ├── exped.csv
│ ├── members.csv
│ └── peaks.csv
├── Models/
│ ├── data/ # Processed data
│ ├── frontend/ # Streamlit web application
│ ├── models/ # Trained model files (.h5, .pkl)
│ ├── notebooks/ # Jupyter notebooks for each model
│ ├── utils/ # Utility functions
│ ├── config.py # Configuration settings
│ ├── prepare_data.py # Data preprocessing script
│ └── requirements.txt # Python dependencies
- Multiple ML models for expedition success prediction
- Interactive web application built with Streamlit
- Data preprocessing and feature engineering
- Model comparison and evaluation
- Real-time prediction with confidence scores
- Visual model performance comparison
- Clone the repository
- Install dependencies:
pip install -r Models/requirements.txtcd Models
python prepare_data.pyOpen and run the Jupyter notebooks in Models/notebooks/ to train models:
- Train each model (Logistic Regression, KNN, etc.)
- Models are automatically saved to
Models/models/
Locally:
streamlit run Models/frontend/app.pyOn Streamlit Cloud:
- Push code to GitHub
- Deploy at streamlit.io/cloud
- Main file:
Models/frontend/app.py
See DEPLOYMENT.md for detailed deployment instructions.
The main dashboard displays available models and their current status.
View and compare the accuracy of all trained models.
Enter expedition parameters (age, team size, season, peak, etc.) to predict success probability.
Get instant predictions showing whether the expedition is likely to succeed or fail.
See confidence scores from all models for comprehensive analysis.
- Logistic Regression
- K-Nearest Neighbors (KNN)
- Decision Tree
- Neural Network
- Balanced Random Forest
- Easy Ensemble AdaBoost
- Python 3.x
- Machine Learning: scikit-learn, TensorFlow, imbalanced-learn
- Data Analysis: pandas, numpy
- Visualization: matplotlib, seaborn, plotly
- Web App: Streamlit
The project uses Himalayan expedition data including:
- Expedition details
- Member information
- Peak characteristics
This project is for educational purposes.




