This project scrapes Google Play Store reviews for the BRImo mobile application and builds sentiment-analysis models to categorize user feedback. Two Jupyter notebooks cover the end-to-end workflow: large-scale data collection and multiple machine-learning experiments on Indonesian-language reviews.
- Ensure Python 3.10+ is available.
- Create and activate a virtual environment (recommended).
- Install dependencies:
pip install -r requirements.txt
-
Scrape fresh reviews
OpenScrapping_Review_BRImo_Playstore.ipynbin Jupyter, update the desiredapp_idor output filename if needed, and run the notebook to download reviews into a CSV file (default:ulasan_aplikasi_brimo.csv). -
Run sentiment analysis
LaunchSentiment_Analysis_Aplikasi_BRImo.ipynbto:- Load the dataset (either the provided CSV or the one generated above).
- Execute preprocessing that normalizes text, maps Indonesian slang, and removes noise.
- Label reviews using a lexicon-based approach.
- Explore data through plots and word clouds.
- Train and evaluate several classifiers (logistic regression, SVM, TF-IDF neural net, and Bag-of-Words neural net).
- Perform inference on custom text inputs with the best-performing model.
.
├── Scrapping_Review_BRImo_Playstore.ipynb # Google Play review scraper
├── Sentiment_Analysis_Aplikasi_BRImo.ipynb # Preprocessing, modeling, and evaluation
├── requirements.txt # Python dependencies
└── ulasan_aplikasi_brimo.csv # Sample dataset of BRImo reviews
No explicit license is provided. Add an appropriate LICENSE file before redistributing or using this code in production.
- Fork the repository and create a feature branch.
- Make changes with clear commits and, when possible, add tests or notebook annotations explaining new behavior.
- Ensure notebooks run top-to-bottom without errors.
- Open a pull request describing the motivation, approach, and validation steps. Feedback and review improvements are welcome.