GitHub - mqcapelle/review-sentiment-classifier

Quickstart

1. Clone and install

git clone https://github.com/<your-username>/review-sentiment-classifier.git
cd review-sentiment-classifier
python3 -m venv .venv && source .venv/bin/activate
pip install -r requirements.txt
playwright install chromium

2. Train

python src/train.py

To monitor training live, run TensorBoard in a second terminal and open http://localhost:6006:

tensorboard --logdir outputs/runs

3. Evaluate

python src/evaluate.py outputs/model_epoch3.pt

4. Predict on custom text

python src/predict.py "The food was incredible, best restaurant I've been to in years!"

5. Scrape & predict on Google Reviews

python src/scraper.py "https://www.google.com/maps/place/..."

Results are saved to data/google_reviews.csv.

Key Design Decisions

Why DistilBERT? According to the DistilBERT authors, it is 40% smaller and 60% faster than BERT while retaining about 97% of its language-understanding performance. That is the right trade-off for local training on consumer hardware.

Why MAE alongside accuracy? Star ratings are ordinal: being off by 1 star is fundamentally different from being off by 4. MAE (mean absolute error) measures how far off a prediction is, while accuracy only records whether it was exactly right.
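To make the difference concrete, here is a minimal sketch using made-up star ratings (the two "models" and their predictions are hypothetical and not taken from this repository). Both models get the same accuracy, but their MAE differs by a factor of four.

```python
def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true rating exactly."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def mean_absolute_error(y_true, y_pred):
    """Average distance, in stars, between prediction and truth."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


# Hypothetical ratings, for illustration only.
true_stars = [5, 5, 1, 3]
near_misses = [4, 4, 2, 3]  # wrong three times, but only by one star
far_misses = [1, 1, 5, 3]   # wrong three times, by four stars each

acc_near = accuracy(true_stars, near_misses)
acc_far = accuracy(true_stars, far_misses)
mae_near = mean_absolute_error(true_stars, near_misses)
mae_far = mean_absolute_error(true_stars, far_misses)

print(f"accuracy: near={acc_near:.2f} far={acc_far:.2f}")
print(f"MAE:      near={mae_near:.2f} far={mae_far:.2f}")
```

Accuracy alone cannot tell these two models apart. MAE shows that the second one makes much worse mistakes.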

Why 5% of the dataset? Transformer fine-tuning shows strong diminishing returns as data volume grows. Training on 5% produces a model within ~5% accuracy of full-data training at a fraction of the compute cost.
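One simple way to take a reproducible 5% slice is a seeded random sample. The sketch below uses only the standard library. The `subsample` helper, the seed, and the toy `reviews` list are assumptions for illustration, not necessarily how src/train.py selects its data.

```python
import random


def subsample(examples, fraction=0.05, seed=42):
    """Return a reproducible random subset containing `fraction` of the examples."""
    if not 0 < fraction <= 1:
        raise ValueError("fraction must be in (0, 1]")
    if not examples:
        return []
    rng = random.Random(seed)  # fixed seed gives the same subset on every run
    k = max(1, round(len(examples) * fraction))
    return rng.sample(examples, k)


# Toy dataset of (text, stars) pairs, hypothetical.
reviews = [(f"review {i}", i % 5 + 1) for i in range(1000)]
subset = subsample(reviews)
print(len(subset))
```

A plain random sample can leave rare star ratings under-represented. If the class balance matters, sampling each rating separately (stratified sampling) is a common alternative.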

Why MPS? On M1/M2 MacBooks, PyTorch's MPS backend (built on Apple's Metal Performance Shaders) provides a 3–5x speedup over CPU for transformer workloads, with no code changes needed beyond device selection.
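Device selection usually amounts to a short fallback chain. The sketch below is one way to write it. The `pick_device_name` helper and the MPS, then CUDA, then CPU ordering are assumptions, not necessarily what src/train.py does. The availability checks use `torch.backends.mps.is_available()` and `torch.cuda.is_available()`, and the snippet falls back to CPU if PyTorch is not installed.

```python
def pick_device_name(mps_ok: bool, cuda_ok: bool) -> str:
    """Prefer Apple's MPS backend, then CUDA, then fall back to CPU."""
    if mps_ok:
        return "mps"
    if cuda_ok:
        return "cuda"
    return "cpu"


try:
    import torch

    mps_ok = torch.backends.mps.is_available()
    cuda_ok = torch.cuda.is_available()
except (ImportError, AttributeError):
    # PyTorch is missing or too old to expose the MPS backend.
    mps_ok = cuda_ok = False

device_name = pick_device_name(mps_ok, cuda_ok)
print(f"Using device: {device_name}")
```

With PyTorch installed, the resulting name can be passed to `torch.device(device_name)`, and the model and input tensors moved there with `.to(device)`.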

Requirements

Python 3.9+
PyTorch 2.0+ (with MPS support for Apple Silicon)
See requirements.txt for full dependencies
