Book Rating Prediction

As a part of the course Text Mining [TDDE16], I'm attempting to predict the rating a user has given a book given its review, using distilBERT-base-cased.

To run the code, first clone the project onto your own PC. Then relocate to the root of the project and run pip install -e . to install the PredictRating package. Once the package has been installed, pytorch has to be installed separately in order to fit your system. See this for an install command that fits your specification.

All scripts run from root. Download the goodreads datasets from here (the one called goodreads_reviews_spoiler_raw.json.gz), place it in book-rating-prediction/data and name it reviews.json. For the Amazon set, download it from here, place it in book-rating-prediction/data and name it reviews_amzn.csv.

A model is trained with train.py and evaluated with bertstats.py. baseline.py and humanlevel.py creates baselines for the BERT classifier. constants.py contains all hyperparameter values used during training. There are some tests written for the package in book-rating-prediction/tests.

The BERT model from the report can be downloaded from here. Download it and place it in book-rating-prediction/models if you want to use it.

Final report awarded with highest grade (5) available @ here.

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
data		data
models		models
src/PredictRating		src/PredictRating
tests		tests
.gitignore		.gitignore
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Book Rating Prediction

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

davidvavinggren/book-rating-prediction

Folders and files

Latest commit

History

Repository files navigation

Book Rating Prediction

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages