This repository is a personal project where I develop football models and visualisations.
It's build on top of different technologies such as:
- Python: for data processing and machine learning (Tensorflow, Scikit-Learn and MLFlow).
- R: all data-visualisation stuff (dplyr, ggplot2, magick).
- Postgres SQL: storing and querying data easily.
- Docker: to orchestrate all these elements together and easy install/startup.
- Google Cloud Plateform: to run heavy jobs in cloud.
The project contains five folders:
./app: maybe deprecated design, mostly for data integration and easy of use../data: where raw data are stored (in addition to the database). Not in git 😉../model: machine learning models (expected goal for example)../src: source file for crawlers, database connection/ingestion, SQL queries, etc..../visualisation: source code for data-visualization, most recent works (on maps) are in./visualisation/mapssubfolders.
- Start corresponding container:
docker-compose up -d model. - For ease of use, going into the container:
docker exec -it model bash - All data are mapped to the local environment:
cd /data
- Expected goal model:
python -m model.expected_goal.main --help. - Expected assist model:
python -m model.expected_assist.main --help. - Possession2Vec model:
python -m model.pass2vec.main --help
- Start corresponding container:
docker-compose up -d passmap. - Go to
http://localhost:8082/.
- Improve models
- Add more documentations.
- Clean some viz stuff.
- Better Docker management.
Any questions/improves on Twitter @Ben8t.








