v0.5.0
This release adds support for automatically fetching from the GitHub remote cache, so that we download files on demand when we need it for research. Additionally, we significantly improve the pipeline for generating data.
What's Changed
- feat(pipeline): introduce RemoteCache by @bassosimone in #85
- fix(bqpq.py): make BigQuery init lazy by @bassosimone in #86
- feat(library): add ghremote package by @bassosimone in #88
- refactor(pipeline)!: move remote cache to cache.py by @bassosimone in #89
- refactor(pipeline)!: pass remote_cache to ctors by @bassosimone in #90
- feat(pipeline)!: introduce file-based locking by @bassosimone in #91
- fix(pipeline): ensure eventual consistency on Unix by @bassosimone in #92
- fix(pqread.py): allow subdivision1 filtering by @bassosimone in #93
- fix(pipeline/dataset.py): define all granularities by @bassosimone in #94
- feat(pipeline): improve operator experience by @bassosimone in #95
- feat(ghremote): improve DX using tqdm by @bassosimone in #96
- feat(cache): pass cache_manager to IQBCache by @bassosimone in #97
- feat: export IQBDatasetMLabTable by @bassosimone in #98
- feat(cache/mlab.py): filter by
subdivision1by @bassosimone in #99 - feat(cache)!: make read_cache_entry lazy by @bassosimone in #100
- feat(iqb): export remote-github-cache names by @bassosimone in #101
- fix: prefer os.replace to shutil.move by @bassosimone in #102
- fix(pipeline): make BQ idempotent by default by @bassosimone in #103
- fix: automatically set version number by @bassosimone in #104
- fix!: enforce keyword arguments by @bassosimone in #105
- fix(ghremote)!: more cohesive construction by @bassosimone in #106
- fix(pipeline): prevent start_date == end_date by @bassosimone in #107
- refactor: replace tqdm with rich by @bassosimone in #108
- feat: introduce the scripting package by @bassosimone in #109
- feat(scripting): add pipeline convenience code by @bassosimone in #110
- feat(pipeline): improve date-parsing errors by @bassosimone in #111
- feat(scripting): add exception interceptor by @bassosimone in #112
- refactor(./data/run_query)!: use iqb.scripting by @bassosimone in #113
- ghcache.py!: remove sync and update manifest path by @bassosimone in #114
- chore(ghremote): publish additional country-level data by @bassosimone in #115
- refactor(generate_data.py): use
iqb.scriptingby @bassosimone in #116 - chore(analysis): start using remote cache by @bassosimone in #117
- chore(data): publish 2025-06 data by @bassosimone in #118
- chore(data): publish data committed into repo by @bassosimone in #119
- cleanup(data): remove cached entries by @bassosimone in #120
- doc(data): update README.md by @bassosimone in #121
- feat(pipeline)!: explicit BigQuery enable kwarg by @bassosimone in #122
- refactor(generate_data.py): add flags and matrix by @bassosimone in #123
- fix(iqb_exception): don't suppress KeyboardInterrupt by @bassosimone in #124
- refactor: make data a pyproject by @bassosimone in #125
- fix(data)!: use integer for YAML file version by @bassosimone in #87
- cleanup(data)!: remove the run_query.py script by @bassosimone in #126
- fix(ghcache.py): emit files to upload in data/tmp by @bassosimone in #127
Full Changelog: v0.4.0...v0.5.0