-
Notifications
You must be signed in to change notification settings - Fork 2
Combine datasets to one region or province
Charlottevm edited this page Oct 15, 2021
·
11 revisions
- Go to ETLocal branch:
dataset-amalgamator
(there the script is located to combine the regions) - Run in your terminal:
python3 app/services/dataset_combiner.py geo_id=<geo_id> name=<dataset_name> migration_name=<migration_name> dataset_ids=<id1,id2,id3...>
(make sure there are only commas between the ids and no spaces)
For example when you want to update the Groningen-Drenthe region it would look like this:
python3 app/services/dataset_combiner.py geo_id=RGGD01 name=Groningen-Drenthe migration_name=gd_migrate2 dataset_ids=15054,15056
If there is no existing migration you want to update, leave migration_name blank (so: migration_name='').
- Check in the etlocal/db/migrate folder if a new folder is created for this migration
- Check the commits.yml: are all datasets you wanted to combine stated here? (you can also correct the spelling when necessary)
- If everything is fine: create a new branch from the master branch (not the
dataset-amalgamator
branch) - Run in your terminal:
rake db:migrate
- Commit the new files (including schema.rb, excluding app/services/dataset_combiner/*) and create a PR.
Please note that this is an experimental feature not intended for merging with the master branch!