-
Notifications
You must be signed in to change notification settings - Fork 22
Load real UK Biobank data
Milton Pividori edited this page Jul 31, 2018
·
30 revisions
use only the ukbREST Docker image here.
use only disjunctive datasets (unique set of data fields). TRY and see what happens when we do not follow this.
mention ukbconv commands to get csv and html files.
when showing how to run the server, show examples using HTTP Basic auth and SSL using the Docker image of ukbREST.
Once the loading process finishes, you can get all the data field codings by connecting to the PostgreSQL database and exporting a list of codings:
\copy (select distinct coding from fields where coding is not null)
to /mnt/all_codings.txt (format csv)The file /mnt/all_codings.txt is just a list of coding numbers, one per line, that you can use
to download all coding files using the download_codings.sh script:
$ mkdir /tmp/codings && cd /mnt/codings
$ [...]/misc/download_codings.sh all_codings.txt- TODO: Problems with CSV files encodings.
- Describe what happens when we found this error.
- How to specify different encodings for different files.
- the encoding issues could depend on the data fields you have access to in your application.