-
Notifications
You must be signed in to change notification settings - Fork 0
submit_data
Markus Kainu edited this page Jul 7, 2015
·
2 revisions
Yearbook process already has all the indicators from
- FAOSTAT
- World Bank WDI
- Source 3
- Source 4
- Source 5
(You can browse the current list of indicators here:)
However, you may want to add your own data and here we define some preferred guidelines for sharing data for this process.
Ideally, we would like to have the data in tidy-format. In principal, a dataset is said to be tidy if it satisfies the following conditions
- observations are in rows
- variables are in columns
- contained in a single dataset.
Tidy data makes it easier for us to carry out data analysis.
An example dataset for yearbook process could look like this:
| country_code | country_name | Year | variable1 | variable2 | variable3 |
|---|---|---|---|---|---|
| 106 | Italy | 1995 | 123 | 345 | 678 |
| 106 | Italy | 1996 | 123 | 345 | 678 |
| 106 | Italy | 1997 | 123 | 345 | 678 |
| 110 | Japan | 1995 | 123 | 345 | 678 |
| 110 | Japan | 1996 | 123 | 345 | 678 |
| 110 | Japan | 1997 | 123 | 345 | 678 |
In addition, we would need a metadata file that describes the raw data. In the case above it would look like this
| variable | short description | unit | source | long description |
|---|---|---|---|---|
| country_code | FAOSTAT country code | |||
| country_name | FAOSTAT country name | |||
| Year | Year in four digit | |||
| variable1 | dietary energy supply | (kcal/pc/day) | ||
| variable2 | Prevalence of undernourishment | percent | ||
| variable3 | Area equipped for irrigation | (1000 ha) |