-
Notifications
You must be signed in to change notification settings - Fork 6
Open
Description
Necessity
I think we should divide information we receive from the community into two:
- User-dependent information: Information about how researchers use the data
- Scientific field
- Physical variables derived
- Software used
- Region/object of study
- etc
- User-independent information: All data related to tech specs of the data set/sensor.
- Sensor name and type
- Satellite active period
- Temporal and regional coverage
- Temporal and regional resolution
- Data access platform
- Data accessibility (open access/commercial)
- etc.
Why is it important?
As we collect info about tech specs of the sensor and validate them, they are not subject to change, unless somebody spots a typo or a mistake. On the other hand, scientific application is user- or researcher-specific and one data set can have various applications.
Therefore, we can work with 2 types of files:
- [data_set_name]_techspecs - for one sensor we would have only one file (like we have currently)
- [data_set_name]_application_001, [data_set_name]_application_002 - for one sensor we would have multiple files for applications.
This separation would also help us:
- aggregate the data on applications, such as CryoSAT: used in Glaciology (24 researchers), used to derive Ice Velocity (20 users), used in the study of the Arctic (62 users) and Antarctic (50 users).
- collect information from the community more efficiently using two templates, one about tech specs and one for scientific applications.
In the end, the code would compile the third type of file for each dataset/sensor with all information - one file for each sensor, probably called [data_set_name]_index. All these index files can then be sent to Google Sheet.
What do you guys think?
Metadata
Metadata
Assignees
Labels
No labels