Skip to content
Ann Kim Novakowski edited this page Feb 8, 2024 · 7 revisions

The Data Use Ontology (DUO) provides a helpful framework for gating access to data managed by Sage Bionetworks on the Synapse platform.

DUO was developed by members of the Global Alliance for Genomic Health (GA4GH): "DUO allows [users] to semantically tag datasets with restriction about their usage, making them discoverable automatically based on the authorization level of users, or intended usage".

At Sage, we extended DUO modifiers for our use cases (see figure below) and incorporated derived annotations as a way of scaling governance support on projects by assigning access requirements (ARs)* to entities based on its DUO annotation.

*ARs are applied in the form of a clickwrap (i.e., user must agree to terms) and/or a managed access requirement (i.e., user must provide evidence). Managed ARs may require evidence in the form of Authentication (e.g., training certification, profile validation, two-factor authorization) and/or Authorization (e.g., intended data use (IDU) statement, data use certificate (DUC), ethics approval letter from an institutional review board (IRB) or independent ethics committee (IEC)).

The metadata required for each dataset will vary based on the governance framework established on a project-by-project basis.

OBO Shorthand Modifier Type Description Evidence
DUO_0000043 CC Clinical Care Use Boolean
DUO_0000020 COL Collaboration Required String PI Contact information required
DUO_0000007 DS Disease Specific Research String Provide detail [DOID] User must describe disease-specific research use in IDU statement
DUO_0000021 IRB Ethics Approval Required Boolean User prompted to provide IRB/IEC approval
DUO_0000042 GRU General Research Use Boolean
DUO_0000016 GSO Genetic Studies Only Boolean
DUO_0000022 GS Geographical Restriction String Provide country restriction [ISO 3166⍺2]
DUO_0000006 HMB Health or Medical or Biomedical Research Boolean
DUO_0000028 IS Institution Specific Restriction String Name Institution
DUO_0000015 NMDS No General Methods Research Boolean
DUO_0000004 NRES No Restriction Boolean
DUO_0000018 NPUNCU Not-for-Profit, Non-Commercial Use Only Boolean
DUO_0000046 NCU Non-Commercial Use Only Boolean
DUO_0000045 NPU Not-for-Profit Organisation Use Only Boolean
DUO_0000011 POA Population Origins or Ancestry Research Only Boolean
DUO_0000044 NPOA Population Origins or Ancestry Research Prohibited Boolean
DUO_0000027 PS Project Specific Restriction Boolean User prompted to provide IDU statement
DUO_0000024 MOR Publication Moratorium String Provide date [ISO 8601]
DUO_0000019 PUB Publication Required Boolean
DUO_0000012 RS Research Specific Restrictions String Provide detail User must describe research use in IDU statement
DUO_0000029 RTN Return to Database or Resource Boolean
DUO_0000025 TS Time Limit on Use String Provide date [ISO 8601] User prompted to renew access every x days with current evidence
DUO_0000026 US User Specific Restriction String Provide detail User may be required to join a Synapse Team
DUOplus1 Source Geography List of Strings List data generating country(ies) [ISO 3166⍺2]
DUOplus2 Data Permission List of Strings Data sharing enforced by: Agreement, Attestation, Award, Other [Identifier?]
DUOplus3 Data Tier List of Strings Anonymous, Open (aka Registered), Controlled, Private
DUOplus4 License List of Strings CC BY, CC BY-SA, CC BY-NC, CC BY-NC-SA
DUOplus5 Attribution String Provide attribution/acknowledgement statement

Resources

Publications

Clone this wiki locally