Skip to content

ProvEntities for BIDS files and BIDS datasets #174

@bclenet

Description

@bclenet

Current status

The current version of the BEP028 PR for specification indicates :

Key name Requirement Level Data type Description
Used OPTIONAL string or array of strings Identifier(s) of the provEntity (one or more) or environment (one or more) used by this activity.
Related provEntities MUST be described as specified in the ProvEntities section.
Related environments MUST be described as specified in the Environments section.

And in the ProvEntities section:

Each file with a ent suffix is a JSON file describing provEntities.
These files SHOULD not contain provEntities describing data files that are available in the dataset. Use sidecar JSON files instead for this purpose (see Provenance of a BIDS file).
These files SHOULD not contain provEntities describing the current dataset. Use dataset_description.json files instead for this purpose (see Provenance of a BIDS dataset).

Issue

For simplification reasons we would like that it is not required to write a whole provEntity JSON object in a provenance file for BIDS files or BIDS datasets listed in the Used metadata field. (In other cases, we still want the data to be described by a provEntity because we cannot derive enough information from an identifier that is not a BIDS URI)

For now we cover the following cases:

  • a BIDS file that is part of the dataset in which the provenance is described:
    • we use a BIDS URI to link to the file ;
    • we can even use the sidecar JSON of the file as a container for other metadata that would normally be stored in a provEntity.
  • the BIDS dataset in which the provenance is described:
    • we use a BIDS URI to link to the dataset (bids::.) ;
    • we can even use the dataset_description.json as a container for other metadata that would normally be stored in a provEntity.

However, we do not cover the following cases:

  • a BIDS file that is part of another dataset ;
  • another BIDS dataset.

Partial solution

We do not require describing a provEntity for any BIDS file or any BIDS dataset, e.g.:

These files SHOULD not contain provEntities describing available BIDS files (see Provenance of a BIDS file for this purpose).
These files SHOULD not contain provEntities describing available BIDS datasets. (see Provenance of a BIDS dataset for this purpose).

This option works well unless users want to describe other metadata fields (e.g. Digest, Type, or other metadata) that would normally be inside a provEntity JSON object and that cannot be derived from a BIDS URI... In this case users have to go back to describing provEntities inside provenance files.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions