Skip to content

Summarization principle implementation #20

@ericearl

Description

@ericearl

@Lestropie I'm so glad someone could start work on this project. It's been on my backlog a loooooooooooong time.

Anyway, for this issue, I am wondering whether your issue #2 or issue #4 already cover the conceptual idea (as I understand it) of the "summarization principle" proposed on bids-standard/bids-2-devel#65.

I believe an implementation of this might look something like:

  1. Given any BIDS dataset, find all unique (and minimal) combinations of JSON filename entities (the _key-value pairs) for each suffix. Broadly call this a "unique combination" for the sake of this GH issue.
  2. For each unique combination, find all JSON metadata which is the same across all appearances of that unique combination and "summarize it" to the root level of the BIDS hierarchy.
  3. Simultaneously remove these root-level JSON appearing fields from "leaf-level" (subfiles/subdirectories) JSONs, leaving behind only differing metadata.
  4. This way, any differing metadata (metadata that is not common between the root-level JSON and all leaf-level JSONs) would not be found in the root-level summarized JSON.

Tagging @yarikoptic and @effigies to check if I missed something in this conceptual implementation. Give a thumbs up if it's fine, give a comment if I misunderstood the concept. And sorry in advance if I misunderstood the concept.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions