Skip to content

htsget: using htsget server to mask regions from users (i.e clinical usecase(s)) #838

@brainstorm

Description

@brainstorm

Context: within a clinical WGS setting, there is a need to mask out sensitive information from a particular sample and/or just provide the regions that a user has asked for. At present, htsget does not have client (i.e IGV.js/ igv-desktop / tgv) hints that support permission-based data retrieval and appropriate semantic visualization.

  1. Return different result codes (or additional fields in the response that provide hints on where the allowed range is?) for "no reads present" vs "forbidden (suppressed)" on a particular range. Rationale: at present, the client cannot discern whether there's no data in that range or just forbidden by the "range permissions" given a priori.
  2. Make sure returned blocks from htsget DO NOT contain reads that pertain to those forbidden/suppressed ranges, see Edit boundary blocks on the server side to drop non-requested data umccr/htsget-rs#238 for details.
  3. Potentially have two modes of operation depending on privacy concern levels:
    1. Suppressed mode: Implements aforementioned semantic differentiation between result codes.
    2. Strict mode: Always returns errors, not "leaking" whether a region is permitted to be seen or just not present.

/cc @mmalenic @ohofmann @andrewpatto @jrobinso

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions