Skip to content

Create a QC namespace #233

@illusional

Description

@illusional

Some context: https://docs.google.com/document/d/1hO4-VAKjul25_lfYrvELocgfag3TIwBzICdHaHVnATk/edit#heading=h.1atj7ihnv188

Relevant user stories:

  • As a user, I want to explore a hypothesis during early QC.
    • This involves heavy computation, usually on the full set of sequencing groups (albeit, only a section of data).
    • As there's a lot of exploration, should be able to iteratively run analysis WITHOUT review.
    • This comes with the caveat that we DON'T want users to publish this analysis. Basically a restricted test environment.

Its usage with metamist is UNDEFINED in this, and will be more properly resolved later.

This will involve creating:

  • qc accounts for each of hail, dataproc, cromwell
  • specify users in a qc group under users.yaml
  • A set of qc buckets (qc, qc-analysis, qc-tmp) (no web bucket)
    • A $dataset-qc group (of persons) that has read access to the qc-analysis, and list access to qc / qc-tmp
    • main-full can APPEND data, but NOT read (to discourage copying results back). (Not needed if QC can read from main)
  • QC service accounts cannot access main level data (based on Hope's feedback below)
    • QC service accounts can READ the main-bucket
  • I don't think we should support the depends_on flag here, so QC groups should NOT allow access to transitive datasets.
    • the qc service accounts should be able to access the common-main bucket (for reference data)

Random notes:

  • Make sure it's included in the storage.toml
  • Produce some docs that highlight the importance of being cautious of cost with full compute access.
  • Maybe some mechanism to make this a time-dependent authorization, so we don't default to the qc namespace to test our analysis.

@violetbrina, it's worth thinking about what other implications creating a new namespace has. analysis-runner, billing, etc.

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions