-
Notifications
You must be signed in to change notification settings - Fork 1
Open
Description
Some context: https://docs.google.com/document/d/1hO4-VAKjul25_lfYrvELocgfag3TIwBzICdHaHVnATk/edit#heading=h.1atj7ihnv188
Relevant user stories:
- As a user, I want to explore a hypothesis during early QC.
- This involves heavy computation, usually on the full set of sequencing groups (albeit, only a section of data).
- As there's a lot of exploration, should be able to iteratively run analysis WITHOUT review.
- This comes with the caveat that we DON'T want users to publish this analysis. Basically a restricted test environment.
Its usage with metamist is UNDEFINED in this, and will be more properly resolved later.
This will involve creating:
- qc accounts for each of
hail,dataproc,cromwell - specify users in a qc group under
users.yaml - A set of qc buckets (
qc,qc-analysis,qc-tmp) (no web bucket)- A
$dataset-qcgroup (of persons) that has read access to theqc-analysis, and list access toqc/qc-tmp (Not needed if QC can read from main)main-fullcan APPEND data, but NOT read (to discourage copying results back).
- A
QC service accounts cannot access main level data(based on Hope's feedback below)- QC service accounts can READ the main-bucket
- I don't think we should support the
depends_onflag here, so QC groups should NOT allow access to transitive datasets.- the
qcservice accounts should be able to access the common-main bucket (for reference data)
- the
Random notes:
- Make sure it's included in the storage.toml
- Produce some docs that highlight the importance of being cautious of cost with full compute access.
- Maybe some mechanism to make this a time-dependent authorization, so we don't default to the qc namespace to test our analysis.
@violetbrina, it's worth thinking about what other implications creating a new namespace has. analysis-runner, billing, etc.
Metadata
Metadata
Assignees
Labels
No labels