[Spike] Scale OSSF Scorecards prescriptions out of GitHub for aggregation by revision #31968

@mayaCostantini

Is your feature request related to a problem? Please describe.
We will start aggregating Scorecards prescriptions by project repository revision, as present in the new scorecards-v2 BigQuery dataset, and may also create these prescriptions for packages from other ecosystems. We therefore need a more scalable way to make this data available.
The prescriptions dataset is currently ~500M in size and, once aggregated by revision, will far exceed the recommended GitHub limit of 5GiB for a repository, causing storage and performance issues.

Describe the solution you'd like
Set up a new database (possibly non-relational), or make new Scorecards prescriptions available in an S3 bucket accessed through a web service.
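
To make the S3 option more concrete, here is a minimal sketch of how object keys could be laid out so that prescriptions can be looked up by repository revision. Everything in it is an assumption for discussion, not an agreed design: the `prescription_key` helper, the `scorecards-prescriptions` prefix, and the `.yaml` suffix are all hypothetical.

```python
from urllib.parse import quote


def prescription_key(
    repo_url: str,
    revision: str,
    prefix: str = "scorecards-prescriptions",  # hypothetical bucket prefix
) -> str:
    """Build a hypothetical object key for the prescriptions of one repository revision."""
    # Drop the URL scheme so the key starts with the host, e.g. github.com/org/repo.
    repo = repo_url.split("://", 1)[-1].strip("/")
    if not repo or not revision:
        raise ValueError("repo_url and revision are required")
    # Percent-encode unsafe characters; keep '/' as the separator inside the repo path.
    return f"{prefix}/{quote(repo, safe='/')}/{quote(revision, safe='')}.yaml"


if __name__ == "__main__":
    print(prescription_key("https://github.com/thoth-station/core", "abc123"))
```

With a layout like this, a web service could resolve a (repository, revision) pair to a single object without listing the bucket, and all revisions of one repository would share a common key prefix.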

Additional context
Related to thoth-station/core#440

Labels

kind/feature: Categorizes issue or PR as related to a new feature.
priority/important-soon: Must be staffed and worked on either currently, or very soon, ideally in time for the next release.
sig/stack-guidance: Categorizes an issue or PR as relevant to SIG Stack Guidance.
