Skip to content

[EPIC] Make Gravitino run with cloud storage #4396

Open
@jerryshao

Description

@jerryshao

Describe the proposal

This EPIC issue aims to make Gravitino support running with cloud storage, like S3, ADLS, GCS, etc.

The background is that current Gravitino only tests against HDFS, but with the more demands of cloud object storage, we should make sure that Gravitino can work well with cloud storage.

The work here includes necessary code changes, configuration supports, validations and tests.

Task list

  • Hive catalog supports cloud storage.
  • Iceberg catalog supports cloud storage
  • Fileset supports cloud storage
  • Iceberg rest catalog server supports cloud storage
  • Paimon supports cloud storage
  • Spark supports running on cloud storage
  • Flink supports running on cloud storage
  • Trino supports running on cloud storage

Activity

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Metadata

Metadata

Assignees

No one assigned

    Labels

    epicKey feature

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions