Investigate human-based flows behavior with distributed engines #30

@adutra

The driver process is generally responsible for initializing the RESTCatalog instance, and thus interacts the most with the catalog server. Human-based flows generally work well in this setup, as long as a human operator is available to interact with the driver process.

Things get more complicated, though, when FileIO instances created by executors need to interact with the catalog server. This can happen, for example, when using S3 request signing. Each signer will have its own AuthManager, but this time there won't be any human operator available, so the flow will time out and fail.
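
To make the failure mode concrete, here is a minimal Python sketch of the situation described above. It is a simulation only, not the project's code: `human_based_flow`, `run_on_driver`, `run_on_executor` and `HumanFlowTimeout` are hypothetical names, the token is a placeholder, and a `threading.Event` stands in for the operator completing the login.

```python
import threading


class HumanFlowTimeout(Exception):
    """Raised when nobody completes the interactive login in time."""


def human_based_flow(approved: threading.Event, timeout_s: float) -> str:
    """Hypothetical stand-in for a human-based flow: block until a person
    approves the login, or give up once the timeout elapses."""
    if not approved.wait(timeout_s):
        raise HumanFlowTimeout(f"no operator approved within {timeout_s}s")
    return "access-token"  # placeholder value, not a real token


def run_on_driver(timeout_s: float = 1.0) -> str:
    approved = threading.Event()
    # Simulate an operator completing the login shortly after the prompt.
    threading.Timer(0.05, approved.set).start()
    return human_based_flow(approved, timeout_s)


def run_on_executor(timeout_s: float = 0.2) -> str:
    # No operator is attached to an executor, so the event is never set
    # and the flow can only end by timing out.
    return human_based_flow(threading.Event(), timeout_s)


if __name__ == "__main__":
    print("driver:", run_on_driver())
    try:
        run_on_executor()
    except HumanFlowTimeout as exc:
        print("executor:", exc)
```

The same flow succeeds or fails depending only on whether someone is present to approve it, which is why an AuthManager created on an executor cannot complete it.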

As of now, I think human-based flows are completely incompatible with S3 request signing and with object storage credential refresh. Basically, any interaction between executors and the catalog server would fail.

We should investigate ways to improve this.
