Skip to content

[Feature request][V2sink-StarRocks/Doris] Ensure exactly-once semantics #4409

@ghost

Description

Search before asking

  • I had searched in the feature and found no similar feature requirement.

Description

To ensure accurate one-time semantics for reading data sources to write StarRocks/Doris(or any other data destination database for this case), a policy can be adopted that writes only when Seatunnel is checkpoint;

There is a question about the existing configuration of StarRocks sink. Seatunnel ensures exactly once by checkpoint, but StarRocks sink can be configured with batch_max_rows, batch_max_bytes, batch_interval_ms and other parameters; If both exist at the same time, it cannot be guaranteed exactly once;

This is even better: when the exactly-once option is explicitly configured, Seatunnel is written if and only if it is checkpoint, and the batch_* configuration in the sink section should be invalidated automatically even if configured

Usage Scenario

增量读取jdbc,写入StarRocks

Related issues

No response

Are you willing to submit a PR?

  • Yes I am willing to submit a PR!

Code of Conduct

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions