To create a streaming job, you need to define a stream custom resource (CR) that specifies the source and sink of the data stream. You can use one of the available streaming plugins or create your own.
When a new stream resource is created, Arcane will create a backfill request for the stream. The backfill request will be picked up by the operator, which will create a Kubernetes job to run the streaming job.
When the backfill process is completed, the operator will create a job that continues the stream.
To create a stream without a backfill, define a stream custom resource (CR) with the field spec.suspended set to true. You can then unsuspend the stream to start it in streaming mode.
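As a sketch, a suspended stream CR might look like the following. The kind and the source/sink fields are illustrative placeholders and depend on the streaming plugin you use; only spec.suspended is the field discussed above:

```yaml
apiVersion: streaming.sneaksanddata.com/v1
kind: StreamDefinition        # hypothetical plugin-specific kind
metadata:
  name: my-stream
  namespace: arcane-stream-mock
spec:
  suspended: true             # create the stream without starting a job
  # ...plugin-specific source and sink settings go here...
```

Setting spec.suspended to false later starts the stream in streaming mode without a backfill.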
To suspend a stream, update the stream custom resource (CR) and set the field spec.suspended to true.
This stops the streaming job and prevents it from processing any new data.
To start backfill for an existing stream, you need to create a backfill request custom resource (CR) that references the stream. The backfill request will be picked up by the operator, which will create a Kubernetes job to run the backfill process.
Example backfill request:

```yaml
apiVersion: streaming.sneaksanddata.com/v1
kind: BackfillRequest
metadata:
  name: my-backfill-request
  namespace: arcane-stream-mock
spec:
  streamClass: arcane-stream-mock
  streamId: my-stream
```

If your stream has failed, you can set spec.suspended to true to stop the stream.
To avoid data loss, you can create a backfill request that fills in any gaps that occurred during the failure.
After that, you can set spec.suspended to false to restart the stream.
Arcane streaming is built on top of Kubernetes Jobs. By default, when a pod is deleted manually or due to node eviction, all containers in the pod receive a SIGTERM signal and have a grace period to shut down gracefully with exit code 0. If the containers do not shut down within the grace period, the pod is forcefully terminated. If the pod terminates with a non-zero exit code, Kubernetes counts the pod as failed and the job may transition to a failed state. It is the responsibility of the user and/or the plugin developer to ensure that the streaming job either handles termination signals gracefully and exits with code 0, or that the exit code returned by the plugin executable on termination is added to the job's podFailurePolicy.
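For reference, a Job podFailurePolicy that tolerates a termination exit code could be sketched as below. The job name, image, and exit code 143 (the conventional 128 + SIGTERM value) are assumptions for illustration; note that podFailurePolicy requires the pod template's restartPolicy to be Never:

```yaml
apiVersion: batch/v1
kind: Job
metadata:
  name: my-stream-job           # placeholder name
spec:
  backoffLimit: 3
  podFailurePolicy:
    rules:
      # Do not count SIGTERM-driven exits (128 + 15 = 143) as job failures
      - action: Ignore
        onExitCodes:
          operator: In
          values: [143]
  template:
    spec:
      restartPolicy: Never      # required when podFailurePolicy is set
      containers:
        - name: stream
          image: example/stream-plugin:latest   # placeholder image
```

With a rule like this, a pod evicted during node maintenance does not push the job toward its backoffLimit.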
Currently, if you apply any changes to a stream definition YAML while there is an active backfill request, the operator will restart the backfill to apply your changes.
This behaviour will be improved once lock-on is implemented.
If you delete a backfill request while the backfill job is running, the operator will stop the backfill job and restart the stream in streaming mode.