pulsar IO sink missing after functions worker cluster leader restart #24202
Unanswered
bohuslav-broza
asked this question in
Q&A
Replies: 1 comment 1 reply
-
Do you find any error logs on the brokers / function workers? |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
hi everyone,
for years we are using Apache Pulsar deployed to K8s and at the moment we are running the version 3.3.5. Lately we extended our Pulsar usage with a Pulsar IO sink. We are dumping data from a topic to an AWS S3 using the cloud storage sink connector version 3.3.5.2 and for pulsar resource management we use pulsar resource operator version 0.9.2.
With our configuration we run into situations, where after restart of the functions worker cluster leader the sink stops operate. Details follows. Any help, hint or suggestion is highly appreciated.
Our cluster consists of two brokers and the functions workers are configured to run on those brokers with kubernetess runtime for the function instance
Functions workers form a cluster and typically broker-1 is the leader
The sink is created using the pulsar resource operator
The sink pod pf-auditlog-default-testsink-0 is created correctly in the auditlog k8s namespce and all the required K8s object - sts and service. The sink is assigned to the broker-0
When we restart the broker-0 the sink continues to work, however if we restart the leader of the functions worker cluster the broker-0 deletes the sink function (all the K8s objects) after the broker-1 becomes the leader again and we end up in the situation, where there is no pod running with the sink function
However the operator itself thinks the sink is running fine.
Functions worker configuration
broker-1-functions_worker.txt
broker-0-functions_worker.txt
The expected behaviour I require is that during the functions worker cluster changes, which are often in the K8s environment as pods are rescheduled or fail, the sink would survive these events and continue to operate. Any help, hint or suggestion is appreciated.
Beta Was this translation helpful? Give feedback.
All reactions