Closed
Description
Expected behavior
The default KeepAliveInterval (30s) should work without connection errors.
Actual behavior
The producers frequently log "Detected stale connection to broker" and "Failed to write on connection" errors.
Steps to reproduce
- I have about 6 producer services (about 100 Pods) that produce messages to different Pulsar server clusters;
- KeepAliveInterval is left at the default (30s) on all producers, and the servers use the same value;
- Print debug logs and increment an error metrics counter;
- Start producing messages;
- Monitor the debug logs and metrics;
- "Detected stale connection to broker" and "Failed to write on connection" errors appear quite often;
- Set KeepAliveInterval to 15s: the errors continue;
- Set KeepAliveInterval to 5s: the errors stop.
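For reference, a minimal sketch of how the interval in the steps above can be changed on the Go client, assuming the `KeepAliveInterval` field of `pulsar.ClientOptions`. The broker URL and topic name are placeholders, not values from this report, and the program needs a reachable broker to run.

```go
package main

import (
	"context"
	"log"
	"time"

	"github.com/apache/pulsar-client-go/pulsar"
)

func main() {
	// Placeholder URL; replace with the address of the cluster under test.
	client, err := pulsar.NewClient(pulsar.ClientOptions{
		URL: "pulsar://localhost:6650",
		// The default is 30s. In this report 15s still produced errors
		// and 5s did not.
		KeepAliveInterval: 5 * time.Second,
	})
	if err != nil {
		log.Fatalf("create client: %v", err)
	}
	defer client.Close()

	// Placeholder topic name.
	producer, err := client.CreateProducer(pulsar.ProducerOptions{
		Topic: "persistent://public/default/keepalive-test",
	})
	if err != nil {
		log.Fatalf("create producer: %v", err)
	}
	defer producer.Close()

	// Send one message so the connection is actually used.
	if _, err := producer.Send(context.Background(), &pulsar.ProducerMessage{
		Payload: []byte("ping"),
	}); err != nil {
		log.Fatalf("send: %v", err)
	}
}
```

Leaving `KeepAliveInterval` unset corresponds to the default (30s) case described in the first steps.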
System configuration
Pulsar version: v2.11.0
Pulsar client go version: v0.10.0