[BUG] Cloudwatch Sink high memory usage, does not backpressure #6141

@JonahCalvo

Description

Describe the bug
When the CloudWatch sink cannot keep up with incoming data, tasks pile up in an unbounded executor queue.

In PR #5770:

Modify cloudwatch sink to use FixedThreadPool. The existing newCachedThreadPool() creates huge amount of threads (leading to out of memory error in some cases) when there is large amount of events created.

But from what I've found, a FixedThreadPool still uses an unbounded work queue:
https://stackoverflow.com/questions/53913601/what-is-the-use-case-for-unbounded-queue-in-java-executors
When the CloudWatch API is slow, that queue keeps accepting new batches without limit, so heap usage grows until the circuit breaker flips.
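For reference, this is how OpenJDK's `java.util.concurrent.Executors` builds the fixed pool; the `LinkedBlockingQueue` has no capacity bound:

```java
// Executors.newFixedThreadPool (OpenJDK): the work queue is unbounded,
// so submitted tasks accumulate without limit whenever the pool falls behind.
public static ExecutorService newFixedThreadPool(int nThreads) {
    return new ThreadPoolExecutor(nThreads, nThreads,
                                  0L, TimeUnit.MILLISECONDS,
                                  new LinkedBlockingQueue<Runnable>());
}
```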

The pool creates its threads up front, but the Data Prepper process worker threads then queue events onto the executor, and a pool thread drains that queue. In effect this is a second, hidden buffer, and it breaks our ability to apply backpressure on the overall pipeline.

We may want to consider dropping these threads entirely, but that could be a significant change.
A possible short-term solution would be to add a blocking queue here (see the sketch below):
https://github.com/opensearch-project/data-prepper/blob/087ced7fc54fb37b9f3b4dcec9[…]ugins/sink/cloudwatch_logs/client/CloudWatchLogsDispatcher.java
Maybe limit its capacity to the number of process workers.
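A minimal sketch of that short-term fix, assuming the executor construction in CloudWatchLogsDispatcher can be swapped out (the class, method, and parameter names below are illustrative, not the actual code; in particular `numberOfProcessWorkers` is an assumed input):

```java
import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.ThreadPoolExecutor;
import java.util.concurrent.TimeUnit;

public class BoundedDispatcherExecutor {
    /**
     * Builds a fixed-size pool whose work queue is capped (e.g. at the number of
     * process workers). When the queue is full, CallerRunsPolicy makes the
     * submitting process worker run the task itself, so producers slow down
     * instead of enqueueing batches without limit.
     */
    static ExecutorService create(final int numberOfThreads, final int numberOfProcessWorkers) {
        return new ThreadPoolExecutor(
                numberOfThreads, numberOfThreads,
                0L, TimeUnit.MILLISECONDS,
                new ArrayBlockingQueue<>(numberOfProcessWorkers),
                new ThreadPoolExecutor.CallerRunsPolicy());
    }
}
```

CallerRunsPolicy is one way to turn a bounded queue into backpressure; an alternative would be a RejectedExecutionHandler that calls `put()` on the queue so the submitting thread blocks rather than executing the task itself.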

To Reproduce
Steps to reproduce the behavior:

  1. Create a CloudWatch sink pipeline with a small batch size
  2. Ingest a large amount of data
  3. Observe low buffer usage, yet the circuit breaker flips because the sink consumes the available heap

Expected behavior
Sink should apply backpressure
