feat(bufferCountWithDebounce): add new operator #380
Conversation
Rather than redefining …
I think this is how … The following test should pass, but currently doesn't:

```ts
it('should fill buffers', async () => {
  const sourceDelay = 900;
  const bufferSize = 3;
  const maxWaitTime = 4000;
  // maxWaitTime > sourceDelay * (bufferSize + 1)
  //
  // Setting 'maxWaitTime > sourceDelay * bufferSize' causes the timeout to finish while the
  // 'bufferCountOrTime' generator is suspended, so the next time a new value is polled it will
  // be the timerEvent. However, because we check 'buffer.length > 0' when we receive a timer
  // event, we have to wait 'bufferSize + 1' source periods so the buffer fills with one more
  // element before it is yielded.
  // Essentially, because maxWaitTime > sourceDelay * bufferSize, the timeout finishes right in
  // the middle of the second "filling" of the buffer, so it is yielded half-empty.
  const source = interval(sourceDelay);
  const res = source.pipe(bufferCountOrTime(bufferSize, maxWaitTime));
  await expect(toArray(res.pipe(take(2)))).resolves.toEqual([
    [0, 1, 2],
    [3, 4, 5],
  ]); // Actually gives [[0, 1, 2], [3]]
});
```

This is my opinion on how the semantics of …
Ah, I see the confusion. The …
Force-pushed from ceff6a7 to ecf009b
Fixed!
This implementation makes a lot more sense to me. Instead of a fixed interval running in the background, completely unrelated to the source, a new timeout is started for the specified time whenever an item is received and no timer is running. Either the buffer fills up, in which case the timeout is cleared and the full buffer is yielded, or the timeout fires and the partially filled buffer is yielded.
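For concreteness, here is a minimal sketch of those semantics as a standalone async generator. This is illustrative only, not the PR's actual code; `bufferCountOrTimeSketch` is a hypothetical name:

```ts
// Minimal sketch of the semantics described above; illustrative only.
// A timer is armed when a new buffer starts filling; the buffer is yielded
// either when it reaches `count` items (timer cleared) or when the timer
// fires with a partially filled buffer.
async function* bufferCountOrTimeSketch<T>(
  source: AsyncIterable<T>,
  count: number,
  maxWaitMs: number,
): AsyncGenerator<T[]> {
  const it = source[Symbol.asyncIterator]();
  let buffer: T[] = [];
  let timeout: Promise<'timeout'> | null = null;
  let handle: ReturnType<typeof setTimeout> | undefined;
  // Keep the in-flight next() across timer wins so no value is lost.
  let pending: Promise<IteratorResult<T>> | null = null;

  try {
    while (true) {
      if (pending === null) pending = it.next();
      // Arm the timer once a buffer has started filling and none is running.
      if (buffer.length > 0 && timeout === null) {
        timeout = new Promise<'timeout'>((resolve) => {
          handle = setTimeout(() => resolve('timeout'), maxWaitMs);
        });
      }
      const winner = timeout
        ? await Promise.race([pending, timeout])
        : await pending;

      if (winner === 'timeout') {
        // Timer fired first: flush the partial buffer; next() stays pending.
        timeout = null;
        yield buffer;
        buffer = [];
        continue;
      }

      pending = null;
      if (winner.done) {
        if (buffer.length > 0) yield buffer;
        return;
      }
      buffer.push(winner.value);
      if (buffer.length === count) {
        // Full buffer won the race: cancel the timer and yield.
        clearTimeout(handle);
        timeout = null;
        yield buffer;
        buffer = [];
      }
    }
  } finally {
    clearTimeout(handle);
    await it.return?.();
  }
}
```

The key point is that the in-flight `next()` is raced against a per-buffer timer: a full buffer cancels the timer, while a firing timer flushes whatever has accumulated so far.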
With the current implementation I get a lot of partially filled buffers. For example, with buffer count 50, buffer window 5 s, a source producing 10 msg/s, and a processor consuming 12.5 msg/s, i.e. a pipeline like `source -> buffer (5s) -> processor (4s)`: on the first run you get a full buffer, but because the interval keeps running, by the time another batch of items is requested (after 4 seconds) the timeout has almost expired and only collects another 10 items (in 1 second). It makes more sense to me to start the timer once an item arrives and essentially race a full buffer against the timer firing.

The consumer actually has nothing to do with this, see #380 (comment).
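To make the timing concrete, here is the sequence under the numbers above (an illustrative reading of the fixed-interval behavior, assuming the consumer pulls lazily):

```text
t = 0 s   consumer pulls; buffer starts filling, fixed 5 s interval running
t = 5 s   50 items collected (10 msg/s × 5 s) -> full buffer yielded;
          the generator suspends while the processor works
t = 9 s   processor done after 4 s, next batch requested; the interval kept
          running, so only ~1 s of the window remains
t = 10 s  interval fires -> partial buffer of ~10 items (10 msg/s × 1 s)
```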
Lastly, I think it would make the most sense for this operator to keep batching items in the background, so that once the consumer requests the next batch it is already present (a rough sketch below). But that's a bigger change than this one, so I thought I'd start here.
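As a rough illustration of that idea (hypothetical, not part of this PR): keeping one pull in flight while the consumer processes the current batch would let the operator assemble the following batch in the background.

```ts
// Hypothetical sketch, not part of this PR: keep one pull in flight so the
// next batch is already being assembled while the consumer processes the
// current one.
async function* prefetchOne<T>(source: AsyncIterable<T>): AsyncGenerator<T> {
  const it = source[Symbol.asyncIterator]();
  let pending = it.next(); // start producing immediately
  try {
    while (true) {
      const result = await pending;
      if (result.done) return;
      pending = it.next(); // begin the next batch before yielding this one
      yield result.value;
    }
  } finally {
    await it.return?.();
  }
}

// e.g. prefetchOne(source.pipe(bufferCountOrTime(50, 5000)))
```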