perf: Disable morsel splitting for fast-count on streaming engine #26245

nameexhaustion · 2026-01-22T17:10:42Z

Fixes Disable morsel splitting in I/O sources if scans are connected directly to in-mem sink #22702

Streaming queries that collect to an in-memory DataFrame and consist only of column projections (and pl.len()) will now disable morsel splitting at supported sources.

E.g -

Scan->InMemorySink scan_parquet().collect()
Fast-count scan_parquet().select(pl.len()).collect()
Simple projections scan_parquet().select("<column>", "column", ..).collect()

Equivalently when starting from InMemorySource -

InMemorySource->InMemorySink LazyFrame().collect()
Fast-count LazyFrame().select(pl.len()).collect()
Simple projections LazyFrame().select("<column>", "column", ..).collect()

Benchmark

Description: scan_parquet().select(pl.len()).collect()
File: 4B rows x 0 columns

Runtime Before	Runtime After	Speedup
0.0728s	0.000207s	351x

Before this PR the parquet source would split to morsels of 100k rows (sending ~42,949 morsels). It now sends only a single morsel.

Test script

from time import perf_counter
import polars as pl

path = "/Users/nxs/git/polars/.env/_data_out/big.parquet"
pl.LazyFrame(height=(1 << 32) - 1).sink_parquet(path)

q = pl.scan_parquet(path).select(pl.len())
print(q.explain(engine="streaming"))

timings = []
for _ in range(5):
    t = perf_counter()
    q.collect()
    timings.append(perf_counter() - t)

print(f"{min(timings) = }")

codecov · 2026-01-22T18:57:32Z

Codecov Report

❌ Patch coverage is 99.15254% with 1 line in your changes missing coverage. Please review.
✅ Project coverage is 81.12%. Comparing base (5dd9b23) to head (889ab84).

Files with missing lines	Patch %	Lines
...ates/polars-stream/src/nodes/io_sources/ipc/mod.rs	91.66%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main   #26245      +/-   ##
==========================================
+ Coverage   78.35%   81.12%   +2.76%     
==========================================
  Files        1777     1777              
  Lines      241720   241816      +96     
  Branches     3085     3085              
==========================================
+ Hits       189406   196172    +6766     
+ Misses      51517    44848    -6669     
+ Partials      797      796       -1

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

github-actions bot added performance Performance issues or improvements python Related to Python Polars rust Related to Rust Polars labels Jan 22, 2026

nameexhaustion force-pushed the nxs/fast-count-no-split-morsels branch from d4c1ea4 to 5e0ec6c Compare January 22, 2026 17:14

nameexhaustion changed the title ~~perf: Disable morsel splitting for fast-count~~ perf: Disable morsel splitting for fast-count on streaming Jan 22, 2026

github-actions bot added the A-streaming Related to the streaming engine label Jan 22, 2026

nameexhaustion force-pushed the nxs/fast-count-no-split-morsels branch 2 times, most recently from 2663041 to df594b2 Compare January 22, 2026 21:40

c

889ab84

nameexhaustion force-pushed the nxs/fast-count-no-split-morsels branch from 2242cb6 to 889ab84 Compare January 22, 2026 23:23

pola-rs deleted a comment from github-actions bot Jan 22, 2026

nameexhaustion changed the title ~~perf: Disable morsel splitting for fast-count on streaming~~ perf: Disable morsel splitting for fast-count on streaming engine Jan 23, 2026

nameexhaustion marked this pull request as ready for review January 23, 2026 04:16

nameexhaustion requested review from MarcoGorelli, alexander-beedie, c-peters, orlp and ritchie46 as code owners January 23, 2026 04:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf: Disable morsel splitting for fast-count on streaming engine #26245

perf: Disable morsel splitting for fast-count on streaming engine #26245

nameexhaustion commented Jan 22, 2026 •

edited

Loading

Uh oh!

codecov bot commented Jan 22, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

perf: Disable morsel splitting for fast-count on streaming engine #26245

Are you sure you want to change the base?

perf: Disable morsel splitting for fast-count on streaming engine #26245

Conversation

nameexhaustion commented Jan 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Benchmark

Uh oh!

codecov bot commented Jan 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

nameexhaustion commented Jan 22, 2026 •

edited

Loading

codecov bot commented Jan 22, 2026 •

edited

Loading