Add proper sampling for estimating counts

We know how to do it. We've diced up the problem, done a bunch of the proofs and we've even written out how we'll do it. Let's do it and squeeze out that extra boost in performance when someone's sorting data that isn't uniformly distributed. It ain't that hard and we're already paying the cost for start/end counts because we knew we'd do this eventually.

That said, when I do this is it worth having an explicit uniform distribution version that doesn't do the start/end to squeeze out that tiny improvement from replacing a memory lookup with some basic arithmetic? I'll decide when I actually do this ticket.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add proper sampling for estimating counts #3

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Add proper sampling for estimating counts #3

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions