Skip to content

Analyze PPTSS sampling behavior #103

Closed
@wrigleyDan

Description

@wrigleyDan

The PPTSS sampler stops when an already sampled query is sampled repeatedly.

This happens especially in small scale scenarios (less than 100 unique queries in ubi_queries index).

We want to understand if this behavior is correct and if there are ways to handle such situations better, for example by falling back to the "all" sampling method.

Use ubi_queries data in a Python notebook example to see if this behavior can be reproduced outside of the eval app.

Example configuration for creating a query set.

{
  "sampler": "pptss",
  "name": "a new pptss sampled set",
  "description": "",
  "sampling": "",
  "querySetSize": 10
}

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

Status

Done

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions