Analyze PPTSS sampling behavior #103

Open
@wrigleyDan

Description

The PPTSS sampler stops early when it repeatedly draws a query that has already been sampled, so the resulting query set can be smaller than the requested querySetSize.

This happens especially in small-scale scenarios (fewer than 100 unique queries in the ubi_queries index).

We want to understand if this behavior is correct and if there are ways to handle such situations better, for example by falling back to the "all" sampling method.

Use ubi_queries data in a Python notebook example to see if this behavior can be reproduced outside of the eval app.
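
As a starting point for the notebook, here is a minimal sketch of how the behavior might be reproduced. It reads PPTSS as probability-proportional-to-size sampling and assumes the sampler draws with replacement and gives up after a fixed number of consecutive duplicate draws. The stopping rule, the `max_repeats` parameter, the function names, and the interpretation of "all" as "take every unique query" are assumptions for illustration, not taken from the eval app's code.

```python
import random
from collections import Counter


def pptss_sample(query_counts, size, max_repeats=5, rng=None):
    """Draw up to `size` distinct queries, weighted by frequency.

    Assumed stopping rule (hypothetical): give up after `max_repeats`
    consecutive draws that hit an already-sampled query.
    """
    rng = rng or random.Random()
    queries = list(query_counts)
    weights = [query_counts[q] for q in queries]
    sampled, seen, repeats = [], set(), 0
    while len(sampled) < size and repeats < max_repeats:
        query = rng.choices(queries, weights=weights, k=1)[0]
        if query in seen:
            repeats += 1  # duplicate draw, count it toward the stop limit
            continue
        seen.add(query)
        sampled.append(query)
        repeats = 0  # a new query resets the duplicate counter
    return sampled


def sample_with_fallback(query_counts, size, **kwargs):
    """Use PPTSS, but fall back to an "all"-style selection if it stops short.

    The fallback here (every unique query, most frequent first, capped at
    `size`) is an assumption about what "all" could mean.
    """
    sampled = pptss_sample(query_counts, size, **kwargs)
    if len(sampled) < min(size, len(query_counts)):
        ranked = sorted(query_counts, key=query_counts.get, reverse=True)
        return ranked[:size]
    return sampled


# Made-up stand-in for user queries read from the ubi_queries index:
# one dominant query and a handful of rare ones.
user_queries = ["laptop"] * 500 + [f"rare query {i}" for i in range(20)]
counts = Counter(user_queries)

plain = pptss_sample(counts, size=10, rng=random.Random(0))
with_fallback = sample_with_fallback(counts, size=10, rng=random.Random(0))
print(len(plain), len(with_fallback))
```

With a skewed distribution like the one above, most draws land on the dominant query, so the plain sampler tends to hit the duplicate limit well before reaching the requested size. Comparing `len(plain)` against `querySetSize` for real ubi_queries data would show whether the same effect explains what the eval app does.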

Example configuration for creating a query set:

{
  "sampler": "pptss",
  "name": "a new pptss sampled set",
  "description": "",
  "sampling": "",
  "querySetSize": 10
}
