Description
This is a meta-issue on tail-based sampling.
Tail-based sampling comes up frequently in bug reports, as there is minimal documentation and guidance on TBS configuration. It is not clear to users how TBS works, which leads to misconfigured TBS storage sizes and, consequently, apm-server and ES issues.
When TBS local storage (badger) fills up, writing traces fails (apm-server logs report `error writing sampled trace: configured storage limit reached (current: 127210377485, limit: 126000000000)`) and TBS is bypassed, so the effective sampling rate jumps to 100%. This causes a performance cliff and downstream effects: a surprising, significant increase in writes to ES, which either slows ES down and creates backpressure on apm-server, or leads to unexpectedly high storage usage in ES.
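For context, here is a minimal sketch of where the storage limit sits in a standalone apm-server configuration. The keys reflect my understanding of the `apm-server.sampling.tail.*` settings; the values are illustrative, not recommendations:

```yaml
apm-server:
  sampling:
    tail:
      # Enable tail-based sampling.
      enabled: true
      # How often sampling decisions are made for trace groups.
      interval: 1m
      # Size limit of the local (badger) storage that buffers trace events
      # until a sampling decision is made. When this limit is reached, writes
      # fail and events bypass TBS (effectively 100% sampling).
      storage_limit: 3GB
      policies:
        # Catch-all policy: keep 10% of trace groups not matched by any other policy.
        - sample_rate: 0.1
```

If the storage limit is sized too small for the event volume and TTL, the failure mode described above is what users hit first.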
The task list below contains tasks to document TBS properly, to investigate and fix bugs, and to provide escape hatches as compromises.
Impact: TBS is a popular feature among heavy apm-server users, who rely on it to reduce ES storage requirements while retaining the value of the sampled traces. We need to ensure, and demonstrate, that TBS handles high load well, like the rest of apm-server.
Tasks
- docs: Benchmark and document tail-based sampling performance #11346
- Configurable option to handle events failed to be processed by TBS #11127
- TBS: Expose TTL config via integration policy #13525
- TBS: apm-server never recovers from storage limit exceeded in rare cases #14923
- Update badger to latest version #11546
- Revisit default TBS storage size limit `sampling.tail.storage_limit` and storage limit handling #14933
- TBS: Document monitoring of disk space used by Tail Based Sampling in public docs. #14996
- TBS: Expired entries stay much longer than TTL and consume disk space #15121
- TBS: Explore replacing badger with pebble #15246
- TBS automatic migration #15500
- monitoring: apm-server not shipping tbs monitoring metrics #14247
- TBS: Document discard_on_write_failure + expose it to the APM Integration #15330