[Feature Request] Add segment sorter to `ShuffleForcedMergePolicy` instead of name-based segment sorting

### Is your feature request related to a problem? Please describe

Currently, when OpenSearch performs a forced merge (e.g. `_forcemerge?max_num_segments=1`), it wraps the underlying merge policy with [ShuffleForcedMergePolicy](https://github.com/opensearch-project/OpenSearch/blob/main/server/src/main/java/org/opensearch/index/engine/ShuffleForcedMergePolicy.java).

From my understanding, `ShuffleForcedMergePolicy` is a special merge policy that interleaves documents from the oldest and newest segments during the merge, i.e. it mixes them together by alternating between the two. By shuffling documents from old and new segments, `ShuffleForcedMergePolicy` breaks the ordering pattern, enabling a more uniform distribution of data during force merge.

For a time-series index using `LogByteSizeMergePolicy`, merges always combine adjacent segments and assign the result a new Lucene name (via a global counter), but that name doesn’t reflect the segment’s true timestamp range. Because the current shuffle logic sorts segments by these Lucene names (`info.name`), the interleaving can drift from real time order.

This came out of https://github.com/opensearch-project/OpenSearch/issues/17404, where a variance was seen with `desc_sort_timestamp` after force merge. After some investigation, the main cause (https://github.com/opensearch-project/OpenSearch/issues/17404#issuecomment-2752178639) is document ID reassignment after the force merge, which comes from `ShuffleForcedMergePolicy`. More tests are documented here: https://github.com/opensearch-project/OpenSearch/issues/17737#issuecomment-2813469768.

A high-level example: given the segments `[_6 (Jan 1–4), _8 (Jan 4–6), _5 (Jan 6–7), _7 (Jan 7–8), _9 (Jan 8–9)]`, the current logic sorts them by name as `[_5, _6, _7, _8, _9]`, so the resulting merge order is `[_5, _9, _6, _8, _7]`.
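The name-based ordering above can be reproduced with a small, self-contained sketch. It is not the OpenSearch implementation: the class and method names are hypothetical, segments are modeled as plain name strings, and any other criteria the real policy might apply when ordering each pair are ignored.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

// Hypothetical sketch: segments are represented only by their Lucene names.
class NameOrderInterleaveSketch {

    // Alternates between the front and the back of an already-sorted list.
    static List<String> interleave(List<String> sorted) {
        List<String> out = new ArrayList<>(sorted.size());
        int left = 0;
        int right = sorted.size() - 1;
        while (left <= right) {
            out.add(sorted.get(left));
            if (left != right) {
                out.add(sorted.get(right));
            }
            left++;
            right--;
        }
        return out;
    }

    // Sorts by name (plain string order), then interleaves.
    static List<String> mergeOrderByName(List<String> names) {
        List<String> sorted = new ArrayList<>(names);
        sorted.sort(Comparator.naturalOrder());
        return interleave(sorted);
    }

    public static void main(String[] args) {
        // Segments listed in true time order: _6, _8, _5, _7, _9
        System.out.println(mergeOrderByName(List.of("_6", "_8", "_5", "_7", "_9")));
        // prints [_5, _9, _6, _8, _7]
    }
}
```

Because `_5` sorts first by name even though it holds Jan 6–7 data, it is treated as the "oldest" segment and paired with `_9`.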

### Describe the solution you'd like

Allow supplying a segment comparator (to be used for time-series indices) pulled from the index’s own LeafSorter, so that the data is evenly interleaved and sort queries don't pay a penalty from skipping Lucene’s BKD optimization.

Using the same high-level example, `[_6 (Jan 1–4), _8 (Jan 4–6), _5 (Jan 6–7), _7 (Jan 7–8), _9 (Jan 8–9)]`, the merge order should be `[_6, _9, _8, _7, _5]`. This gives a more predictable DocID assignment, and the newest and oldest data are interleaved properly.
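As a rough illustration of the proposal, the sketch below makes the sort step pluggable. Everything here is an assumption for illustration: `Seg`, `minDay`, `BY_NAME`, `BY_MIN_TIME`, and `mergeOrder` are hypothetical names, and the time-based comparator stands in for whatever ordering would actually be derived from the index's LeafSorter.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

// Hypothetical sketch of a shuffle step that accepts a segment comparator.
class SegmentSorterSketch {

    // Stand-in for a segment: Lucene name plus the first day (of January) it covers.
    static final class Seg {
        final String name;
        final int minDay;

        Seg(String name, int minDay) {
            this.name = name;
            this.minDay = minDay;
        }
    }

    // Ordering by Lucene segment name.
    static final Comparator<Seg> BY_NAME = Comparator.comparing((Seg s) -> s.name);

    // Assumed time-based ordering, standing in for a LeafSorter-derived comparator.
    static final Comparator<Seg> BY_MIN_TIME = Comparator.comparingInt((Seg s) -> s.minDay);

    // Sorts with the supplied comparator, then alternates front and back.
    static List<String> mergeOrder(List<Seg> segments, Comparator<Seg> sorter) {
        List<Seg> sorted = new ArrayList<>(segments);
        sorted.sort(sorter);
        List<String> out = new ArrayList<>(sorted.size());
        int left = 0;
        int right = sorted.size() - 1;
        while (left <= right) {
            out.add(sorted.get(left).name);
            if (left != right) {
                out.add(sorted.get(right).name);
            }
            left++;
            right--;
        }
        return out;
    }

    // The five segments from the example in this issue.
    static List<Seg> example() {
        return List.of(
            new Seg("_6", 1), new Seg("_8", 4), new Seg("_5", 6),
            new Seg("_7", 7), new Seg("_9", 8));
    }

    public static void main(String[] args) {
        System.out.println(mergeOrder(example(), BY_NAME));     // [_5, _9, _6, _8, _7]
        System.out.println(mergeOrder(example(), BY_MIN_TIME)); // [_6, _9, _8, _7, _5]
    }
}
```

Keeping the name-based comparator as the default and passing a time-based one only for time-series indices would leave behavior unchanged for other indices.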

### Related component

Search

### Describe alternatives you've considered

_No response_

### Additional context

_No response_

[Feature Request] Add segment sorter to `ShuffleForcedMergePolicy` instead of name-based segment sorting #18168
