Experiment using ignored source for fields with no doc values or stored fields. #114886

martijnvg · 2024-10-16T09:27:15Z

A POC that tries to use ignored source as a fallback if synthetic source is enabled and a field is neither stored nor has doc values enabled.

This should be more efficient than using synthetic source in block loaders, since we avoid potentially reading many doc values / stored fields twice to synthesize the source.
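The fallback described above can be sketched roughly as follows. This is a minimal illustration, not the code in this PR: `LoaderChoiceSketch`, `Strategy` and `choose` are hypothetical names, and preferring doc values over stored fields is an assumption made for the example.

```java
public class LoaderChoiceSketch {

    /** Hypothetical strategies for reading a field's values; not Elasticsearch types. */
    public enum Strategy { DOC_VALUES, STORED_FIELD, IGNORED_SOURCE, FULL_SOURCE }

    /**
     * Picks where to read a field's values from.
     * Assumption: doc values are preferred, then stored fields.
     * Only when neither exists and synthetic source is enabled do we
     * fall back to ignored source instead of synthesizing the whole _source.
     */
    public static Strategy choose(boolean hasDocValues, boolean isStored, boolean syntheticSource) {
        if (hasDocValues) {
            return Strategy.DOC_VALUES;
        }
        if (isStored) {
            return Strategy.STORED_FIELD;
        }
        return syntheticSource ? Strategy.IGNORED_SOURCE : Strategy.FULL_SOURCE;
    }

    public static void main(String[] args) {
        // A field with neither doc values nor a stored field, synthetic source enabled.
        System.out.println(choose(false, false, true));
        // The same field when the index keeps a regular stored _source.
        System.out.println(choose(false, false, false));
    }
}
```

The point of the sketch is only that the ignored source branch is reached when both cheaper options are unavailable, so fields that do have doc values or stored fields are never read a second time.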

…ed fields. A POC that tries to use ignored source as a fallback if synthetic source is enabled and a field is neither stored nor has doc values enabled. This looks easier than fully supporting synthetic source in block loaders (pushing the source loader down to this level). It is also more efficient, since we will not load doc values / stored fields we don't need.

…ock loaders. Currently, when the compute engine loads source and the source mode is synthetic, the synthetic source loader is already used. But the `_ignored_source` field isn't always marked as a required source field, so the loaded source can miss many fields. This change includes the `_ignored_source` field as a required stored field. Long term, with synthetic source we should only load ignored source when a field has no doc values or stored field, as is being explored in elastic#114886.

…ReaderOperator via BlockSourceReader. (#114903) Currently, when the compute engine loads source and the source mode is synthetic, the synthetic source loader is already used. But the _ignored_source field isn't always marked as a required source field, so the loaded source can miss many fields. This change includes the _ignored_source field as a required stored field and allows keyword fields without doc values or stored fields to be used with synthetic source. Relying on synthetic source to get the values (because a field doesn't have stored fields / doc values) is slow. With synthetic source we already keep ignored fields/values in a special place, named ignored source. Long term, with synthetic source we should only load ignored source when a field has no doc values or stored field, as is being explored in #114886, thereby avoiding synthesizing the complete _source in order to get only one field.
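To illustrate why reading ignored source directly is cheaper than synthesizing the complete _source for a single field, here is a simplified sketch. It models ignored source as a flat list of (field, value) entries, which is an assumption for illustration only; the real encoding of `_ignored_source` differs, and `IgnoredSourceLookupSketch`, `Entry` and `valuesFor` are hypothetical names.

```java
import java.util.ArrayList;
import java.util.List;

public class IgnoredSourceLookupSketch {

    /** Hypothetical, simplified model of one ignored source entry for a document. */
    public static final class Entry {
        public final String field;
        public final Object value;

        public Entry(String field, Object value) {
            this.field = field;
            this.value = value;
        }
    }

    /**
     * Collects the values of a single field from a document's ignored source entries.
     * Entries for other fields are skipped, and no doc values or other stored
     * fields are touched, so nothing resembling the full _source is rebuilt.
     */
    public static List<Object> valuesFor(String field, List<Entry> ignoredSource) {
        List<Object> values = new ArrayList<>();
        for (Entry entry : ignoredSource) {
            if (entry.field.equals(field)) {
                values.add(entry.value);
            }
        }
        return values;
    }

    public static void main(String[] args) {
        List<Entry> doc = new ArrayList<>();
        doc.add(new Entry("host", "web-1"));
        doc.add(new Entry("tag", "blue"));
        doc.add(new Entry("tag", "green"));
        System.out.println(valuesFor("tag", doc));
    }
}
```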

…ReaderOperator via BlockSourceReader. (#114903) (#115064) Currently, when the compute engine loads source and the source mode is synthetic, the synthetic source loader is already used. But the _ignored_source field isn't always marked as a required source field, so the loaded source can miss many fields. This change includes the _ignored_source field as a required stored field and allows keyword fields without doc values or stored fields to be used with synthetic source. Relying on synthetic source to get the values (because a field doesn't have stored fields / doc values) is slow. With synthetic source we already keep ignored fields/values in a special place, named ignored source. Long term, with synthetic source we should only load ignored source when a field has no doc values or stored field, as is being explored in #114886, thereby avoiding synthesizing the complete _source in order to get only one field.

martijnvg added the :Analytics/Compute Engine (Analytics in ES|QL) and :StorageEngine/Mapping (The storage related side of mappings) labels Oct 16, 2024

elasticsearchmachine added the v9.0.0 label Oct 16, 2024

martijnvg mentioned this pull request Oct 16, 2024

Include ignored source as part of loading field values in ValueSourceReaderOperator via BlockSourceReader. #114903

Merged

martijnvg mentioned this pull request Oct 23, 2024

Improve block loader fallback to source when source mode is synthetic. #115394

Closed

3 tasks

elasticsearchmachine added v9.1.0 and removed v9.0.0 labels Jan 30, 2025

elasticsearchmachine added v9.2.0 and removed v9.1.0 labels Jun 26, 2025
