I will add here some of the findings related to performance bottlenecks I've found in the current Lens/SearchStrategy architecture, along with possible improvements.
Unnecessary multiple requests with the same esagg query
When a search is sent to ES and a response is received, the search service checks whether the request needs a post-flight request.
kibana/src/plugins/data/common/search/search_source/search_source.ts
Lines 539 to 548 in 74fdd1b
```ts
if (!this.hasPostFlightRequests()) {
  obs.next(this.postFlightTransform(response));
  obs.complete();
} else {
  // Treat the complete response as partial, then run the postFlightRequests.
  obs.next({
    ...this.postFlightTransform(response),
    isPartial: true,
    isRunning: true,
  });
```
If needed, it transforms the response into a partial response and updates the body with the post-flight request. This works correctly when the post-flight request is actually necessary, but due to the current implementation the post-flight request is always "applied" even when not needed, causing a subsequent request to be sent to ES.
This results in:
- more time spent unnecessarily before returning the results to the client
- 1 more unnecessary search strategy cache check
- 1 more unnecessary run of tabify
Analysis
The current check for whether a request needs a subsequent post-flight request relies on a loose condition in the hasPostFlightRequests function, which only checks whether the agg's type.postFlightRequest property is a function.
kibana/src/plugins/data/common/search/search_source/search_source.ts
Lines 474 to 483 in 74fdd1b
```ts
private hasPostFlightRequests() {
  const aggs = this.getField('aggs');
  if (aggs instanceof AggConfigs) {
    return aggs.aggs.some(
      (agg) => agg.enabled && typeof agg.type.postFlightRequest === 'function'
    );
  } else {
    return false;
  }
}
```
This function is defined even when it is not required. For example, in a terms aggregation without the other bucket, the function is still assigned but just returns the response unchanged:

```ts
postFlightRequest: createOtherBucketPostFlightRequest(constructSingleTermOtherFilter),
```
In all the other cases it defaults to an identity function, so the hasPostFlightRequests function will always return true.

```ts
this.postFlightRequest = config.postFlightRequest || identity;
```
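As a rough sketch of a possible fix (not existing code: `requiresPostFlightRequest` is a hypothetical predicate each agg type would have to implement, e.g. the terms agg returning true only when the other bucket is enabled), the check could ask the agg type whether a post-flight request is actually needed for the current params:

```ts
// Sketch only: hypothetical tighter check. `requiresPostFlightRequest` does not
// exist today; each agg type would implement it (e.g. terms: only when the
// other bucket param is enabled) instead of relying on the identity default.
private hasPostFlightRequests() {
  const aggs = this.getField('aggs');
  if (aggs instanceof AggConfigs) {
    return aggs.aggs.some(
      (agg) =>
        agg.enabled &&
        typeof agg.type.requiresPostFlightRequest === 'function' &&
        agg.type.requiresPostFlightRequest(agg)
    );
  }
  return false;
}
```

This would let the search source complete immediately in the common case where no extra request is needed, skipping the extra search strategy cache check and tabify run.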
wait_for_completion_timeout value is too low and delays complete responses
This parameter, used in async search, defines the timeout before the async search returns with a partial result. It is currently set to 200ms.
kibana/src/plugins/data/config.ts
Line 58 in b8d8c73
```ts
waitForCompletion: schema.duration({ defaultValue: '200ms' }),
```
After this 200ms interval the polling mechanism kicks in, and the results are then delayed each time by at least ~300ms.
kibana/src/plugins/data/common/search/poll_search.ts
Lines 20 to 35 in b8d8c73
```ts
const getPollInterval = (elapsedTime: number): number => {
  if (typeof pollInterval === 'number') return pollInterval;
  else {
    // if static pollInterval is not provided, then use default back-off logic
    switch (true) {
      case elapsedTime < 1500:
        return 300;
      case elapsedTime < 5000:
        return 1000;
      case elapsedTime < 20000:
        return 2500;
      default:
        return 5000;
    }
  }
};
```
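To illustrate the effect of these numbers, here is a rough approximation only (it ignores per-poll request latency and uses a simplified helper rather than the real polling code) of when the client can see a result under the current defaults:

```ts
// Rough illustration, not the actual implementation: estimate when the client
// can see a result that Elasticsearch finishes at `esCompletionTime` ms, given
// the 200ms wait_for_completion_timeout and the back-off intervals above.
const backOff = (elapsed: number) =>
  elapsed < 1500 ? 300 : elapsed < 5000 ? 1000 : elapsed < 20000 ? 2500 : 5000;

const deliveryTime = (esCompletionTime: number, waitForCompletion = 200) => {
  let elapsed = waitForCompletion; // first (partial) response after the wait timeout
  while (elapsed < esCompletionTime) {
    elapsed += backOff(elapsed); // next poll only after the back-off interval
  }
  return elapsed;
};

deliveryTime(250); // ~500ms: a query ES finishes in 250ms is delivered ~250ms late
deliveryTime(1600); // ~1700ms
```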
I probably don't have enough knowledge in this area, but I don't see any major drawback in increasing this value to at least 1s, as proposed in #157837 (comment), or even more.
The main drawback is an open connection between ES and Kibana that lasts for ~1 second, instead of opening and closing a new one 5 times in the same time interval.
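As a minimal sketch of the proposed change (an assumption about where it would land, based on the config line quoted above), the default could simply be raised:

```ts
// src/plugins/data/config.ts — sketch of the proposed default bump
waitForCompletion: schema.duration({ defaultValue: '1s' }),
```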
getXDomain can be sped up
When using cartesian charts, we compute the x domain. If that domain is big, the time to compute it is significant: for example, for a 50k data point dataset it took ~40ms. This can probably be cut in half with a better data-processing strategy, avoiding multiple array scans to sort, filter, and map values and instead looping over the data just once with a reduce.
