improve search performance by scanning with the timestamp index iteratively #2698

lilydjwg · 2025-04-13T06:04:12Z

Or else SQLite uses the command index and sort. Even with the timestamp index, SQLite still needs sort.

This is what atuin uses:

select * from history indexed by idx_history_command where deleted_at is null group by command having max(timestamp) order by timestamp desc limit 100;
|--SCAN history USING INDEX idx_history_command
`--USE TEMP B-TREE FOR ORDER BY

Try to tell SQLite to use the timestamp index. Slightly faster.

select * from history indexed by idx_history_timestamp where deleted_at is null group by command having max(timestamp) order by timestamp desc limit 100;
|--SCAN history USING INDEX idx_history_timestamp
|--USE TEMP B-TREE FOR GROUP BY
`--USE TEMP B-TREE FOR ORDER BY

The fastest index to use is...not to use an index at all.

select * from history not indexed where deleted_at is null group by command having max(timestamp) order by timestamp desc limit 100;
QUERY PLAN
|--SCAN history
|--USE TEMP B-TREE FOR GROUP BY
`--USE TEMP B-TREE FOR ORDER BY

The following one is very fast, but it might not fetch enough rows due to duplications. So we do the deduplication ourselves instead and let SQLite just scan using the index and never sort. The downside of this method is that it doesn't perform well when there are too many duplicates, hence #2697.

select * from (select * from history where deleted_at is null order by timestamp desc limit 1000) group by command having max(timestamp);
QUERY PLAN
|--CO-ROUTINE (subquery-1)
|  `--SCAN history USING INDEX idx_history_timestamp |--SCAN (subquery-1)
`--USE TEMP B-TREE FOR GROUP BY

For the following command, the elapsed times are (lowest among several runs):

atuin search --search-mode fuzzy --limit 100 --cmd-only > /dev/null

command index   : 0.154s
timestamp index : 0.139s
no index        : 0.124s
paging          : 0.012s

Checks

I am happy for maintainers to push small adjustments to this PR, to speed up the review cycle
I have checked that there are no existing pull requests for the same thing

…tively Or else SQLite uses the command index and sort. Even with the timestamp index, SQLite still needs sort. This is what atuin uses: select * from history indexed by idx_history_command where deleted_at is null group by command having max(timestamp) order by timestamp desc limit 100; |--SCAN history USING INDEX idx_history_command `--USE TEMP B-TREE FOR ORDER BY Try to tell SQLite to use the timestamp index. Slightly faster. select * from history indexed by idx_history_timestamp where deleted_at is null group by command having max(timestamp) order by timestamp desc limit 100; |--SCAN history USING INDEX idx_history_timestamp |--USE TEMP B-TREE FOR GROUP BY `--USE TEMP B-TREE FOR ORDER BY The fastest index to use is...not to use an index at all. select * from history not indexed where deleted_at is null group by command having max(timestamp) order by timestamp desc limit 100; QUERY PLAN |--SCAN history |--USE TEMP B-TREE FOR GROUP BY `--USE TEMP B-TREE FOR ORDER BY The last one is very fast, but it might not fetch enough rows due to duplications. So we do the deduplication ourselves instead and let SQLite just scan using the index and never sort. select * from (select * from history where deleted_at is null order by timestamp desc limit 1000) group by command having max(timestamp); QUERY PLAN |--CO-ROUTINE (subquery-1) | `--SCAN history USING INDEX idx_history_timestamp |--SCAN (subquery-1) `--USE TEMP B-TREE FOR GROUP BY For the following command, the elpased times are (lowest among several runs): atuin search --search-mode fuzzy --limit 100 --cmd-only > /dev/null command index : 0.154s timestamp index : 0.139s no index : 0.124s paging : 0.012s

lilydjwg force-pushed the paging branch from 6ecd7f3 to b24045e Compare April 13, 2025 06:35

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

improve search performance by scanning with the timestamp index iteratively #2698

improve search performance by scanning with the timestamp index iteratively #2698

Uh oh!

lilydjwg commented Apr 13, 2025

Uh oh!

Uh oh!

Uh oh!

improve search performance by scanning with the timestamp index iteratively #2698

Are you sure you want to change the base?

improve search performance by scanning with the timestamp index iteratively #2698

Uh oh!

Conversation

lilydjwg commented Apr 13, 2025

Checks

Uh oh!

Uh oh!