refactor(database, state-indexer): State schema improvements for reads and updates #410

khorolets · 2025-07-30T06:53:02Z

This PR is a continuation of @kobayurii work done in #394 that we have rejected because we could not write (index) the data fast enough to support that idea.

After a bunch of different experiments and other ideas for improving the schema we got back to that schema and optimized it enough to increase a throughput on our side to be able to index the tip of the network or backfill.

TL;DR

Average blocks per second:

Before: 0.1 BPS
After: 5.0-12.0 BPS

Key changes:

state_changes_{family} tables have to be converted into state_changes_{family}_compact:
- block_height changes to block_height_from representing the moment in time when the particular record becomes "active"
- new field block_height_to together with the block_height_from builds up a range of "active" period for a particular record
- Altogether this allows to dramatically shrink the lookup range for PostgreSQL thus increasing the speed or reads for all the state changes related data in ReadRPC (state, access keys, account, contract)
Inserts and updates (introduced logic to support block_height_to) are refactored to be done in parallel on per partition basis.
- A function get_text_partition on the PostgreSQL side that allows to map account_id to the partition is added
And a cherry on top: we need to migrate block_height_from and block_height_to column types from numeric(20, 0) to bigint. Correspondingly, they represent u64 and i64. And while block height is u64 (the reason we used numeric(20, 0)) we still can fit it in i64 for a long time from now. The bigint field is much more performant on the PostgreSQL side for our inserts and updates.

NB! Before releasing this we need to fix/update migrations to ensure we create the proper schema. Also we need to migrate our entire databases for a new schema.

…changes that is more efficient to read from

…ndexer into files. Remove redundant block_hash from handle_state_changes method in logic-state-indexer

… long state indexer writes take time and how many partitions touched

…artition number. Switch CTE to unnest for updates

…eights to biging (i64) to speed inserts and updates up

… to use i64 instead of BigDecimal

kobayurii

Grate! Thank you!

khorolets requested a review from kobayurii July 30, 2025 06:53

khorolets added enhancement New feature or request performance labels Jul 30, 2025

khorolets force-pushed the refactor/read-optimized-schema branch from d5cf306 to b9bfcdd Compare July 30, 2025 07:06

khorolets changed the base branch from main to develop July 30, 2025 07:25

kobayurii force-pushed the refactor/read-optimized-schema branch 2 times, most recently from 2491ff4 to 41a471b Compare July 31, 2025 06:30

khorolets and others added 9 commits August 6, 2025 14:43

refactor(database,state-indexer): Introduce compact schema for state_…

fd313d5

…changes that is more efficient to read from

chore(database, logic-state-indexer): Split database/postgres/state_i…

b0be71c

…ndexer into files. Remove redundant block_hash from handle_state_changes method in logic-state-indexer

chore(database, state-indexer): Add additional metrics to monitor how…

fec57e3

… long state indexer writes take time and how many partitions touched

refactor(database, state-indexer): Add postgresql function to match p…

bc7f3c7

…artition number. Switch CTE to unnest for updates

refactor(database, state-indexer): Replace numberic(20,0) for block_h…

98c9246

…eights to biging (i64) to speed inserts and updates up

refactor(database, rpc-server): Update read queries related to states…

b34a87d

… to use i64 instead of BigDecimal

add migration scripts

d07e2b2

add indexes for new tables

d5b9f4d

start state indexer from interaption block

1bcc8c6

kobayurii force-pushed the refactor/read-optimized-schema branch from 4af4ca4 to 1bcc8c6 Compare August 6, 2025 11:43

kobayurii added 4 commits August 11, 2025 12:06

paginated state optimization

c370570

fix paginated state

a10ead2

fix query

7890fc9

fix page_token

e060710

kobayurii marked this pull request as ready for review August 11, 2025 15:37

kobayurii approved these changes Aug 11, 2025

View reviewed changes

kobayurii merged commit 591ab22 into develop Aug 12, 2025
3 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

refactor(database, state-indexer): State schema improvements for reads and updates #410

refactor(database, state-indexer): State schema improvements for reads and updates #410

Uh oh!

khorolets commented Jul 30, 2025

Uh oh!

kobayurii left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

refactor(database, state-indexer): State schema improvements for reads and updates #410

refactor(database, state-indexer): State schema improvements for reads and updates #410

Uh oh!

Conversation

khorolets commented Jul 30, 2025

TL;DR

Key changes:

Uh oh!

kobayurii left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants