Epic 9 - Distributed Index Build #14

@rmax

Goal

Allow index builds to run in parallel across shards and workers.

Detailed tasks

  • 9.1 Parallel shard building
    • Use rayon or async tasks to build shards concurrently.
  • 9.2 Distributed build mode
    • Support splitting the dataset across workers.
  • 9.3 Merge index shards
    • Combine distributed build outputs into final artifacts.
  • 9.4 Build scheduler
    • Add CLI support via shardlake build-index --parallel.
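A minimal sketch of tasks 9.1 and 9.2, assuming records are assigned to shards by id modulo the shard count. The `Record` and `Shard` types, the partition rule, and `build_shard` are hypothetical placeholders, not shardlake's actual types. The example uses `std::thread::scope` from the standard library to stay dependency-free; with rayon the same shape would likely be a `par_iter().map(...).collect()` over the buckets.

```rust
use std::thread;

/// Hypothetical record type: an id plus its embedding vector.
#[derive(Debug, Clone, PartialEq)]
pub struct Record {
    pub id: u64,
    pub vector: Vec<f32>,
}

/// Hypothetical shard output: the ids that landed in this shard, sorted.
#[derive(Debug, Clone, PartialEq)]
pub struct Shard {
    pub shard_id: usize,
    pub ids: Vec<u64>,
}

/// Split the dataset into `num_shards` buckets by `id % num_shards`
/// (an assumed rule; any stable assignment would do).
pub fn partition(records: &[Record], num_shards: usize) -> Vec<Vec<Record>> {
    assert!(num_shards > 0, "need at least one shard");
    let mut buckets: Vec<Vec<Record>> = vec![Vec::new(); num_shards];
    for r in records {
        buckets[(r.id % num_shards as u64) as usize].push(r.clone());
    }
    buckets
}

/// Stand-in for the real per-shard index construction.
fn build_shard(shard_id: usize, records: &[Record]) -> Shard {
    let mut ids: Vec<u64> = records.iter().map(|r| r.id).collect();
    ids.sort_unstable();
    Shard { shard_id, ids }
}

/// Build every shard on its own scoped thread.
/// Handles are joined in shard order, so the result order does not
/// depend on which thread finishes first.
pub fn build_shards_parallel(buckets: &[Vec<Record>]) -> Vec<Shard> {
    thread::scope(|s| {
        let handles: Vec<_> = buckets
            .iter()
            .enumerate()
            .map(|(i, bucket)| s.spawn(move || build_shard(i, bucket)))
            .collect();
        handles
            .into_iter()
            .map(|h| h.join().expect("shard build panicked"))
            .collect::<Vec<Shard>>()
    })
}
```

In a distributed build, each worker would receive one or more buckets from `partition` and run `build_shard` locally; only the transport between workers differs.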

Definition of done

  • Local parallel builds are supported.
  • Distributed build outputs can be merged deterministically.
  • The CLI exposes a parallel build path.
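One way to read "merged deterministically" is that the final artifact must not depend on the order in which worker outputs arrive. The sketch below illustrates that property under stated assumptions: `ShardOutput`, `MergedIndex`, and `merge_shards` are hypothetical names, and the merged artifact is reduced to a flat id list for brevity.

```rust
use std::collections::BTreeMap;

/// Hypothetical output of one worker for one shard.
#[derive(Debug, Clone, PartialEq)]
pub struct ShardOutput {
    pub shard_id: u32,
    pub ids: Vec<u64>,
}

/// Hypothetical final artifact: shards in ascending id order, ids concatenated.
#[derive(Debug, PartialEq)]
pub struct MergedIndex {
    pub shard_order: Vec<u32>,
    pub ids: Vec<u64>,
}

/// Merge worker outputs so the result is independent of arrival order.
/// Outputs are keyed by shard id in a BTreeMap (sorted iteration), and
/// each shard's ids are sorted before concatenation.
pub fn merge_shards(outputs: Vec<ShardOutput>) -> Result<MergedIndex, String> {
    let mut by_id: BTreeMap<u32, Vec<u64>> = BTreeMap::new();
    for out in outputs {
        let shard_id = out.shard_id;
        if by_id.insert(shard_id, out.ids).is_some() {
            // Two workers reporting the same shard is treated as an error here.
            return Err(format!("duplicate output for shard {shard_id}"));
        }
    }
    let mut merged = MergedIndex { shard_order: Vec::new(), ids: Vec::new() };
    for (shard_id, mut ids) in by_id {
        ids.sort_unstable();
        merged.shard_order.push(shard_id);
        merged.ids.extend(ids);
    }
    Ok(merged)
}
```

Feeding the same shard outputs in any order yields an identical `MergedIndex`, which makes the property easy to check in a test.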

Child issue breakdown

Dependency summary

Dependency graph

#75   #76
  \     |
   \   #77
    \ /
    #78
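Assuming the graph reads top to bottom, with an issue lower in the diagram depending on the issues above it (so #75 and #77 feed into #78, and #76 feeds into #77), a valid work order can be derived with Kahn's algorithm. The edge list below is that reading of the diagram, not something the issue states explicitly, and `topo_order` is a hypothetical helper.

```rust
use std::collections::{BTreeMap, BTreeSet};

/// Return a topological order for edges given as (prerequisite, dependent),
/// or None if the graph contains a cycle. Ties are broken by smallest
/// issue number so the result is stable.
pub fn topo_order(edges: &[(u32, u32)]) -> Option<Vec<u32>> {
    let mut indegree: BTreeMap<u32, usize> = BTreeMap::new();
    let mut dependents: BTreeMap<u32, Vec<u32>> = BTreeMap::new();
    for &(before, after) in edges {
        indegree.entry(before).or_insert(0);
        *indegree.entry(after).or_insert(0) += 1;
        dependents.entry(before).or_default().push(after);
    }

    // Start with every node that has no prerequisites.
    let mut ready: BTreeSet<u32> = indegree
        .iter()
        .filter(|&(_, &d)| d == 0)
        .map(|(&n, _)| n)
        .collect();

    let mut order = Vec::with_capacity(indegree.len());
    while let Some(node) = ready.pop_first() {
        order.push(node);
        if let Some(nexts) = dependents.get(&node) {
            for &next in nexts {
                let d = indegree.get_mut(&next).expect("node was registered");
                *d -= 1;
                if *d == 0 {
                    ready.insert(next);
                }
            }
        }
    }
    // If some node never reached indegree 0, there is a cycle.
    (order.len() == indegree.len()).then_some(order)
}
```

Under that reading, `topo_order(&[(75, 78), (76, 77), (77, 78)])` places #75 and #76 first (they can proceed in parallel), then #77, then #78.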

Labels

epic (Top-level epic issue)
