Skip to content

Add a bulk-loading mode to indexes #97534

Open
@jpountz

Description

@jpountz

Description

It is a frequent use-case to have an initial load of data when no searches are expected, followed by rare updates but heavy searches.

For such use-cases, it would be interesting to tune Elasticsearch appropriately for each of these two mode, e.g. the bulk load mode could:

  • increase the merge factor from 10 to 32
  • increase the flush interval/size to reduce segment flushing
  • disable scheduled refreshes

And then we could also specialize the rare-update/frequent-search use-case. In addition to bringing above values back to normal:

  • increase the min merge segment size from 2MB to a much higher value like 100MB

It's possible to do all these things manually today already, but it would be nice to package it better so that there's a single setting that needs to be updated to change the index "mode".

Metadata

Metadata

Assignees

No one assigned

    Labels

    :Distributed Indexing/EngineAnything around managing Lucene and the Translog in an open shard.>enhancementTeam:Distributed (Obsolete)Meta label for distributed team (obsolete). Replaced by Distributed Indexing/Coordination.

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions