
Add a MapDB implementation of CitationCache #234

Draft
sstults wants to merge 12 commits into adsabs:main from sstults:sstults-mapdbcache


sstults commented Apr 28, 2025

One limitation of our current cache is that it is held entirely in memory. This was an intentional design decision that favors query speed over hardware cost. Another drawback of this approach is that each cache instance is only aware of the Lucene documents in its local index. To deploy Montysolr in a multi-shard collection, we will need a cache keyed on the external document identifiers, which are unique across the collection, rather than on the internal Lucene docids, which are unique only within a shard.

This PR is a step toward expanding the number of documents that can be indexed by reducing the per-shard memory requirements.
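
To illustrate the identifier point above, here is a minimal sketch of a citation cache keyed on external identifiers instead of Lucene docids. It is not the code in this PR and not Montysolr's `CitationCache` interface: the class name `ExternalIdCitationCache`, its methods, and the `paper-1`/`paper-2` identifiers are all hypothetical. It uses plain `java.util` maps so it runs without dependencies. A disk-backed map such as one provided by MapDB could presumably stand in for the in-memory maps, but that substitution is not shown here.

```java
import java.util.Collections;
import java.util.HashMap;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

/**
 * Hypothetical sketch: a citation cache keyed on external document
 * identifiers (unique across the collection) rather than on internal
 * Lucene docids (unique only within one shard).
 */
public class ExternalIdCitationCache {
    // citing identifier -> identifiers it references
    private final Map<String, Set<String>> references = new HashMap<>();
    // cited identifier -> identifiers that cite it
    private final Map<String, Set<String>> citations = new HashMap<>();

    /** Records that citingId references citedId, updating both directions. */
    public void addReference(String citingId, String citedId) {
        references.computeIfAbsent(citingId, k -> new TreeSet<>()).add(citedId);
        citations.computeIfAbsent(citedId, k -> new TreeSet<>()).add(citingId);
    }

    /** Returns the identifiers referenced by id (empty if none are known). */
    public Set<String> getReferences(String id) {
        return Collections.unmodifiableSet(
                references.getOrDefault(id, Collections.emptySet()));
    }

    /** Returns the identifiers that cite id (empty if none are known). */
    public Set<String> getCitations(String id) {
        return Collections.unmodifiableSet(
                citations.getOrDefault(id, Collections.emptySet()));
    }

    public static void main(String[] args) {
        ExternalIdCitationCache cache = new ExternalIdCitationCache();
        // The two documents may live on different shards; because the keys
        // are collection-wide identifiers, the link can still be recorded.
        cache.addReference("paper-2", "paper-1");
        System.out.println(cache.getCitations("paper-1"));
        System.out.println(cache.getReferences("paper-2"));
    }
}
```

Because the keys do not depend on which shard holds a document, a lookup like `getCitations("paper-1")` would return the same answer no matter which shard serves it, which is the property a docid-keyed cache cannot offer.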
