Skip to content

Optimise indexing of linked resources #1014

@PascalRepond

Description

@PascalRepond

How it works

When indexing a resource (user, deposit, document, etc.), SONAR dumps all the fields of the linked resource in its index. This is not optimal as there are unnecessary data in indexes and sometimes twice!

See this user, for example: https://sonar.rero.ch/api/users/?q=309638

ALL fields of the organisation resource have been dumped twice: organisation + subdivision.organisation, most of this data is totally useless in this index.

The same problem happens with deposits.

Improvement suggestion

To be specified

Go through all resources elasticsearch dumpers and be mindful of which linked data really needs to be dumped (only what is useful for search)

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementEnhancement of an existing feature

    Type

    No type

    Projects

    Status

    Inbox

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions