Skip to content

Fix timeout issue during the sim1_postprocess_s1_e1_filter_input phase #434

Description

@marekhorst

Originally reported in: openaire/iis#1326

Documents similarity algorithm fails after running it on a non-deduplicated OpenAIRE Graph counting 300M of publications (deduped graph included 200M).

After in depth inspection covered by the openaire/iis#1326 (comment) it turned out we need to modify documents similarity sources by increasing allowed timeout value which should be defined in sim1-postprocess-s1-e1-filter-sims.pig PIG script.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Fields

    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions