[Late Q4 2025] Publication curation backlog + mop up #223

@aclayton555

This ticket emerges from discussions in #218. Specifically, since we stopped running the manifest trimming script, the publication crawler has been surfacing many more publications. Each of these needs a triage step to determine why its PMID is being picked up, which is usually one of a few scenarios:

  • The PMID is picked up for no known reason and is already fully curated (possibly a bug). No action needed other than ignoring it in the latest curation cycle.
  • The PMID is picked up and has been only partially curated due to closed access, likely listed with "Pending Annotation" for most fields. Action needed: update the publication with annotations if they are now available.
  • The PMID is picked up and has not been curated (i.e. a hot-off-the-press publication). Action needed: curate the publication (this is the routine monthly curation).
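
The three scenarios above could be sketched as a small triage function. This is only an illustration, not the crawler's actual logic: the `triage_pmid` name, the `Triage` enum, and the shape of `curated_records` (a dict mapping PMID to a dict of annotation fields) are all assumptions made for the example. Only the "Pending Annotation" placeholder comes from this ticket. The sketch also flags a record as partially curated if any field is pending, which is a simplification of "most fields".

```python
from enum import Enum

# Placeholder value quoted in this ticket for fields not yet annotated.
PENDING = "Pending Annotation"


class Triage(Enum):
    IGNORE = "already fully curated; ignore in this curation cycle"
    UPDATE = "partially curated; add annotations if now available"
    CURATE = "not yet curated; routine monthly curation"


def triage_pmid(pmid, curated_records):
    """Classify a PMID surfaced by the crawler.

    `curated_records` is a hypothetical mapping of PMID -> dict of
    annotation fields for publications that already have an entry.
    """
    record = curated_records.get(pmid)
    if record is None:
        # No existing entry: a new publication that needs curation.
        return Triage.CURATE
    if any(value == PENDING for value in record.values()):
        # Assumption: any pending field marks the record as partial.
        return Triage.UPDATE
    # Entry exists with no pending fields: nothing to do this cycle.
    return Triage.IGNORE


# Made-up records, for illustration only.
example_records = {
    "111": {"assay": "RNA-seq", "tissue": "Breast"},
    "222": {"assay": PENDING, "tissue": PENDING},
}
results = {p: triage_pmid(p, example_records) for p in ("111", "222", "333")}
```

Any PMID that does not fit one of these three cases would still need manual review, since other scenarios are possible.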

Other scenarios are also possible. Regardless, we already know as of the 25-9 sprint that this has created a backlog of publications needing some level of curation, and that the backlog is much larger than initially expected.

Suggest that we aim to tackle this in late Q4 2025 alongside a big curation tidy-up of datasets, as mentioned in mc2-center/mc2-center-dcc#114.
