-
Couldn't load subscription status.
- Fork 1
Description
This ticket emerges from discussions in #218 . Specifically, since we stopped running the manifest trimming script, there are many more publications emerging via the publication crawler that need to undergo some level of triage step to understand if a PMID is being picked up in the crawler due to one of a few scenarios:
- Is being picked up for no known reason and the PMID is indeed already fully curated (maybe a bug) - no action needed other than ignore in the latest curation cycle
- Is being picked up and the PMID has been partially curated due to closed access. Likely listed with "Pending Annotation" for most fields - action needed to update publication with annotations if they are now available
- Is being picked up and the PMID has not been curated (i.e. hot off the press publication) - action needed to curate publications (this is the routine monthly curation)
Other scenarios are also possible. Regardless, we already know as of the 25-9 sprint that this has created a backlog of many more publications than initially expected that need to undergo some level of curation.
Suggest that we aim to tackle this in late Q4 2025 alongside a big curation tidy of datasets, as mentioned in mc2-center/mc2-center-dcc#114