Skip to content

Optimize data export to wikidata and wikipedia #942

@sofialeksell2406-collab

Description

DOD:

  • No duplicate entities: A company that is already on wikidata and is updated in our script should not be uploaded again.
  • No incorrect removals: The script should not remove entities that were actually meant to be published. If data from the pipeline is validated and different from what is on wikidata update wikidata.
  • No missing data - Look into small companies and make sure they are present in wikidata.

Maybe add a validation script going through these steps.

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

Status

In review

Relationships

None yet

Development

No branches or pull requests

Issue actions