Skip to content

Importer creates duplicate citizenship properties for same country with different time periods #113

@monneyboi

Description

@monneyboi

Problem

The importer creates multiple citizenship properties for the same politician-country combination when Wikidata has multiple statements with different start/end time qualifiers. This results in 30 duplicate entries in our database.

Examples

San Marino (Q238) is particularly affected with 7 politicians having duplicate citizenships.

Expected Behavior

We should only store one citizenship property per politician-country combination. The time period qualifiers (start/end dates) are not relevant for our use case.

Suggested Fix

During import, deduplicate citizenship statements by (politician_id, entity_id) - only import the first one encountered or the one without time qualifiers.

Metadata

Metadata

Assignees

No one assigned

    Labels

    loomPoliloom core project issues

    Type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions