Skip to content

Custom predicates for blocking #1223

@lcpprop

Description

@lcpprop

I have a really large dataset > 750 000 rows. I want to create a custom predicate where columns1 and columns3 should be the same across comparison pairs.

I could define them as such using dedupe predicate classes:

predicates.CompoundPredicate([
predicates.wholeFieldPredicate('pltr_gross_amt'),
predicates.wholeFieldPredicate('pltr_tran_date')
])

Now, how do I implement my custom predicate, such that all uncertain rows that are shown would adhere to this predicate? Could I overwrite default predicates?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions