[FEATURE]: Add new check to identify PII #309

Open
@mwojtyczka

Description

Is there an existing issue for this?

  • I have searched the existing issues

Problem statement

Discovering PII is not a quality check per se, but it is useful for data classification. Once sensitive data has been identified, users can decide how to handle it, for example by quarantining or flagging the affected records.

Proposed Solution

Add is_pii as a new check, or provide an example in the documentation. If third-party libraries are required (e.g. Presidio Analyzer), it is probably better to document this as an example of a custom check, to avoid adding extra dependencies to the project.
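
To make the idea concrete, here is a minimal, dependency-free sketch of what such a check could look like. It is not the project's API: the names find_pii, is_pii, split_rows and PII_PATTERNS are hypothetical, and the two regular expressions are only a stand-in for a real detector such as Presidio Analyzer, which recognizes far more entity types. It also shows the "quarantine or flag" decision from the problem statement on plain Python dictionaries rather than on a DataFrame.

```python
import re

# Hypothetical patterns, for illustration only. A real detector
# (e.g. Presidio Analyzer) would replace this dictionary.
PII_PATTERNS = {
    "EMAIL": re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}"),
    "US_SSN": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
}


def find_pii(value):
    """Return the sorted names of the patterns that match the value."""
    if value is None:
        return []
    text = str(value)
    return sorted(
        name for name, pattern in PII_PATTERNS.items() if pattern.search(text)
    )


def is_pii(value):
    """Return True if any of the illustrative patterns matches the value."""
    return bool(find_pii(value))


def split_rows(rows, column):
    """Split rows into (clean, flagged) based on the content of one column."""
    clean, flagged = [], []
    for row in rows:
        target = flagged if is_pii(row.get(column)) else clean
        target.append(row)
    return clean, flagged


rows = [
    {"id": 1, "note": "call back tomorrow"},
    {"id": 2, "note": "reach me at jane.doe@example.com"},
    {"id": 3, "note": "SSN 123-45-6789 on file"},
]
clean, flagged = split_rows(rows, "note")
```

In this sketch the flagged rows could be written to a quarantine location while the clean rows continue downstream. Keeping the detector behind a single function such as find_pii would let a documentation example swap in a third-party library without changing the rest of the check.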

Additional Context

No response
