Skip to content

[FEATURE]: Add quality checks workflow #173

Open
@mwojtyczka

Description

@mwojtyczka

Is there an existing issue for this?

  • I have searched the existing issues

Problem statement

When DQX is installed in the workspace, we provide profiling workflow/job to generate quality rule candidates. After that user can use the generated checks to do quality checks. However, performing the actual quality checking requires code level integration. In some cases, users may want to just do quality checking on the existing input data without performing any transformations.

Proposed Solution

Add quality checks workflow/job, that can be run to assess data quality of a dataset and write out quarantine and output to locations as specified in the config. This won't give the possibility to perform any data transformations as part of the quality checking but it would be useful in a lot of scenarios.

User journey:

  1. (option) Profiler job to profile the data and generate quality rule candidates
  2. Run Quality Check job to assess data quality of the input data, and persist the results as per config
  3. Refresh quality dashboard

Additional Context

No response

Metadata

Metadata

Assignees

Labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions