Lack of understandable documentation for Custom Data Validation #246

Closed
@sachn1

URL(s) with the issue:

  1. https://www.tensorflow.org/tfx/data_validation/custom_data_validation
  2. https://www.tensorflow.org/tfx/data_validation/api_docs/python/tfdv/validate_statistics

Description of issue (what needs changing):

Clear description:

  1. There is no end-to-end workflow for using custom_validation_config.proto. The documentation should explain:
    1. How should the protobuf file be written?
    2. Where should the SQL statements be written?
    3. How is the custom protobuf file passed as the value of the custom_validation_config argument in tfdv.validate_statistics()? Some sources suggest that the protobuf file has to be compiled to a Python module and ValidationConfig imported from it, but I found no documentation of this in the TensorFlow, TFDV, or TFX docs.
    4. https://www.tensorflow.org/tfx/data_validation/api_docs/python/tfdv/validate_statistics lists a custom_validation_config argument but does not document how a custom config can be loaded and passed to it.
  2. What is the difference between using tensorflow-data-validation directly and using tfx.ExampleValidator?

Correct links

The link to custom_validate_statistics on https://www.tensorflow.org/tfx/data_validation/custom_data_validation is broken: it returns a 404 (page not found) error.
