validate croissant.json warning #963

Description

@ihsaan-ullah

I have a very simple JSON file and I am validating it with the mlcroissant package:

mlcroissant validate --jsonld croissant.json

I get this warning about context:

WARNING: The JSON-LD `@context` is not standard. Refer to the official @context (e.g., from the example datasets in https://github.com/mlcommons/croissant/tree/main/datasets/1.0). The different keys are: {'dct', 'isLiveDataset', 'sc', 'dataType', 'conformsTo', 'transform', 'format', 'samplingRate', 'subField', 'md5', 'recordSet', 'citeAs', 'examples', 'fileObject', 'fileProperty', 'key', 'jsonPath', 'repeated', 'source', 'parentField', 'field', 'references', 'replace', 'column', 'cr', 'includes', 'fileSet', 'extract', 'separator', 'rai', 'data', 'path', 'regex'}

I know that I can ignore this warning, but is there any documentation or recommendation about these fields and why they should be added to the @context?
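
For reference, here is a minimal sketch of how one might check which of the reported keys a file's `@context` does not define. It uses only the Python standard library and does not call mlcroissant. The key set is copied from the warning text above; the helper names (`context_keys`, `missing_context_keys`, `load_and_check`) are hypothetical, and it assumes the warning lists keys that the validator expects but the file lacks, which the message itself does not spell out.

```python
import json

# Keys copied verbatim from the warning message in this issue.
REPORTED_KEYS = {
    "dct", "isLiveDataset", "sc", "dataType", "conformsTo", "transform",
    "format", "samplingRate", "subField", "md5", "recordSet", "citeAs",
    "examples", "fileObject", "fileProperty", "key", "jsonPath", "repeated",
    "source", "parentField", "field", "references", "replace", "column",
    "cr", "includes", "fileSet", "extract", "separator", "rai", "data",
    "path", "regex",
}


def context_keys(doc):
    """Collect the term names defined in a JSON-LD document's @context.

    @context may be a dict, a string (remote context), or a list mixing
    both; only dict entries define terms inline, so only those are read.
    """
    ctx = doc.get("@context", {})
    entries = ctx if isinstance(ctx, list) else [ctx]
    keys = set()
    for entry in entries:
        if isinstance(entry, dict):
            keys.update(entry)
    return keys


def missing_context_keys(doc, expected=REPORTED_KEYS):
    """Return the expected keys that the document's @context does not define."""
    return set(expected) - context_keys(doc)


def load_and_check(path):
    """Hypothetical convenience wrapper: load a JSON file and check it."""
    with open(path, encoding="utf-8") as f:
        return missing_context_keys(json.load(f))
```

Calling `sorted(load_and_check("croissant.json"))` would list the keys from the warning that are absent from the file, which makes it easier to compare against the `@context` of one of the example datasets linked in the warning.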
