Updated introduction of Responsible AI in the 1.1 spec. #984

JoanGi · 2025-12-04T14:46:48Z

Since the first release, the approach of the Responsible AI (RAI) extension has evolved substantially.

Croissant now incorporates two core mechanisms for the responsible use and sharing of data directly into the main specification.
Concurrently, the Croissant RAI extension continues to exist; it serves as a machine-readable format for data cards and is intended to be an incubator for new RAI trends emerging from the community.

This pull request updates the presentation of the RAI use case to maintain consistency with this revised approach.

github-actions · 2025-12-04T14:46:59Z

MLCommons CLA bot All contributors have signed the MLCommons CLA ✍️ ✅

benjelloun · 2025-12-04T15:40:46Z

docs/croissant-spec-draft.md

+2. **Machine-readable RAI Data Documentation**: This specification proposes a machine-readable vocabulary for capturing and publishing existing Responsible AI (RAI) documentation solutions (such as [Data Cards](https://dl.acm.org/doi/pdf/10.1145/3531146.3533231)), thereby streamlining their publishing, sharing, discovery, and reuse. Further details are available in the [Croissant RAI specification](http://mlcommons.org/croissant/RAI/1.1).
+
+
+We welcome additional extensions from the community to meet the needs particular and resposible AI aspects of specific data modalities (e.g. audio or video) and domains (e.g. geospatial, life sciences, cultural heritage).


Typo: resposible -> responsible.

benjelloun · 2025-12-04T15:42:37Z

docs/croissant-spec-draft.md

+1. **Data use and dissemination**: It provides a [set of mechanisms](#responsible-ai-and-governance) to enable the responsible use and dissemination of data. This is achieved by offering a machine-actionable representation of the data's provenance, lineage, and usage conditions at various levels of granularity. These mechanisms are built upon the integration of W3C standards (such as PROV-O and ODRL), ensuring compatibility with existing solutions.

-2. It records at a granular level how a dataset was created, processed and enriched throughout its lifecycle – this process is meant to be automated as much as possible by integrating Croissant with popular ML frameworks. By allowing the metadata to be loaded automatically, Croissant also enables developers to compute RAI metrics automatically and systematically, identifying potential data quality issues to be fixed.
+2. **Machine-readable RAI Data Documentation**: This specification proposes a machine-readable vocabulary for capturing and publishing existing Responsible AI (RAI) documentation solutions (such as [Data Cards](https://dl.acm.org/doi/pdf/10.1145/3531146.3533231)), thereby streamlining their publishing, sharing, discovery, and reuse. Further details are available in the [Croissant RAI specification](http://mlcommons.org/croissant/RAI/1.1).


The RAI 1.1 spec doesn't exist yet... Do you want to link to RAI 1.0 for now?

benjelloun · 2025-12-04T15:45:26Z

docs/croissant-spec-draft.md

 This is how Croissant helps address RAI:

-1. It proposes a machine-readable way to capture and publish metadata about ML datasets – this makes existing documentation solutions like [Data Cards](https://sites.research.google/datacardsplaybook/) easier to publish, share, discover, and reuse;
+1. **Data use and dissemination**: It provides a [set of mechanisms](#responsible-ai-and-governance) to enable the responsible use and dissemination of data. This is achieved by offering a machine-actionable representation of the data's provenance, lineage, and usage conditions at various levels of granularity. These mechanisms are built upon the integration of W3C standards (such as PROV-O and ODRL), ensuring compatibility with existing solutions.


Let's just say provenance instead of provenance, lineage.

Also, can you link to the relevant sections of the spec?

benjelloun · 2025-12-04T15:53:32Z

docs/croissant-spec-draft.md

+1. **Data use and dissemination**: It provides a [set of mechanisms](#responsible-ai-and-governance) to enable the responsible use and dissemination of data. This is achieved by offering a machine-actionable representation of the data's provenance, lineage, and usage conditions at various levels of granularity. These mechanisms are built upon the integration of W3C standards (such as PROV-O and ODRL), ensuring compatibility with existing solutions.

-2. It records at a granular level how a dataset was created, processed and enriched throughout its lifecycle – this process is meant to be automated as much as possible by integrating Croissant with popular ML frameworks. By allowing the metadata to be loaded automatically, Croissant also enables developers to compute RAI metrics automatically and systematically, identifying potential data quality issues to be fixed.
+2. **Machine-readable RAI Data Documentation**: This specification proposes a machine-readable vocabulary for capturing and publishing existing Responsible AI (RAI) documentation solutions (such as [Data Cards](https://dl.acm.org/doi/pdf/10.1145/3531146.3533231)), thereby streamlining their publishing, sharing, discovery, and reuse. Further details are available in the [Croissant RAI specification](http://mlcommons.org/croissant/RAI/1.1).


Hmmm.... How is this bullet point different from the one above? Aren't they both about creating machine readable RAI information?

JoanGi · 2025-12-05T10:41:45Z

I've uploaded a shortened version with only one bullet point and added graphical support from the PROV example to be consistent with other use cases. Please check it @benjelloun.

Updated introduction of Responsible AI in the 1.1 spec.

4029099

JoanGi requested a review from benjelloun December 4, 2025 14:46

JoanGi requested a review from a team as a code owner December 4, 2025 14:46

Corrected type in subtitle 1 of Responsible AI intro.

27a2fc4

benjelloun reviewed Dec 4, 2025

View reviewed changes

Shortened RAI intro, added provenance image.

851f273

benjelloun approved these changes Dec 5, 2025

View reviewed changes

Merge branch 'main' into main

3353e9b

JoanGi merged commit df30323 into mlcommons:main Dec 9, 2025
12 checks passed

github-actions bot locked and limited conversation to collaborators Dec 9, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Updated introduction of Responsible AI in the 1.1 spec. #984

Updated introduction of Responsible AI in the 1.1 spec. #984

Uh oh!

JoanGi commented Dec 4, 2025

Uh oh!

github-actions bot commented Dec 4, 2025 •

edited

Loading

Uh oh!

benjelloun Dec 4, 2025

Uh oh!

benjelloun Dec 4, 2025

Uh oh!

benjelloun Dec 4, 2025

Uh oh!

benjelloun Dec 4, 2025

Uh oh!

benjelloun Dec 4, 2025

Uh oh!

JoanGi commented Dec 5, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		2. Machine-readable RAI Data Documentation: This specification proposes a machine-readable vocabulary for capturing and publishing existing Responsible AI (RAI) documentation solutions (such as [Data Cards](https://dl.acm.org/doi/pdf/10.1145/3531146.3533231)), thereby streamlining their publishing, sharing, discovery, and reuse. Further details are available in the [Croissant RAI specification](http://mlcommons.org/croissant/RAI/1.1).


		We welcome additional extensions from the community to meet the needs particular and resposible AI aspects of specific data modalities (e.g. audio or video) and domains (e.g. geospatial, life sciences, cultural heritage).

Updated introduction of Responsible AI in the 1.1 spec. #984

Updated introduction of Responsible AI in the 1.1 spec. #984

Uh oh!

Conversation

JoanGi commented Dec 4, 2025

Uh oh!

github-actions bot commented Dec 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

benjelloun Dec 4, 2025

Choose a reason for hiding this comment

Uh oh!

benjelloun Dec 4, 2025

Choose a reason for hiding this comment

Uh oh!

benjelloun Dec 4, 2025

Choose a reason for hiding this comment

Uh oh!

benjelloun Dec 4, 2025

Choose a reason for hiding this comment

Uh oh!

benjelloun Dec 4, 2025

Choose a reason for hiding this comment

Uh oh!

JoanGi commented Dec 5, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

github-actions bot commented Dec 4, 2025 •

edited

Loading