Skip to content

FAIR4AI Working Group repo for organization, planning, and development.

Notifications You must be signed in to change notification settings

Imageomics/FAIR4AI-checklists

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 

Repository files navigation

FAIR4AI-checklists

FAIR4AI Working Group repo for organization, planning, and development.

Current Resources/Drafts:

Guiding Questions and Considerations

  • What is the distinction between FAIR and FAIR4AI?
    • Is this different when thinking about biodiversity data?
    • How does it vary by data type/modality?
  • What is the distinction between FAIR4AI and AI-Ready?
  • What are the requirements and expectations for data providers as compared to users?

Potential Answers or Framing

FAIR4AI is to the point that it can be fed into a pipeline to have an output ready to put into a model. This can be the "AI-enabled" step.

We are working from the idea that AI-Ready is meant as this data can be fed directly into a model, so data need not be published in this format, just in a format that it is "reasonable" to get there. An example being the TreeOfLife-200M dataset is not AI-Ready, but using existing pipelines (once all data is downloaded), it can be transformed into the webdataset format used to train BioCLIP 2. Under this definition, the constituent parts of TreeOfLife-200M are not FAIR4AI.

Footnotes

  1. For context, the Imageomics metadata checklist is mostly a generalized version of the Data Card checklist. They fit in the Imageomics Project Lifecycle as part of the iterative process of filling out a dataset card and updating on GitHub with checklists in the GitHub project repo issue.

  2. There is also a newer version on GitHub. I believe we incorporated both.

About

FAIR4AI Working Group repo for organization, planning, and development.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published