-
Notifications
You must be signed in to change notification settings - Fork 0
Add FAIR4AI evaluation agent and documentation #10
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
Initial commit of the FAIR4AI evaluation agent, including core scripts, GUI, example usage, requirements, form checklist, template outputs, and sample evaluation results. Provides full documentation in README.md and an environment variable template for API configuration.
|
Could you remove the |
|
ah, thanks, will do! |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please see in-line suggestions and comments. I'll do the docs extraction we discussed.
Done! I think we should pull all the .env setup and API token instructions to a docs page too, then we can just link to it.
| @@ -0,0 +1,991 @@ | |||
| # FAIR4AI Automated Evaluation Agent | |||
|
|
|||
| An AI-powered system that automatically evaluates datasets against the FAIR4AI checklist using Large Language Models (LLMs). The agent analyzes metadata files or dataset landing pages to provide comprehensive FAIR (Findable, Accessible, Interoperable, Reusable) assessments. | |||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
| An AI-powered system that automatically evaluates datasets against the FAIR4AI checklist using Large Language Models (LLMs). The agent analyzes metadata files or dataset landing pages to provide comprehensive FAIR (Findable, Accessible, Interoperable, Reusable) assessments. | |
| An AI-powered system that automatically evaluates datasets against the [FAIR4AI checklist form](https://forms.gle/P3MWmJJAi5vq248E8) using Large Language Models (LLMs). The agent checks metadata files or dataset landing pages for the information requested in the checklist to provide comprehensive FAIR (Findable, Accessible, Interoperable, Reusable) assessments. |
| - Generates structured outputs in JSON and CSV formats with FAIR score estimates | ||
|
|
||
| **Key Features:** | ||
| - ✅ Multiple LLM providers (OpenAI, Azure OpenAI, Anthropic Claude) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
| - ✅ Multiple LLM providers (OpenAI, Azure OpenAI, Anthropic Claude) | |
| - ✅ Multiple LLM providers supported (OpenAI, Azure OpenAI, Anthropic Claude) |
| **System Requirements:** | ||
| - Python 3.8 or higher | ||
| - API key for OpenAI, Azure OpenAI, or Anthropic | ||
| - Dependencies: pandas, openai, anthropic, requests, beautifulsoup4 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
| - Dependencies: pandas, openai, anthropic, requests, beautifulsoup4 |
This is covered by the install and doesn't need to be included in the README
|
|
||
| Or install individually: | ||
| ```bash | ||
| pip install openai pandas requests beautifulsoup4 # For OpenAI | ||
| # OR | ||
| pip install anthropic pandas requests beautifulsoup4 # For Anthropic Claude | ||
| ``` |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
| Or install individually: | |
| ```bash | |
| pip install openai pandas requests beautifulsoup4 # For OpenAI | |
| # OR | |
| pip install anthropic pandas requests beautifulsoup4 # For Anthropic Claude | |
| ``` |
This seems like an unnecessary complication -- I don't think there should be issues installing openai and anthropic with the requirements file.
| ## Quick Start | ||
|
|
||
| ### 1. Install Dependencies | ||
|
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
| We recommend creating a [virtual environment](https://imageomics.github.io/Collaborative-distributed-science-guide/wiki-guide/Virtual-Environments/) in which to install the requirements as described below. | |
| 5. Save all extracted metadata to output directory | ||
| 6. Run the evaluation using extracted metadata | ||
|
|
||
| Supported URLs include NEON, DataONE, Zenodo, Dryad, and other repositories with structured metadata. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
| Supported URLs include NEON, DataONE, Zenodo, Dryad, and other repositories with structured metadata. | |
| Supported URLs include those for NEON, DataONE, Zenodo, Dryad, Hugging Face Datasets, and other repositories with structured metadata. |
| - **Progress Bar**: Shows completion percentage | ||
| - **Log Window**: Real-time messages and status updates | ||
|
|
||
| #### Example Workflow: URL-Based Evaluation |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Could be cool to add a screenshot or two of the GUI, but that'd be for bigger docs. If this is more fleshed-out and public then we could use MkDocs to set something up (more user-friendly than a super long README).
| - `options`: Multiple choice options (pipe-separated) | ||
| - `required`: Whether required (TRUE/FALSE) | ||
|
|
||
| Default form: `form_ai_checklist_automated.csv` (135 questions across 9 sections) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
| Default form: `form_ai_checklist_automated.csv` (135 questions across 9 sections) | |
| Default form: `form_ai_checklist_automated.csv` (135 questions across 9 sections) | |
| The code used to create this file from the [FAIR4AI Checklist form](), is in the [checklist workflow directory](../checklist-workflow/00_README.md). |
| ├── {prefix}.json # FAIR4AI evaluation results | ||
| └── {prefix}.csv # FAIR4AI evaluation results | ||
| ``` | ||
|
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
| `prefix` is the name passed to the `--output` parameter. | |
also ignore Mac system files
a6f39cb to
f787705
Compare
Initial commit of the FAIR4AI evaluation agent, including core scripts, GUI, example usage, requirements, form checklist, template outputs, and sample evaluation results. Provides full documentation in README.md and an environment variable template for API configuration.