@aditigopalan aditigopalan commented May 22, 2025

AI Analysis (Fixes #40, #41)

AI analysis is implemented through an AWS Bedrock and Synapse integration. A custom AWS Bedrock agent, hosted as a Synapse agent, is queried through the Synapse Python client's `Agent` class. This approach avoids the need for direct AWS API tokens; however, users still need a Synapse account and a personal access token (PAT) to use this feature (instructions are in the README).

Key Features

  • Custom AWS Bedrock Agent: The agent, named cc-toolkit-agent, produces a summary report in HTML format, offering detailed insights from the repository analysis.
  • Synapse Integration: Uses the Synapse Python client for seamless interaction with the hosted agent.
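The hand-off to the hosted agent can be sketched roughly as follows. This is a minimal illustration, not the toolkit's actual code: `build_prompt` is a hypothetical helper, and the assembled prompt would then be sent to the cc-toolkit-agent via the Synapse Python client's `Agent` class as described in the README.

```python
import json

def build_prompt(almanack_results: dict, joss_results: dict) -> str:
    """Assemble a single prompt string from the two upstream analysis outputs.

    The resulting string would be passed to the Synapse-hosted Bedrock agent
    (via synapseclient's Agent class -- see the README for the exact call).
    """
    return (
        "Summarize the quality of this repository as an HTML report.\n"
        f"Almanack metrics: {json.dumps(almanack_results)}\n"
        f"JOSS checks: {json.dumps(joss_results)}"
    )
```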

README Enhancements (Fixes #26)

Key Changes

  • Input CSV Example: Added an example of input.csv in a table/markdown format for better understanding.
  • Syntax Alerts: Highlighted areas still in development to set clear expectations for users.
  • Synapse Upload Requirements: Detailed requirements for uploading to Synapse, including necessary credentials and configuration files.
  • Summaries/Dependencies: Added output summaries and a dependency list so users have a clear picture of the toolkit's requirements and capabilities.
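For illustration only, an input.csv of the kind referenced above might look like the following (the column names here are hypothetical; the actual schema is documented in the README):

| repo_name | repo_url |
| --- | --- |
| example-tool | https://github.com/example-org/example-tool |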

JOSS Analysis & Testing (Fixes #14)

Key Features

Captured in JOSSAnalysis.nf:

  • README Content Testing: Analyzes README content for key sections, including statement of need, installation instructions, and example usage. Each section is scored based on completeness and clarity.
  • Requirements.txt Content Testing: Validates dependency file contents, ensuring proper specification of dependencies with versions. Similar checks are implemented for other languages.
  • Unit Tests Execution: The new TestExecutor module runs tests in appropriate containers, providing detailed results, including pass/fail counts and test framework used.
  • JOSS Review Criteria Integration: All tests are designed based on JOSS paper review criteria, with a scoring system that aligns with JOSS expectations.
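The README scoring described above can be sketched as follows. The section names, regexes, and equal weighting are illustrative stand-ins, not the actual rubric used in JOSSAnalysis.nf:

```python
import re

# Sections the JOSS review criteria expect a README to cover.
# The headings and patterns used by the real module may differ.
EXPECTED_SECTIONS = {
    "statement of need": r"statement of need",
    "installation": r"install",
    "example usage": r"usage|example|getting started",
}

def score_readme(readme_text: str) -> dict:
    """Return a per-section presence score (0 or 1) plus an overall fraction."""
    text = readme_text.lower()
    scores = {
        name: int(bool(re.search(pattern, text)))
        for name, pattern in EXPECTED_SECTIONS.items()
    }
    scores["overall"] = sum(scores.values()) / len(EXPECTED_SECTIONS)
    return scores
```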

Configuring CODEOWNERS (Fixes #43)

…odule
- Add proper error handling for missing status files
- Fix Python f-string formatting
- Improve file reading logic for better reliability
docs: update README with AI analysis details and contribution guidelines

- Clarify that AI analysis is optional and requires Synapse agent ID
- Add specific Synapse agent ID (LOWYSX3QSQ) in examples
- Reorder output files to highlight AI analysis as final report
- Add note about AI analysis providing qualitative summary
- Update output file descriptions to clarify metrics vs summary
- Add reference to new CONTRIBUTING.md file
- Rename SynapseAnalysis process to AIAnalysis for clarity
- Update module include statement to use new AIAnalysis module
- Remove Generate Report
- Update workflow documentation to reflect AI analysis step
- Add debug print statement for AI input tuple
- Keep Synapse agent ID requirement for AI analysis
- Maintain existing workflow structure and data flow
Create new AIAnalysis.nf module that:
- Uses Synapse agent to analyze repository quality
- Takes Almanack and JOSS results as input
- Generates qualitative analysis and recommendations
- Includes timeout handling (600s) for long-running analyses
- Provides detailed error handling and logging
- Outputs results in JSON format with repository-specific naming

The module integrates with Synapse's AI capabilities to provide:
- High-level summary of repository strengths and weaknesses
- Prioritized recommendations for improvement
- JOSS readiness assessment
- Specific action items for repository enhancement
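The timeout handling and repository-specific JSON output described above can be sketched like this. It is a simplified stand-in for the AIAnalysis.nf module's script: `query_agent` is a placeholder for the actual Synapse agent call, and the filename scheme is illustrative.

```python
import json
import re
from concurrent.futures import ThreadPoolExecutor
from concurrent.futures import TimeoutError as FuturesTimeout

AGENT_TIMEOUT_S = 600  # matches the 600s timeout mentioned above

def output_name(repo_url: str) -> str:
    """Derive a repository-specific JSON filename from the repo URL."""
    slug = re.sub(r"[^A-Za-z0-9]+", "_", repo_url.rstrip("/").split("://")[-1])
    return f"{slug}_ai_analysis.json"

def run_analysis(query_agent, prompt: str, repo_url: str) -> str:
    """Run the agent call with a hard timeout and write the result to JSON.

    Note: after a timeout the worker thread keeps running until the call
    returns; a subprocess would be needed for a hard kill.
    """
    with ThreadPoolExecutor(max_workers=1) as pool:
        future = pool.submit(query_agent, prompt)
        try:
            result = {"status": "ok", "analysis": future.result(timeout=AGENT_TIMEOUT_S)}
        except FuturesTimeout:
            result = {"status": "error", "reason": f"timed out after {AGENT_TIMEOUT_S}s"}
    path = output_name(repo_url)
    with open(path, "w") as fh:
        json.dump(result, fh, indent=2)
    return path
```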
- Add support for both JSON and CSV input formats
- Implement comprehensive README content analysis
- Add detailed dependency management assessment
- Improve test coverage evaluation
- Add scoring system for JOSS criteria
- Enhance error handling and logging
- Add support for multiple programming languages
- Implement detailed status reporting with improvement suggestions

Key improvements:
- Better handling of different input formats
- More thorough analysis of repository documentation
- Enhanced dependency checking across multiple languages
- Improved test result parsing and scoring
- Better error handling and reporting
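The dependency checking mentioned above could, for requirements.txt-style files, look like the sketch below. The version-specifier regex and the fractional score are illustrative assumptions, not the toolkit's actual rules:

```python
import re

def check_requirements(text: str) -> dict:
    """Check that each dependency line carries a version specifier."""
    pinned, unpinned = [], []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blanks and comments
        # Treat any PEP 440-style operator as "version specified".
        if re.search(r"(==|>=|<=|~=|!=|>|<)", line):
            pinned.append(line)
        else:
            unpinned.append(line)
    total = len(pinned) + len(unpinned)
    score = len(pinned) / total if total else 0.0
    return {"pinned": pinned, "unpinned": unpinned, "score": score}
```

Analogous checks for other ecosystems (e.g. package.json, DESCRIPTION) would follow the same pattern with format-specific parsing.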
@BWMac thank you for the suggestions! Ready for review again 🫡

@aditigopalan aditigopalan requested a review from BWMac May 27, 2025 16:26

@BWMac BWMac left a comment

Great work! Just one more comment but LGTM! 🔥 🔥 🔥

@aditigopalan aditigopalan merged commit e7d06f5 into main May 27, 2025
1 check passed
@aditigopalan aditigopalan deleted the feature/gpt-result-interpretation branch May 27, 2025 17:44