@chenliang15405
Collaborator

Description

Support creating an agent benchmark task that runs the Falcon text2sql evaluation dataset by invoking the agent remotely through its HTTP API.
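
For reviewers who want a picture of the flow, here is a minimal sketch of what remotely invoking an agent over HTTP for each benchmark case might look like. It is not the code in this PR: the request payload (`{"question": ...}`), the reply field (`"sql"`), and the names `Case`, `http_post_json`, `normalize`, and `run_benchmark` are all hypothetical, and the exact-string comparison is a simplification (a text2sql evaluation would more likely compare execution results).

```python
import json
from dataclasses import dataclass
from typing import Callable, Dict, List
from urllib import request


@dataclass
class Case:
    """One benchmark item: a natural-language question and its reference SQL."""
    question: str
    gold_sql: str


def http_post_json(url: str, payload: Dict, timeout: float = 30.0) -> Dict:
    """POST a JSON payload and decode the JSON reply (standard library only)."""
    req = request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with request.urlopen(req, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


def normalize(sql: str) -> str:
    """Lowercase, drop a trailing semicolon, and collapse whitespace."""
    return " ".join(sql.strip().rstrip(";").lower().split())


def run_benchmark(
    cases: List[Case],
    agent_url: str,
    post: Callable[[str, Dict], Dict] = http_post_json,
) -> Dict:
    """Send every question to the remote agent and score the returned SQL.

    The payload shape and the "sql" reply field are assumptions made for
    this sketch, not the agent API used by the project.
    """
    results = []
    for case in cases:
        try:
            reply = post(agent_url, {"question": case.question})
            predicted, error = str(reply.get("sql", "")), None
        except Exception as exc:  # record the failure and keep going
            predicted, error = "", str(exc)
        results.append({
            "question": case.question,
            "predicted_sql": predicted,
            "error": error,
            # Simplified scoring: normalized string equality with the gold SQL.
            "match": error is None
            and normalize(predicted) == normalize(case.gold_sql),
        })
    matched = sum(1 for r in results if r["match"])
    accuracy = matched / len(results) if results else 0.0
    return {"total": len(results), "matched": matched,
            "accuracy": accuracy, "results": results}
```

Passing the transport in as `post` keeps the loop testable without a running agent: a stub that returns canned replies can stand in for the HTTP call.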

How Has This Been Tested?

  • Step 1: Create an evaluation task and select the agent to evaluate
    20251208203842

  • Step 2: Wait for the execution to complete
    20251208203951

Snapshots:

Include snapshots for easier review.

Checklist:

  • My code follows the style guidelines of this project
  • I have already rebased the commits and made the commit messages conform to the project standard.
  • I have performed a self-review of my own code
  • I have commented my code, particularly in hard-to-understand areas
  • I have made corresponding changes to the documentation
  • Any dependent changes have been merged and published in downstream modules

alan.cl added 4 commits December 5, 2025 19:53
@github-actions github-actions bot added the enhancement New feature or request label Dec 8, 2025
Collaborator

@Aries-ckt Aries-ckt left a comment

LGTM

Collaborator

@fangyinc fangyinc left a comment

LGTM

@fangyinc fangyinc merged commit 19c2cee into eosphoros-ai:main Dec 10, 2025
3 checks passed

Labels

enhancement New feature or request

3 participants