Feature/tool retrieval by qchapp · Pull Request #16 · Imaging-Plaza/ai-agent

qchapp · 2026-01-28T12:24:20Z

This pull request introduces major improvements to the agent's retrieval capabilities, restructures the agent's tool interface, and standardizes import paths. The most significant changes include switching to embedding-based similarity for query expansion, adding iterative retrieval with retries and a new agent tool for alternative searches, and refactoring the agent's tool adapters to better handle context and metadata. Several bug fixes and configuration enhancements are also included.

Retrieval and Query Expansion Enhancements:

Replaced hard-coded synonym dictionaries with embedding-based similarity matching (BGE-M3) for dynamic query expansion. The system now automatically extracts and updates vocabulary from the catalog.
Added automatic retry logic in the retrieval pipeline: if the initial search yields fewer than 5 results, the system generates alternative queries using semantic neighbors and retries (up to 2 attempts).
Introduced a new search_alternative agent tool, allowing the agent to explicitly request up to 3 alternative searches per conversation, enabling agent-driven iterative refinement. [1] [2]

Agent and Tool Interface Refactoring:

Refactored agent tool adapters to inject image paths and format hints from AgentState, ensuring tools have necessary context without requiring the LLM to reason about file paths. The run_agent function now requires explicit image paths and automatically generates metadata/context for the LLM and tools. [1] [2]
Updated the agent's prompt and state management to support the new retrieval features, including similarity expansion, retries, and alternative search tool usage. [1] [2]
Increased tool call limits for certain tools (e.g., repo_info) and improved tool call logging structure. [1] [2]

Configuration and Import Path Improvements:

Standardized all import paths to use the ai_agent. prefix for consistency. [1] [2] [3] [4]
Updated config.yaml to allow easy switching between model versions (e.g., "gpt-5.1" and "gpt-4o").
Added a new configuration module (utils/config.py) using Pydantic for type-safe config loading and validation.

Bug Fixes:

Fixed a Pydantic forward reference error by reordering class definitions in schema.py.
Improved conversation context handling, ensuring the agent maintains history for natural follow-up requests.

Other Notable Changes:

Enhanced the rerank tool to use the original query (not expanded) and clarified reranking logic. [1] [2] [3]
Removed outdated test files and made minor dependency updates.

These changes collectively make the agent more robust, context-aware, and capable of sophisticated, iterative tool retrieval and selection.

Copilot

Pull request overview

This PR modernizes the retrieval stack and agent interface: retrieval is now embedding-based with retries and image-aware hints, the agent tools have been refactored to inject file context from state, and imports/configuration are standardized.

Changes:

Introduces a new retrieval pipeline API (retrieve, retrieve_no_rerank, retry logic, image metadata hints) plus a comprehensive test suite and README documenting retrieval behavior.
Refactors the agent’s tools and run_agent entrypoint to route image_paths and format hints through AgentState, adds a search_alternative tool, and updates prompts and schemas for the new conversation and retrieval semantics.
Standardizes ai_agent. import paths, updates configuration (including model selection via config.yaml), and tweaks catalog representation (SoftwareDoc) and documentation (CHANGELOG.md).

Reviewed changes

Copilot reviewed 21 out of 21 changed files in this pull request and generated 10 comments.

Show a summary per file

File	Description
tests/test_retrieval_pipeline.py	Adds an extensive pytest suite for `RAGImagingPipeline` covering medical/non-medical queries, edge cases, metadata hints, and retry behavior (note: current `sys.path` setup doesn’t actually add `src` to the import path, so `import ai_agent...` can still fail).
tests/README_RETRIEVAL_TESTS.md	Documents the new retrieval test suite, its organization, runtime, and how it validates the simplified embedding-based pipeline.
src/ai_agent/utils/utils.py	Fixes `SoftwareDoc` import to use the standardized `ai_agent.retriever.software_doc` path, keeping `_best_runnable_link` intact.
src/ai_agent/utils/tags.py	Simplifies supported control tags by dropping `[NO_RERANK]` and its helper, leaving `[REFINE]` and exclusion tags consistent with the new behavior.
src/ai_agent/utils/previews.py	Points image metadata and IO imports to `ai_agent.utils.*`, keeping preview-building behavior unchanged.
src/ai_agent/ui/app.py	Updates `handle_message` to call the new `run_agent(task, image_paths=..., ...)` signature so the agent derives metadata and formats internally.
src/ai_agent/retriever/vector_index.py	Adds minor whitespace for readability; load/save and indexing semantics are unchanged.
src/ai_agent/retriever/software_doc.py	Adds a validator to coerce list-valued `name` fields and rewrites `to_retrieval_text()` to emphasize tasks/anatomy/modality and embed format/task metadata more retrieval-optimally.
src/ai_agent/generator/schema.py	Reorders `ConversationStatus`/`Conversation` above `ToolSelection` to fix Pydantic forward-ref issues and keeps the normalization logic for conversation status and reasons.
src/ai_agent/generator/prompts.py	Rewrites the selector and agent system prompts for the new semantic retrieval + alternative search flow and parameterizes the maximum number of tool choices via `NUM_CHOICES` (default 5), though this default now differs from the UI’s fallback of 3.
src/ai_agent/api/pipeline.py	Refactors `RAGImagingPipeline` to accept `min_results`/`max_retries`, adds image-hint construction, implements retry/broadening logic in `retrieve_no_rerank`, and adds a high-level `retrieve` that applies CrossEncoder reranking.
src/ai_agent/agent/utils.py	Extends `AgentState` to track `image_paths` and `original_formats`, and provides quota/prepare helpers to limit tool usage and hide disabled tools.
src/ai_agent/agent/tools/utils.py	Updates imports to `ai_agent.*` and keeps a lazily-initialized singleton `RAGImagingPipeline` for use by agent tools.
src/ai_agent/agent/tools/search_tool.py	Adapts the search tool to pass `image_paths` and normalized `original_formats` into the pipeline and switch from manual rerank handling to the new `pipeline.retrieve()` (docstring still references the removed dictionary-based query expansion).
src/ai_agent/agent/tools/search_alternative_tool.py	Adds a new `search_alternative` tool that constructs an alternative query with soft format tokens and calls `pipeline.retrieve()` to support explicit agent-driven alternative searches.
src/ai_agent/agent/tools/rerank_tool.py	Aligns reranking with the new pipeline by ensuring the original query (without similarity expansion) is used as the reranker input.
src/ai_agent/agent/tools/gradio_space_tool.py	Fixes imports to `ai_agent.utils.*` and preserves the Gradio-based `run_example` functionality for running remote demos on user images.
src/ai_agent/agent/models.py	Simplifies `ToolRunLog` (removing the unused `summary` field) and exposes `AgentToolSelection` as a wrapper around `ToolSelection` with attached tool call logs.
src/ai_agent/agent/agent.py	Overhauls the agent: wires in the new prompts, defines adapters for `search_tools`, `search_alternative`, `repo_info`, and `run_example`, and rewrites `run_agent` to require `image_paths`, derive metadata/format hints, pass them via `AgentState`, and log tool usage (note: `_demo_pipeline` is now an unused but expensive global, and `run_agent` recomputes metadata already produced by `_build_preview_for_vlm`).
config.yaml	Updates the (commented) default agent model name to “gpt-5.1” and keeps the configurable base URL and API key environment variable wiring.
CHANGELOG.md	Records the new similarity-based query expansion, retry logic, alternative search tool, configuration module, and the schema forward-ref fix, matching the behavioral changes in this PR.

Copilot · 2026-01-29T09:43:10Z

@qchapp I've opened a new pull request, #17, to work on those changes. Once the pull request is ready, I'll request review from you.

Co-authored-by: qchapp <74377782+qchapp@users.noreply.github.com>

Eliminate redundant metadata extraction in agent pipeline

…aging-plaza-ai-agent into feature/tool-retrieval

Copilot

Copilot encountered an error and was unable to review this pull request. You can try again by re-requesting a review.

…added some tests for multimodalities

Copilot

Pull request overview

Copilot reviewed 20 out of 20 changed files in this pull request and generated 8 comments.

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

qchapp · 2026-01-29T19:39:43Z

Hi @caviri, could you review this PR for me? I already reviewed and I'm waiting for your confirmation before the merge! Thanks

caviri

Hi @qchapp, looks good to me in principle. I just quickly checked the code/structure; I didn't run it. Once you deploy to HF, you can add GitHub actions to get automatic builds and facilitate the inspection of PRs.

Copilot AI review requested due to automatic review settings January 28, 2026 12:24

Copilot started reviewing on behalf of qchapp January 28, 2026 12:24 View session

query expanded to improve retrieval

0d4a3cd

Copilot AI reviewed Jan 28, 2026

View reviewed changes

qchapp added 2 commits January 28, 2026 14:09

big update

18f51d5

continued rebase

65d1a76

qchapp force-pushed the feature/tool-retrieval branch from 8363b48 to 65d1a76 Compare January 28, 2026 13:23

qchapp and others added 2 commits January 28, 2026 14:27

removed useless imports

74b4e4d

Initial plan

9f66bd1

Copilot AI mentioned this pull request Jan 29, 2026

Eliminate redundant metadata extraction in agent pipeline #17

Merged

Copilot AI and others added 7 commits January 29, 2026 09:45

Eliminate redundant metadata computation in run_agent

33fc064

Co-authored-by: qchapp <74377782+qchapp@users.noreply.github.com>

Fix: only pass metadata when it matches current files

2f273d1

Co-authored-by: qchapp <74377782+qchapp@users.noreply.github.com>

small comments update

9db4d1e

Merge pull request #17 from Imaging-Plaza/copilot/sub-pr-16

e98c671

Eliminate redundant metadata extraction in agent pipeline

deleted useless line

a5c8266

Merge branch 'feature/tool-retrieval' of https://github.com/qchapp/im…

ce07acf

…aging-plaza-ai-agent into feature/tool-retrieval

fixed issues found during test of the interface - now working properly

5b508fb

qchapp requested a review from Copilot January 29, 2026 10:41

Copilot started reviewing on behalf of qchapp January 29, 2026 10:41 View session

Copilot AI reviewed Jan 29, 2026

fixed a big issue with the way the image was passed to the model and …

3908ccd

…added some tests for multimodalities

qchapp requested a review from Copilot January 29, 2026 18:47

Copilot started reviewing on behalf of qchapp January 29, 2026 18:47 View session

Copilot AI reviewed Jan 29, 2026

View reviewed changes

qchapp and others added 5 commits January 29, 2026 20:04

Update tests/test_epfl_vision.py

fadc984

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Update src/ai_agent/utils/previews.py

c953675

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Update src/ai_agent/utils/previews.py

73dc3f3

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Update src/ai_agent/utils/previews.py

11a7b44

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

fixed empty image paths error handling

bad59d8

qchapp requested a review from caviri January 29, 2026 19:39

caviri approved these changes Feb 2, 2026

View reviewed changes

qchapp merged commit 1d8adb2 into develop Feb 2, 2026
2 checks passed

qchapp deleted the feature/tool-retrieval branch February 2, 2026 09:34

Conversation

qchapp commented Jan 28, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI commented Jan 29, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

qchapp commented Jan 29, 2026

Uh oh!

caviri left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

caviri left a comment •

edited

Loading