New Blueprint Project - Agentic Audio RAG with LangGraph #263

Open
alexsifman wants to merge 124 commits into v2.0.0 from feat/agentic-audio-rag

@alexsifman
Collaborator

@alexsifman alexsifman commented Aug 22, 2025

This PR contains the Agentic Audio RAG blueprint, a RAG system that turns speech in audio/video files into searchable knowledge and lets you ask questions directly about the actual audio. A LangGraph-driven agent retrieves the most relevant timestamped audio segments, and an audio-native LLM listens to those clips to produce precise answers.
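
As a rough illustration of the retrieval step described above, the sketch below ranks timestamped segments by cosine similarity to a query embedding. This is only an assumption about how such a step could look, not the blueprint's actual code: `AudioSegment`, `cosine`, and `top_k_segments` are hypothetical names, and the embeddings are plain float lists rather than the output of any real model.

```python
import math
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class AudioSegment:
    """Hypothetical record for one timestamped clip and its embedding."""
    start_s: float
    end_s: float
    embedding: Sequence[float]


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity; returns 0.0 if either vector has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def top_k_segments(
    query_embedding: Sequence[float],
    segments: Sequence[AudioSegment],
    k: int = 3,
) -> List[AudioSegment]:
    """Return the k segments most similar to the query, best first."""
    ranked = sorted(
        segments,
        key=lambda seg: cosine(query_embedding, seg.embedding),
        reverse=True,
    )
    return ranked[:k]
```

In a pipeline like the one described, the returned segments' `start_s`/`end_s` values would tell the audio-native model which clips to listen to.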

gabisponciano and others added 13 commits November 7, 2025 18:13
…local model instead of hf as default

- Updated README.md to clarify model storage options and local setup instructions.
- Modified config.yaml to set local model paths for Qwen and CLAP models.
- Refactored run-workflow.ipynb to initialize models using a configuration-driven approach.
- Enhanced model_selection.py to check for models in local datafabric before downloading.
- Added utility function to initialize audio models, supporting both local and remote loading.
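
The "check locally before downloading" behaviour described in the commit notes above could be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the code in model_selection.py: `resolve_model_path`, `local_roots`, and the `download` callback are hypothetical names introduced here.

```python
from pathlib import Path
from typing import Callable, Iterable, Optional


def resolve_model_path(
    model_name: str,
    local_roots: Iterable[Path],
    download: Optional[Callable[[str], Path]] = None,
) -> Path:
    """Return a local directory for model_name, downloading only as a last resort."""
    for root in local_roots:
        candidate = Path(root) / model_name
        # Treat a non-empty directory as a usable local copy (an assumption;
        # a real check might validate specific files instead).
        if candidate.is_dir() and any(candidate.iterdir()):
            return candidate
    if download is None:
        raise FileNotFoundError(
            f"{model_name} not found locally and no downloader was provided"
        )
    # Hypothetical fallback: the caller supplies whatever remote fetch it uses.
    return download(model_name)
```

Passing the downloader in as a callback keeps the lookup testable without network access.
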
@NickyJhames
Contributor

At the moment this bp takes a long time to deploy, not only because of the pip install step in the deployment script but also because the deployment process transfers two large models into the deployment environment.

The whole process takes longer than the tolerance AIS allows.

The AIS deployment scripts are not entirely correct: they check container readiness instead of container "liveness". I'm working on a quick POC to fix this, and only after that will we be able to deploy this bp.
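
To illustrate the readiness versus liveness distinction mentioned above, here is one possible reading as a sketch. It is not the AIS deployment script or the POC: `wait_for_container`, both probe callbacks, and the timeout values are hypothetical. The idea is to fail fast only when the container is dead, and to keep waiting while it is alive but still loading models.

```python
import time
from typing import Callable


def wait_for_container(
    is_alive: Callable[[], bool],
    is_ready: Callable[[], bool],
    ready_timeout_s: float = 1800.0,
    poll_interval_s: float = 5.0,
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> bool:
    """Return True once the container is ready, False if it dies or times out."""
    deadline = clock() + ready_timeout_s
    while clock() < deadline:
        # Liveness: a dead container will never become ready, so stop early.
        if not is_alive():
            return False
        # Readiness: the service can accept requests (e.g. models are loaded).
        if is_ready():
            return True
        # Alive but not ready yet: keep waiting instead of failing the deploy.
        sleep(poll_interval_s)
    return False
```

The `sleep` and `clock` parameters exist only so the loop can be exercised with a fake clock in tests.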

@NickyJhames
Contributor

We need to wait for the code from this PR (https://github.azc.ext.hp.com/phoenix/phoenix-app-desktop/pull/2442) to be included in a released binary before we can test the deployment of this bp.

nickyjhames_hpi and others added 5 commits January 22, 2026 13:53
- Introduced QwenOmniAgent class in a new module (src/qwen_agent.py) to unify the adapter for the Qwen2.5-Omni-7B model.
- Updated run-workflow notebook to use the new QwenOmniAgent instance.
- Removed the old _QwenAdapter class from model.py to streamline the codebase.
- Ensured compatibility between notebook and MLflow deployment by using the shared QwenOmniAgent implementation.
@njhames njhames force-pushed the feat/agentic-audio-rag branch from 8b051c3 to ab73ae4 January 25, 2026 16:03
@NickyJhames NickyJhames marked this pull request as ready for review January 25, 2026 16:21
@NickyJhames
Contributor

Notebooks updated; the Streamlit UI and MLflow service are working flawlessly ⚡

Labels

- dependencies: Pull requests that update a dependency file
- design: UI/UX-related work
- documentation: Improvements or additions to documentation
- enhancement: Improvements to existing features
- new-blueprint
- python: Pull requests that update python code

6 participants