Add multi-architecture support to AQUA Shape Recommender by Aryanag2 · Pull Request #1336 · oracle/accelerated-data-science

Aryanag2 · 2026-02-11T18:29:16Z

Summary

Upgrade shape recommender to support 4 architecture types using strategy pattern: text-generation, multimodal VLMs, embedding models, and audio/ASR.

Changes

New architectures supported:
- ✅ Multimodal VLMs (LLaVA, Nemotron-VL, Qwen2-VL, InternVL, Phi3-V)
- ✅ Embedding models (BERT, RoBERTa, E5-Mistral, GTE, ModernBERT)
- ✅ Audio/ASR (Whisper all sizes)
- ✅ Text-generation (Llama, Mistral, Qwen, Falcon) - no changes to existing behavior
Key implementation:
- Add ParsedModelConfig with detect_architecture() as single routing point
- Add EmbeddingConfig, WhisperConfig, VisionConfig for new architectures
- Add memory estimators: VisionMemoryEstimator, EmbeddingMemoryEstimator, WhisperMemoryEstimator
- Implement strategy pattern with 4 concrete strategies in new strategies/ package
- Add StrategyFactory for architecture-to-strategy routing
- Remove text-generation-only gate from _get_model_config()
- Add architecture-specific vLLM flags (--limit-mm-per-prompt, --task embedding, etc.)
- Graceful error handling for multimodal models with incomplete sub-configs

Impact

Models previously rejected with "not supported" errors now get proper recommendations
Zero breaking changes - all existing text-generation models work identically
Enables AQUA service to support new model types

Testing

All 39 tests passing in test_recommend.py ✅
- 27 existing tests (zero regressions)
- 12 new architecture tests added in TestNewArchitectures
  - Audio/Whisper: 4 tests ✅
  - Embedding: 5 tests ✅
  - Multimodal VLM: 3 tests ✅
Test data included: 17 HuggingFace config.json files for new architectures
Command used: pytest tests/unitary/with_extras/aqua/test_recommend.py -v
Backward compatibility verified: All text-generation tests continue to pass with identical behavior

Files Changed

Modified: constants.py, estimator.py, llm_config.py, recommend.py, test_recommend.py
New: strategies/__init__.py, strategies/base.py, strategies/text.py, strategies/multimodal.py, strategies/embedding.py, strategies/audio.py
Test data: 17 config files in test_data/recommend/config-json-files/

Upgrade shape recommender to support 4 architecture types using strategy pattern: - Text-generation (Llama, Mistral, Qwen, Falcon) - no changes to existing behavior - Multimodal VLMs (LLaVA, Nemotron-VL, Qwen2-VL, InternVL, Phi3-V) - Embedding models (BERT, RoBERTa, E5-Mistral, GTE, ModernBERT) - Audio/ASR (Whisper all sizes) Key changes: - Add ParsedModelConfig with detect_architecture() as single routing point - Add EmbeddingConfig, WhisperConfig, VisionConfig for new architectures - Add memory estimators: VisionMemoryEstimator, EmbeddingMemoryEstimator, WhisperMemoryEstimator - Implement strategy pattern with 4 concrete strategies in new strategies/ package - Add StrategyFactory for architecture-to-strategy routing - Remove text-generation-only gate from _get_model_config() - Add architecture-specific vLLM flags (--limit-mm-per-prompt, --task embedding, etc.) Models previously rejected with "not supported" errors now get proper recommendations. Zero breaking changes - all existing text-generation models work identically. Tested: 9/10 existing tests pass, 1 test has expected behavior change (Whisper now succeeds via new entry point instead of being rejected) Signed-off-by: Aryan Gosaliya <aryan.gosaliya@oracle.com>

…ure support - Add graceful error handling for multimodal models with incomplete sub-configs - Wrap text_config and vision_config parsing in try-except blocks - Allow either text or vision config to succeed for VLMs - Update test_llm_config_unsupported_models to reflect new error messages - Remove obsolete test_which_shapes_valid (replaced by TestNewArchitectures) - All 39 tests in test_recommend.py now pass - New architecture tests (audio, embedding, multimodal): 12/12 passing Signed-off-by: Aryan Gosaliya <aryan.gosaliya@oracle.com>

github-actions · 2026-02-11T18:59:21Z

📌 Cov diff with main:

📌 Overall coverage:

- 7 Whisper/audio model configs (openai/whisper-*) - 6 embedding model configs (BAAI/bge-*, sentence-transformers/*) - 4 multimodal VLM configs (llava-hf/*, lmms-lab/*) These config.json files are required for TestNewArchitectures tests to pass in GitHub Actions CI/CD workflow. Signed-off-by: Aryan Gosaliya <aryan.gosaliya@oracle.com>

github-actions · 2026-02-11T20:06:47Z

📌 Cov diff with main:

📌 Overall coverage:

_summarize_shapes_for_seq_lens uses LLMConfig as a type annotation but it was never imported after the V2 refactor, causing a NameError at class definition time and preventing the CLI from loading at all. Signed-off-by: Aryan Gosaliya <aryan.gosaliya@oracle.com>

github-actions · 2026-02-13T00:12:55Z

📌 Cov diff with main:

📌 Overall coverage:

github-actions · 2026-02-18T23:04:43Z

📌 Cov diff with main:

📌 Overall coverage:

… strategies Signed-off-by: Aryan Gosaliya <aryan.gosaliya@oracle.com>

github-actions · 2026-02-19T01:06:09Z

📌 Cov diff with main:

📌 Overall coverage:

mrDzurb · 2026-02-26T17:56:34Z

LGTM!

github-actions · 2026-02-26T19:05:16Z

📌 Cov diff with main:

📌 Overall coverage:

github-actions · 2026-02-28T17:20:23Z

📌 Cov diff with main:

📌 Overall coverage:

Aryanag2 requested review from VipulMascarenhas, ahosler, mayoor, mrDzurb and qiuosier as code owners February 11, 2026 18:29

oracle-contributor-agreement bot added the OCA Verified All contributors have signed the Oracle Contributor Agreement. label Feb 11, 2026

Aryanag2 changed the title ~~Add multi-architecture support to AQUA Shape Recommender~~ [WIP]Add multi-architecture support to AQUA Shape Recommender Feb 11, 2026

Merge branch 'main' into feature/shape-recommender-v2-multi-architecture

fc4ee22

Add dynamic vLLM param selection for audio, embedding, and multimodal…

4ed9ab0

… strategies Signed-off-by: Aryan Gosaliya <aryan.gosaliya@oracle.com>

mrDzurb approved these changes Feb 26, 2026

View reviewed changes

Merge branch 'main' into feature/shape-recommender-v2-multi-architecture

3f1c4c7

mrDzurb requested review from lu-ohai, sambitkumohanty and smfirmin as code owners February 26, 2026 17:56

smfirmin approved these changes Feb 26, 2026

View reviewed changes

mrDzurb changed the title ~~[WIP]Add multi-architecture support to AQUA Shape Recommender~~ Add multi-architecture support to AQUA Shape Recommender Feb 27, 2026

Merge branch 'main' into feature/shape-recommender-v2-multi-architecture

71192cb

mrDzurb merged commit f605539 into main Feb 28, 2026
16 of 20 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add multi-architecture support to AQUA Shape Recommender#1336

Add multi-architecture support to AQUA Shape Recommender#1336
mrDzurb merged 8 commits intomainfrom
feature/shape-recommender-v2-multi-architecture

Aryanag2 commented Feb 11, 2026 •

edited

Loading

Uh oh!

github-actions bot commented Feb 11, 2026

Uh oh!

github-actions bot commented Feb 11, 2026

Uh oh!

github-actions bot commented Feb 13, 2026

Uh oh!

github-actions bot commented Feb 18, 2026

Uh oh!

github-actions bot commented Feb 19, 2026

Uh oh!

mrDzurb commented Feb 26, 2026

Uh oh!

github-actions bot commented Feb 26, 2026

Uh oh!

Uh oh!

github-actions bot commented Feb 28, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

Aryanag2 commented Feb 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Changes

Impact

Testing

Files Changed

Uh oh!

github-actions bot commented Feb 11, 2026

Uh oh!

github-actions bot commented Feb 11, 2026

Uh oh!

github-actions bot commented Feb 13, 2026

Uh oh!

github-actions bot commented Feb 18, 2026

Uh oh!

github-actions bot commented Feb 19, 2026

Uh oh!

mrDzurb commented Feb 26, 2026

Uh oh!

github-actions bot commented Feb 26, 2026

Uh oh!

Uh oh!

github-actions bot commented Feb 28, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Aryanag2 commented Feb 11, 2026 •

edited

Loading