
feat: add azure openai inference #38

Merged
nathan-weinberg merged 1 commit into opendatahub-io:main from leseb:add-azureai
Sep 18, 2025

Conversation

@leseb
Collaborator

@leseb leseb commented Sep 18, 2025

feat: add azure openai inference

New inference provider.

Relates to: RHAIENG-744
Signed-off-by: Sébastien Han <seb@redhat.com>

@coderabbitai ignore

@coderabbitai
Contributor

coderabbitai bot commented Sep 18, 2025

Note

Reviews paused

Use the following commands to manage reviews:

  • @coderabbitai resume to resume automatic reviews.
  • @coderabbitai review to trigger a single review.

Walkthrough

Bumps llama-stack to 0.2.22 across configs, adds a conditional Azure remote inference provider to build/run configurations and docs, and updates container dependencies by replacing openai with litellm. Pre-commit hook and build scripts align with the new version; no API signatures changed.

Changes

| Cohort / File(s) | Summary |
|---|---|
| **Version bump: llama-stack 0.2.21 → 0.2.22**<br>`.pre-commit-config.yaml`, `distribution/Containerfile`, `distribution/Containerfile.in`, `distribution/build.py` | Update pinned/expected llama-stack version to 0.2.22 in pre-commit, container build, and build script requirements. |
| **Azure inference provider (config + docs)**<br>`distribution/build.yaml`, `distribution/run.yaml`, `distribution/README.md` | Add remote Azure inference provider: static entry in build.yaml; conditional provider in run.yaml using env vars; document “inference |
| **Container dependency change**<br>`distribution/Containerfile` | Replace Python dependency openai with litellm in pip install list; no other container changes. |
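For orientation, the conditional provider entry described above might look roughly like the following in `distribution/run.yaml`. This is a hedged sketch, not the PR's actual diff: the `config` field names are inferred from the api_key/base/version/type values mentioned in the sequence diagram below, and the `${env.VAR}` substitution syntax is llama-stack's convention.

```yaml
# Hypothetical sketch of the conditional Azure entry in distribution/run.yaml.
# Field names under `config` are assumptions, not copied from the PR.
providers:
  inference:
    - provider_id: azure
      provider_type: remote::azure
      config:
        api_key: ${env.AZURE_API_KEY}
        api_base: ${env.AZURE_API_BASE}
        api_version: ${env.AZURE_API_VERSION}
        api_type: ${env.AZURE_API_TYPE}
```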

Sequence Diagram(s)

```mermaid
sequenceDiagram
  autonumber
  participant U as User
  participant R as Runner (distribution/run.yaml)
  participant PR as Provider Registry
  participant AZ as Azure Provider (remote::azure)
  participant Other as Other Inference Providers

  rect rgba(200,220,255,0.25)
    note over R,PR: Provider registration at startup
    R->>PR: Register providers from config
    alt env.AZURE_API_KEY is set
      note right of PR: Conditionally include Azure
      PR->>PR: Add provider_id "azure"
    else env.AZURE_API_KEY is empty
      PR->>PR: Skip Azure provider
    end
  end

  U->>R: Submit inference request
  R->>PR: Select provider (routing rules)
  alt Azure selected
    PR->>AZ: Forward request with api_key/base/version/type
    AZ-->>PR: Response
  else Other provider selected
    PR->>Other: Forward request
    Other-->>PR: Response
  end
  PR-->>R: Return response
  R-->>U: Deliver result

  note over R,AZ: New/changed: conditional Azure provider integration
```
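In plain Python, the registration branch in the diagram amounts to something like this. A minimal sketch only: the function name and the baseline provider list are hypothetical, not llama-stack's actual registry code.

```python
import os

def select_inference_providers(env=None):
    """Sketch of the startup branch above: include the Azure provider
    only when AZURE_API_KEY is set to a non-empty value."""
    env = os.environ if env is None else env
    providers = ["remote::vllm"]  # hypothetical baseline provider list
    if env.get("AZURE_API_KEY"):
        providers.append("remote::azure")
    return providers
```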

Estimated code review effort

🎯 2 (Simple) | ⏱️ ~10 minutes

Poem

Ears up, I flip through YAML skies,
Azure clouds wink—surprise, surprise!
Llama trots to .22, so spry,
litellm packs our carrot pie.
Hooks are tidy, builds run true—
I thump in joy: “Ship it, woo!” 🥕✨

Pre-merge checks and finishing touches

✅ Passed checks (3 passed)
| Check name | Status | Explanation |
|---|---|---|
| Docstring Coverage | ✅ Passed | No functions found in the changes. Docstring coverage check skipped. |
| Description Check | ✅ Passed | Check skipped – CodeRabbit’s high-level summary is enabled. |
| Title Check | ✅ Passed | The title "feat: add azure openai inference" is a concise, single-sentence summary that correctly highlights the primary change in the PR (adding an Azure OpenAI inference provider); the bump to llama-stack is a secondary dependency update and does not make the title misleading. |
✨ Finishing touches
  • 📝 Generate Docstrings

🧪 Generate unit tests
  • Create PR with unit tests
  • Post copyable unit tests in a comment

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.


Comment @coderabbitai help to get the list of available commands and usage tips.

@leseb
Collaborator Author

leseb commented Sep 18, 2025

@coderabbitai ignore

@Elbehery
Collaborator

@leseb the downstream release does not include azure

Downstream release was cut at this MR

@coderabbitai
Contributor

coderabbitai bot commented Sep 18, 2025

✅ Actions performed

Reviews paused.

@leseb
Collaborator Author

leseb commented Sep 18, 2025

needs #39

@leseb leseb marked this pull request as draft September 18, 2025 09:21
@leseb leseb changed the title from "feat: bump llama-stack and add azure openai inference" to "feat: add azure openai inference" Sep 18, 2025
@Elbehery
Collaborator

pre-commit fails on

```text
Error executing command: Command 'llama stack build --config distribution/build.yaml --print-deps-only' returned non-zero exit status 1.
Command output: # Dependencies for distribution/build.yaml
WARNING  2025-09-18 09:21:29,362 llama_stack.core.distribution:149 core: Failed to import module prompts: No module named
         'llama_stack.providers.registry.prompts'

Command stderr: Traceback (most recent call last):
  File "/home/runner/.cache/pre-commit/repoiv_b5t3a/py_env-python3.12/bin/llama", line 7, in <module>
    sys.exit(main())
             ^^^^^^
  File "/home/runner/.cache/pre-commit/repoiv_b5t3a/py_env-python3.12/lib/python3.12/site-packages/llama_stack/cli/llama.py", line 53, in main
    parser.run(args)
  File "/home/runner/.cache/pre-commit/repoiv_b5t3a/py_env-python3.12/lib/python3.12/site-packages/llama_stack/cli/llama.py", line 47, in run
    args.func(args)
  File "/home/runner/.cache/pre-commit/repoiv_b5t3a/py_env-python3.12/lib/python3.12/site-packages/llama_stack/cli/stack/build.py", line 100, in _run_stack_build_command
    return run_stack_build_command(args)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/runner/.cache/pre-commit/repoiv_b5t3a/py_env-python3.12/lib/python3.12/site-packages/llama_stack/cli/stack/_build.py", line 232, in run_stack_build_command
    normal_deps, special_deps, external_provider_dependencies = get_provider_dependencies(build_config)
                                                                ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/runner/.cache/pre-commit/repoiv_b5t3a/py_env-python3.12/lib/python3.12/site-packages/llama_stack/core/build.py", line 65, in get_provider_dependencies
    raise ValueError(f"Provider `{provider}` is not available for API `{api_str}`")
ValueError: Provider `provider_type='remote::azure' module=None` is not available for API `inference`
```
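The ValueError at the bottom of the trace boils down to a registry lookup in a llama-stack version that predates the 0.2.22 bump, so `remote::azure` is simply unknown. A minimal sketch of that failure mode follows; the `AVAILABLE` mapping and `check_provider` helper are invented for illustration and are not llama-stack's actual internals.

```python
# Sketch of the lookup that fails above: the pre-0.2.22 registry does not
# contain remote::azure, so dependency resolution raises ValueError.
# AVAILABLE's contents are invented for illustration.
AVAILABLE = {"inference": {"remote::vllm", "remote::ollama"}}

def check_provider(api, provider_type):
    if provider_type not in AVAILABLE.get(api, set()):
        raise ValueError(f"Provider `{provider_type}` is not available for API `{api}`")
```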

@leseb
Collaborator Author

leseb commented Sep 18, 2025

> pre-commit fails on
>
> ValueError: Provider `provider_type='remote::azure' module=None` is not available for API `inference`

As expected since this branch does not run on 0.2.22, hence the wait for

@leseb leseb force-pushed the add-azureai branch 2 times, most recently from da6199a to a282199 on September 18, 2025 15:04
@leseb leseb marked this pull request as ready for review September 18, 2025 15:04
@leseb
Collaborator Author

leseb commented Sep 18, 2025

@nathan-weinberg @Elbehery ready!

Collaborator

@Elbehery Elbehery left a comment


LGTM

New inference provider.

Relates to: RHAIENG-744
Signed-off-by: Sébastien Han <seb@redhat.com>
@nathan-weinberg nathan-weinberg merged commit 1ceda2b into opendatahub-io:main Sep 18, 2025
4 checks passed
@leseb leseb deleted the add-azureai branch September 18, 2025 18:18