
[duplicate] Update to OpenAI API and set LLS>=0.2.23#25

Merged
25 commits merged into trustyai-explainability:main from dmaniloff:upgrade-to-lls-0.3.0
Oct 17, 2025

Conversation

@dmaniloff
Collaborator

@dmaniloff dmaniloff commented Oct 10, 2025

  • Updated calls to embeddings and completions to use the OpenAI spec.
  • Refactored provider specifications to return a list with inline and remote providers.
    • The remote provider is included only if its dependencies are installed.
  • Unified the demo notebooks into a single notebook, and the run.yaml files into a single file.
  • Updated tests.

- Changed llama-stack and llama-stack-client dependencies to point to GitHub repositories.
- Unified demo notebooks into one, and run.yaml into one as well.
- Refactored provider specifications to return a list.
- Updated calls to embeddings and completions.
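The embeddings migration described above can be sketched as follows. This is a minimal, hypothetical stand-in: the stub client and function names are illustrative, and only the `openai_embeddings(model=..., input=...)` call shape follows the OpenAI spec used in this PR.

```python
import asyncio
from types import SimpleNamespace

async def embed_documents(inference_api, model_id: str, texts: list[str]) -> list[list[float]]:
    """Embed documents via the OpenAI-spec embeddings endpoint."""
    response = await inference_api.openai_embeddings(model=model_id, input=texts)
    # OpenAI-spec responses carry one data item per input, each with an `embedding` list
    return [item.embedding for item in response.data]

# Hypothetical stub standing in for the Llama Stack inference API
class StubInference:
    async def openai_embeddings(self, model, input):
        data = [SimpleNamespace(embedding=[0.0, 0.0, 0.0]) for _ in input]
        return SimpleNamespace(data=data)

vectors = asyncio.run(embed_documents(StubInference(), "some-model", ["a", "b"]))
```

The key difference from the pre-OpenAI call is that results live under `response.data[i].embedding` rather than a top-level `response.embeddings`.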
@dmaniloff dmaniloff marked this pull request as draft October 10, 2025 16:00
Contributor

@sourcery-ai sourcery-ai bot left a comment


Hey there - I've reviewed your changes and they look great!

Prompt for AI Agents
Please address the comments from this code review:

## Individual Comments

### Comment 1
<location> `src/llama_stack_provider_ragas/inline/wrappers_inline.py:41-45` </location>
<code_context>
-                model_id=self.embedding_model_id,
-                contents=texts,
-                task_type=EmbeddingTaskType.document,
+            response = await self.inference_api.openai_embeddings(
+                model=self.embedding_model_id,
+                input=texts,
             )
-            return response.embeddings  # type: ignore
+            return [data.embedding for data in response.data]  # type: ignore
         except Exception as e:
             logger.error(f"Document embedding failed: {str(e)}")
</code_context>

<issue_to_address>
**issue:** Consider handling empty or malformed response data in embedding extraction.

Add validation to ensure response.data is a non-empty list of objects with an 'embedding' attribute to prevent runtime errors.
</issue_to_address>
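One way to add that validation, as a sketch (the helper name and error handling are assumptions, not the provider's actual code):

```python
from types import SimpleNamespace

def extract_embeddings(response) -> list[list[float]]:
    """Validate and extract embedding vectors from an OpenAI-spec embeddings response."""
    data = getattr(response, "data", None)
    if not data:
        raise ValueError("Embeddings response contained no data")
    vectors = []
    for item in data:
        embedding = getattr(item, "embedding", None)
        if embedding is None:
            raise ValueError("Embeddings response item is missing an 'embedding'")
        vectors.append(embedding)
    return vectors

# Well-formed response: one data item per input text
ok = SimpleNamespace(data=[SimpleNamespace(embedding=[0.1, 0.2])])
vectors = extract_embeddings(ok)
```

Raising a `ValueError` here keeps the failure close to its cause instead of letting an `AttributeError` surface later in Ragas.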

### Comment 2
<location> `src/llama_stack_provider_ragas/inline/wrappers_inline.py:140-143` </location>
<code_context>
                 "provider": "llama_stack",
             }

+            # sampling params for this generation should be set via the benchmark config
+            # we will ignore the temperature and stop params passed in here
             for _ in range(n):
-                response = await self.inference_api.completion(
-                    model_id=self.model_id,
-                    content=prompt_text,
-                    sampling_params=gen_sampling_params,
+                response = await self.inference_api.openai_completion(
+                    model=self.model_id,
+                    prompt=prompt_text,
</code_context>

<issue_to_address>
**question:** The temperature and stop parameters passed to agenerate_text are now ignored.

If this behavior is intentional, please add documentation to clarify that function arguments for temperature and stop are ignored in favor of self.sampling_params.
</issue_to_address>
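If the behavior is intentional, the documentation could look like the following sketch; the class and method names are hypothetical, and only the precedence rule (benchmark config wins over per-call arguments) comes from the review comment.

```python
import logging

logger = logging.getLogger(__name__)

class SamplingConfigHolder:
    """Hypothetical wrapper illustrating the documented precedence rule."""

    def __init__(self, sampling_params: dict):
        # Sampling params come from the benchmark config
        self.sampling_params = sampling_params

    def effective_sampling(self, temperature=None, stop=None) -> dict:
        """Return the sampling settings actually used for generation.

        Note: per-call `temperature` and `stop` are ignored; generation is
        controlled by the benchmark config via `self.sampling_params`.
        """
        if temperature is not None or stop is not None:
            logger.debug("Ignoring per-call temperature/stop in favor of benchmark config")
        return self.sampling_params

holder = SamplingConfigHolder({"temperature": 0.2, "stop": ["\n"]})
params = holder.effective_sampling(temperature=0.9, stop=["END"])
```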

### Comment 3
<location> `src/llama_stack_provider_ragas/inline/wrappers_inline.py:168-170` </location>
<code_context>
                 )

+                # Extract text from OpenAI completion response
+                choice = response.choices[0] if response.choices else None
+                text = choice.text if choice else ""
+
                 # Store Llama Stack response info in llm_output
</code_context>

<issue_to_address>
**suggestion (bug_risk):** Defaulting to empty string if no choices are returned may mask upstream issues.

Consider logging a warning or error when no choices are returned, as this could indicate an issue with the model or API.

```suggestion
                # Extract text from OpenAI completion response
                choice = response.choices[0] if response.choices else None
                if not response.choices:
                    import logging
                    logging.warning("OpenAI completion response returned no choices. This may indicate an issue with the model or API.")
                text = choice.text if choice else ""
```
</issue_to_address>

Sourcery is free for open source - if you like our reviews please consider sharing them ✨
Help me be more useful! Please click 👍 or 👎 on each comment and I'll use the feedback to improve your reviews.

- Updated LlamaStackInlineLLM and LlamaStackRemoteLLM to accept SamplingParams directly.
- Removed unused prompt logging and token estimation methods.
- Enhanced error handling for completion responses.
- Adjusted tests to reflect changes in sampling parameters structure and usage.
- Improved integration with Kubeflow by ensuring proper sampling parameters are passed.
pyproject.toml Outdated
Comment on lines +88 to +90
[tool.hatch.metadata]
allow-direct-references = true

Collaborator Author


Only temporary, to allow git+https://github.com references in the dependencies.

pyproject.toml Outdated
Comment on lines +28 to +29
"llama-stack @ git+https://github.com/llamastack/llama-stack.git",
"llama-stack-client @ git+https://github.com/llamastack/llama-stack-client-python.git",
Collaborator Author


Replace with 0.3.0 once it's released.

@trustyai-explainability trustyai-explainability deleted a comment from sourcery-ai bot Oct 13, 2025
@dmaniloff dmaniloff changed the title from "upgrade to llama-stack 0.3.0." to "Upgrade to llama-stack 0.3.0." Oct 13, 2025
@dmaniloff dmaniloff marked this pull request as ready for review October 13, 2025 16:25
Contributor

@sourcery-ai sourcery-ai bot left a comment


Hey there - I've reviewed your changes and they look great!

Prompt for AI Agents
Please address the comments from this code review:

## Individual Comments

### Comment 1
<location> `src/llama_stack_provider_ragas/inline/wrappers_inline.py:39` </location>
<code_context>
     async def aembed_documents(self, texts: list[str]) -> list[list[float]]:
         """Embed documents using Llama Stack inference API."""
         try:
</code_context>

<issue_to_address>
**suggestion:** Consider handling empty or malformed embedding responses.

Currently, response.data is used without validation, which may cause exceptions if the API returns unexpected data. Please add checks to handle missing or malformed response data.
</issue_to_address>

### Comment 2
<location> `src/llama_stack_provider_ragas/remote/wrappers_remote.py:173-174` </location>
<code_context>
+                    stop=self.sampling_params.stop if self.sampling_params else None,
                 )

+                if not response.choices:
+                    logger.warning("Completion response returned no choices")
+
+                # Extract text from OpenAI completion response
</code_context>

<issue_to_address>
**suggestion:** Warns on empty choices but still appends empty generation.

Consider skipping the append or raising an error instead of adding an empty generation when no choices are returned.
</issue_to_address>
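A sketch of the suggested handling; the function name and the `strict` flag are assumptions, not the provider's actual code.

```python
import logging
from types import SimpleNamespace

logger = logging.getLogger(__name__)

def extract_completion_text(response, strict: bool = False):
    """Return the completion text, or None when the response has no choices.

    With strict=True an empty response raises instead, so callers never
    silently append an empty generation.
    """
    choices = getattr(response, "choices", None)
    if not choices:
        if strict:
            raise RuntimeError("Completion response returned no choices")
        logger.warning("Completion response returned no choices; skipping generation")
        return None
    return choices[0].text

# Callers can then skip appending a generation when None comes back
text = extract_completion_text(SimpleNamespace(choices=[SimpleNamespace(text="hi")]))
empty = extract_completion_text(SimpleNamespace(choices=[]))
```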

### Comment 3
<location> `src/llama_stack_provider_ragas/remote/kubeflow/components.py:111-116` </location>
<code_context>

+    # sampling_params is passed in from the benchmark config as model_dump()
+    # we need to convert it back to a SamplingParams object
+    sampling_params_obj = SamplingParams.model_validate(sampling_params)
+
     llm = LlamaStackRemoteLLM(
         base_url=llama_stack_base_url,
         model_id=model,
-        sampling_params=sampling_params,
+        sampling_params=sampling_params_obj,
     )
     embeddings = LlamaStackRemoteEmbeddings(
</code_context>

<issue_to_address>
**suggestion (bug_risk):** Converts dict sampling_params to SamplingParams object.

If sampling_params is malformed, model_validate may raise an exception. Please add error handling or pre-validation to ensure robustness.

Suggested implementation:

```python
    from llama_stack.apis.inference import SamplingParams
    from pydantic import ValidationError

```

```python
    # sampling_params is passed in from the benchmark config as model_dump()
    # we need to convert it back to a SamplingParams object
    try:
        sampling_params_obj = SamplingParams.model_validate(sampling_params)
    except ValidationError as e:
        logger.error(f"Invalid sampling_params: {e}")
        raise

```
</issue_to_address>

### Comment 4
<location> `pyproject.toml:28` </location>
<code_context>
 keywords = ["llama-stack", "ragas", "evaluation"]
 dependencies = [
     "setuptools-scm",
-    "llama-stack==0.2.23",
+    "llama-stack @ git+https://github.com/llamastack/llama-stack.git",
+    "llama-stack-client @ git+https://github.com/llamastack/llama-stack-client-python.git",
     "greenlet==3.2.4", # inline/files/localfs errors saying greenlet not found
</code_context>

<issue_to_address>
**suggestion:** Direct references to git repositories for dependencies.

Pin the git dependencies to a specific commit or tag to ensure build reproducibility and stability.

Suggested implementation:

```
    "llama-stack @ git+https://github.com/llamastack/llama-stack.git@<commit-or-tag>",
    "llama-stack-client @ git+https://github.com/llamastack/llama-stack-client-python.git@<commit-or-tag>",

```

Replace `<commit-or-tag>` with the actual commit hash or tag you want to pin for each repository. For example, if you want to pin to commit `abc1234`, use:
`git+https://github.com/llamastack/llama-stack.git@abc1234`
</issue_to_address>


Comment on lines +173 to +174
if not response.choices:
logger.warning("Completion response returned no choices")
Contributor


suggestion: Warns on empty choices but still appends empty generation.

Consider skipping the append or raising an error instead of adding an empty generation when no choices are returned.

@trustyai-explainability trustyai-explainability deleted a comment from sourcery-ai bot Oct 13, 2025
Collaborator Author


@nathan-weinberg I will post a PR to the distro to reflect these naming (provider_id/type) changes.

Contributor

@nathan-weinberg nathan-weinberg left a comment


Will do a more in-depth review once LLS 0.3.0 has released

@ruivieira ruivieira added the enhancement New feature or request label Oct 14, 2025
Member

@ruivieira ruivieira left a comment


@dmaniloff Changes LGTM, thanks.

Just added #27 as a reminder.

@dmaniloff dmaniloff requested a review from Elbehery October 14, 2025 13:24
@Elbehery
Collaborator

Thanks @dmaniloff for the quick fix 🙏🏽

We just need this to work with LLS 0.2.23.

Would it make sense to cut a new patch release for this purpose? (i.e. 0.3.2)

I'm not sure this would be needed for LLS 0.3.0, since the conflict-causing dependency (ibm-watson) is going to be dropped in LLS 0.3.0.

Please correct me if I'm wrong.

cc @ruivieira

@dmaniloff dmaniloff marked this pull request as draft October 15, 2025 15:56
@dmaniloff dmaniloff closed this pull request by merging all changes into trustyai-explainability:main in d5eaa31 Oct 17, 2025
@dmaniloff dmaniloff changed the title from "Upgrade to llama-stack 0.3.0." to "[duplicate] Update to OpenAI API and set LLS>=0.2.23" Oct 23, 2025