
feat: add cpu/cuda config for prompt guard #2194


Open · wants to merge 1 commit into base: main

Conversation


@mhdawson mhdawson commented May 16, 2025

What does this PR do?

Previously, prompt guard was hard-coded to require CUDA, which prevented it from being used on an instance without CUDA support.

This PR allows prompt guard to be configured to use either CPU or CUDA.

Closes #2133

Test Plan

  1. Started the stack on a system without a GPU, with prompt guard configured as follows:

```yaml
safety:
  - provider_id: prompt-guard
    provider_type: inline::prompt-guard
    config:
      guard_execution_type: cpu
```

     and validated that prompt guard could be used through the APIs.

  2. Started the stack on a system without a GPU, with prompt guard configured as follows:

```yaml
safety:
  - provider_id: prompt-guard
    provider_type: inline::prompt-guard
    config:
      guard_execution_type: cuda
```

     and validated that it indicated it could not run because the packages were not compiled with CUDA support. This is the same behavior as before the change.

  3. Ran the unit tests as per https://github.com/meta-llama/llama-stack/blob/main/tests/unit/README.md
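The two test-plan configurations differ only in the `guard_execution_type` key; a minimal sketch of resolving that key into a device string (the helper name is hypothetical, not from the PR):

```python
def resolve_device(config: dict) -> str:
    # "guard_execution_type" is the key shown in the test-plan YAML;
    # defaulting to "cuda" preserves the pre-change behavior.
    device = config.get("guard_execution_type", "cuda")
    if device not in ("cpu", "cuda"):
        raise ValueError(f"unsupported guard_execution_type: {device!r}")
    return device
```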

Signed-off-by: Michael Dawson <[email protected]>
@facebook-github-bot added the CLA Signed label (managed by the Meta Open Source bot) on May 16, 2025
```diff
@@ -75,7 +75,7 @@ def __init__(
         self.temperature = temperature
         self.threshold = threshold

-        self.device = "cuda"
+        self.device = self.config.guard_execution_type
```
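For `self.config.guard_execution_type` to exist, the provider's config class needs a corresponding field. A stdlib sketch under stated assumptions (the real project likely uses pydantic, and this class name is hypothetical):

```python
from dataclasses import dataclass


@dataclass
class PromptGuardConfig:
    # Mirrors the YAML key from the test plan; "cuda" keeps the old default.
    guard_execution_type: str = "cuda"

    def __post_init__(self) -> None:
        # Reject values other than the two the PR supports.
        if self.guard_execution_type not in ("cpu", "cuda"):
            raise ValueError(
                f"guard_execution_type must be 'cpu' or 'cuda', "
                f"got {self.guard_execution_type!r}"
            )
```

Validating at construction time means a bad YAML value fails when the stack starts rather than when the model is first loaded.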
Contributor: can we just check if cuda is available and use that, otherwise use CPU? no need for a specific configuration like this to be added.
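The reviewer's auto-detection suggestion can be sketched as follows (`torch.cuda.is_available()` is the standard PyTorch check; the helper name is hypothetical, and the import is guarded so the sketch also runs where torch is absent):

```python
def pick_device() -> str:
    # Prefer CUDA when the local torch build supports it; fall back to CPU.
    try:
        import torch
        if torch.cuda.is_available():
            return "cuda"
    except ImportError:
        pass
    return "cpu"
```

With this approach the `guard_execution_type` key would be unnecessary, at the cost of losing the ability to force CPU on a CUDA-capable machine.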

@ashwinb left a comment:
requesting changes for my inline comment

Successfully merging this pull request may close these issues.

Not possible to use CPU inferance with prompt-guard - intentional?