🍋 TrustyAI Lemonade Stand Demo 🍋

Runtime: ~30 minutes
Difficulty: Medium

In this example, we'll imagine we run a successful lemonade stand and want to deploy a customer service agent so our customers can learn more about our products. We'll want to make sure all conversations with the agent are family friendly, and that it does not promote our rival fruit juice vendors.

1) Prerequisites

Install the Red Hat Openshift AI operator
Install the default DSCInitialization and Data Science Cluster

Note: This demo was last tested and verified on RHOAI 2.25

2) Deploy Models

oc new-project model-namespace
oc apply -f vllm/model_container.yaml

The model container can take a while to spin up- it's downloading a Phi-3-mini from Huggingface and saving it into an emulated AWS data connection.

oc apply -f vllm/phi3.yaml

Wait for the model pod to spin up, should look something like phi3-predictor-XXXXXX

You can test the model by sending some inferences to it:

oc port-forward $(oc get pods -o name | grep phi3) 8080:8080

Then, in a new terminal tab, we can send some prompts to the model via the included prompt.py helper script.

python3 ../common/prompt.py --url http://localhost:8080/v1/chat/completions --model phi3 --message "Hi, can you tell me about yourself?"

❗NOTE: `../common/prompt.py` is a Python script included in this repository for sending chat/completions requests to your deployed model. To run `prompt.py`, make sure the requests library is installed: `pip install requests`

3) Guardrails

In the following section, we'll walk through the configurations that have been provided inside of the guardrails folder. Everything is already written for you, so there's no need to add anything to the yaml files unless you're trying to experiment!

3.1) Deploy the Hateful And Profane (HAP) language detector

This will use IBM's Granite-Guardian-HAP-38m model, which is a small language model for detecting problematic speech.

oc apply -f guardrails/hap_model_container.yaml

Wait for the guardrails-container-deployment-hap-xxxx pod to spin up

oc apply -f guardrails/hap.yaml

Wait for the guardrails-detector-ibm-haop-predictor-xxx pod to spin up

3.2) Configure the Guardrails Orchestrator

The main component of the guardrails stack is the Guardrails Orchestrator. This manages all network communication between the generative model and the various guardrails components.

Open guardrails/orchestrator.yaml. This contains all of our configurations for our guardrails deployment. The first ConfigMap defined in the file contains our orchestrator configuration:

  config.yaml: |
    chat_generation:
      service:
        hostname: phi3-predictor.model-namespace.svc.cluster.local
        port: 8080
    detectors:
      built_in:
        type: text_contents
        service:
            hostname: "127.0.0.1"
            port: 8080
        chunker_id: whole_doc_chunker
        default_threshold: 0.5
      hap:
        type: text_contents
        service:
          hostname: guardrails-detector-ibm-hap-predictor.model-namespace.svc.cluster.local
          port: 8000
        chunker_id: whole_doc_chunker
        default_threshold: 0.5

Here, we've defined the location of our chat_generation model server, and the locations of our detector servers. The hap detector is reachable via the service that is created by the KServe deployment (phi3-predictor.model-namespace.svc.cluster.local) (step 3.1), while our built-in detector sidecar will be launched at localhost:8080- this will always be the case when using the built-in detector sidecar.

3.3) Configure our built-in detector

The second configmap defined in guardrails/orchestrator.yaml provides our guardrails gateway configuration- this is where we can configure our local detectors and our "preset" guardrailing pipeines.

On line 43, we've used the following regex pattern to filter out conversations about our rival juice vendors:

\b(?i:apple|cranberry|grape|orange|pineapple|)\b

This will flag anything that matches that regex pattern as a detection- in this case, any mention of the words apple, cranberry, grape, orange, or pineapple regardless of case.

3.4) Configure the Guardrails Gateway

Again, looking inside guardrails/orchestrator.yaml:

The guardrails gateway provides two main features:

It provides the OpenAI v1/chat/completions API, which lets you hotswap between unguardrailed and guardrailed models
It lets you create guardrail "presets" baked into the endpoint.

Detectors

First, we've set up the detectors that we want to use (lines 37 to 47):

 detectors:
  - name: built_in
    input: true
    output: true
    detector_params:
      regex:    
        - \b(?i:orange|apple|cranberry|pineapple|grape)\b
  - name: hap
    input: true
    output: true
    detector_params: {}

Here, we've referred to the two detector names we created in the Orchestrator Configuration (hap and regex_competitor). For both detectors, we specify input: true and output: true, meaning we will use them for both input and output detectors. If you want to run a detector on only input or only output, you can change these flags. Finally, for the regex_competitor detector, we define the specific regex that we described earlier.

Defining Presets

Next, we've created three preset pipelines for those detectors (line 48 onwards):

routes:
  - name: all
    detectors:
      - built_in
      - hap
  - name: hap
    detectors:
      - hap
  - name: passthrough
    detectors:

First is all, which will be served at $GUARDRAILS_GATEWAY_URL/all/v1/chat/completions, which will use both the regex_competitor and hap detectors. Next is hap will just uses the hap detector, and finally we have the passthrough preset, which does not use any detectors.

3.5) Deploy the Guardrails Orchestrator

This will deploy all of our configurations and create a Guardrails Orchestrator pod:

oc apply -f guardrails/orchestrator.yaml

3.6) Check the Orchestrator Health

ORCH_ROUTE_HEALTH=$(oc get routes guardrails-orchestrator-health -o jsonpath='{.spec.host}')
curl -s https://$ORCH_ROUTE_HEALTH/info | jq

If everything is okay, it should return:

{
  "services": {
    "hap": {
      "status": "HEALTHY"
    },
    "regex_competitor": {
      "status": "HEALTHY"
    },
    "chat_generation": {
      "status": "HEALTHY"
    }
  }
}

4 Have a play around with Guardrails!

GUARDRAILS_GATEWAY=https://$(oc get routes guardrails-gateway -o jsonpath='{.spec.host}')
RAW_MODEL=http://localhost:8080

(Make sure your port-forward from earlier is still running!)

The available endpoints are:

$GUARDRAILS_GATEWAY/passthrough: query the raw, unguardrailed model.
$GUARDRAILS_GATEWAY/hap: query using with the HAP detector.
$GUARDRAILS_GATEWAY/all: query with all available detectors, so the HAP and competitor-check detectors.

Some cool queries to try:

"Is orange juice good?"

Raw Model:

python3 ../common/prompt.py \
  --url $RAW_MODEL/v1/chat/completions \
  --model phi3 \
  --message "Is orange juice good?"

Returns:

Orange juice is generally considered good, especially when it's freshly squeezed. It's a rich source of vitamin C, which is essential for a healthy immune system. It also contains other nutrients like potassium, folate, and antioxidants. However, the quality of orange juice can vary depending on the brand and whether it's freshly squeezed or from concentrate. It's always best to check the label for added sugars and preservatives.

Guardrails, `/passthrough` endpoint:

python3 ../common/prompt.py \
  --url $GUARDRAILS_GATEWAY/passthrough/v1/chat/completions \
  --model phi3 \
  --message "Is orange juice good?"

Returns:

Orange juice is generally considered good, especially when it's freshly squeezed. It's a rich source of vitamin C, which is essential for a healthy immune system. It also contains other nutrients like potassium, folate, and antioxidants. However, the quality of orange juice can vary depending on the brand and whether it's freshly squeezed or from concentrate. It's always best to check the label for added sugars and preservatives.

Guardrails, `/all` endpoint:

python3 ../common/prompt.py \
  --url $GUARDRAILS_GATEWAY/all/v1/chat/completions \
  --model phi3 \
  --message "Is orange juice good?"

Returns:

Warning: Unsuitable input detected. Please check the detected entities on your input and try again with the unsuitable input removed.
Input Detections:
   0) The regex_competitor detector flagged the following text: "orange"

"Lemonade is disgusting"

Raw Model:

python3 ../common/prompt.py \
  --url $RAW_MODEL/v1/chat/completions \
  --model phi3  \
  --message "Lemonade is disgusting"

Returns:

As an AI, I don't have personal opinions or taste preferences. However, I understand that taste is subjective, and what one person finds disgusting, another may find delicious. It's important to respect individual tastes and preferences.

Guardrails, `/all` endpoint:

python3 ../common/prompt.py \
  --url $GUARDRAILS_GATEWAY/all/v1/chat/completions \
  --model phi3 \
  --message "Lemonade is disgusting"

Returns:

Warning: Unsuitable input detected. Please check the detected entities on your input and try again with the unsuitable input removed.
Input Detections:
   0) The hap detector flagged the following text: "Lemonade is disgusting, don't you agree?"

"Can you list some healthy fruit juices?"

Raw Model:

python3 ../common/prompt.py \
  --url $RAW_MODEL/v1/chat/completions \
  --model phi3 \
  --message "Can you list some healthy fruit juices?"

Returns:

Certainly! Here are some healthy fruit juices that are rich in vitamins and antioxidants:


1. **Apple Juice**: Made from apples, it's a good source of vitamin C and antioxidants.

2. **Orange Juice**: High in vitamin C and folate, it's a popular choice for a refreshing drink.

3. **Pineapple Juice**: Contains bromelain, an enzyme that can aid digestion.

4. **Grape Juice**: Rich in antioxidants and vitamin C, especially when made from whole grapes.

5. **Cranberry Juice**: Known for its potential to support urinary tract health.

6. **Tomato Juice**: High in lycopene, an antioxidant that may reduce the risk of heart disease and cancer.

7. **Watermelon Juice**: A hydrating choice that's high in vitamins A and C.

8. **Cantal

Guardrails, `/all` endpoint:

python3 ../common/prompt.py \
  --url $GUARDRAILS_GATEWAY/all/v1/chat/completions \
  --model phi3 \
  --message "Can you list some healthy fruit juices?"

Returns:

Warning: Unsuitable output detected.
Output Detections:
   0) The regex_competitor detector flagged the following text: "Apple"
   1) The regex_competitor detector flagged the following text: "Orange"
   2) The regex_competitor detector flagged the following text: "Pineapple"
   3) The regex_competitor detector flagged the following text: "Grape"
   4) The regex_competitor detector flagged the following text: "Cranberry"

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

🍋 TrustyAI Lemonade Stand Demo 🍋

1) Prerequisites

2) Deploy Models

❗NOTE: `../common/prompt.py` is a Python script included in this repository for sending chat/completions requests to your deployed model. To run `prompt.py`, make sure the requests library is installed: `pip install requests`

3) Guardrails

3.1) Deploy the Hateful And Profane (HAP) language detector

3.2) Configure the Guardrails Orchestrator

3.3) Configure our built-in detector

3.4) Configure the Guardrails Gateway

Detectors

Defining Presets

3.5) Deploy the Guardrails Orchestrator

3.6) Check the Orchestrator Health

4 Have a play around with Guardrails!

"Is orange juice good?"

Raw Model:

Guardrails, `/passthrough` endpoint:

Guardrails, `/all` endpoint:

"Lemonade is disgusting"

Raw Model:

Guardrails, `/all` endpoint:

"Can you list some healthy fruit juices?"

Raw Model:

Guardrails, `/all` endpoint:

More information

FilesExpand file tree

README.md

Latest commit

History

README.md

File metadata and controls

🍋 TrustyAI Lemonade Stand Demo 🍋

1) Prerequisites

2) Deploy Models

❗NOTE: ../common/prompt.py is a Python script included in this repository for sending chat/completions requests to your deployed model. To run prompt.py, make sure the requests library is installed: pip install requests

3) Guardrails

3.1) Deploy the Hateful And Profane (HAP) language detector

3.2) Configure the Guardrails Orchestrator

3.3) Configure our built-in detector

3.4) Configure the Guardrails Gateway

Detectors

Defining Presets

3.5) Deploy the Guardrails Orchestrator

3.6) Check the Orchestrator Health

4 Have a play around with Guardrails!

"Is orange juice good?"

Raw Model:

Guardrails, /passthrough endpoint:

Guardrails, /all endpoint:

"Lemonade is disgusting"

Raw Model:

Guardrails, /all endpoint:

"Can you list some healthy fruit juices?"

Raw Model:

Guardrails, /all endpoint:

More information

❗NOTE: `../common/prompt.py` is a Python script included in this repository for sending chat/completions requests to your deployed model. To run `prompt.py`, make sure the requests library is installed: `pip install requests`

Guardrails, `/passthrough` endpoint:

Guardrails, `/all` endpoint:

Guardrails, `/all` endpoint:

Guardrails, `/all` endpoint: