Galileo Custom Metrics Demo

This demo shows how to fetch and display custom metrics from the Galileo API after running LLM calls.

Files

logstreams/logstream_demo.py - Runs LLM calls with Galileo logging
logstreams/fetch_session_metrics.py - Fetches and displays session metrics
logstreams/fetch_logstream_metrics.py - Fetches and displays logstream metrics
experiments/create_dataset.py - Creates a dataset for legal advice detection testing
experiments/run_experiment.py - Runs an experiment using the dataset to test metrics
experiments/fetch_experiment.py - Fetches and displays experiment results by ID
env.template - Environment variables template
README.md - This file

Setup

Install dependencies:

pip install galileo openai python-dotenv requests

Set up environment variables:

# Copy the template and fill in your values
cp env.template .env
# Edit .env with your actual credentials

Required environment variables:

GALILEO_API_KEY - Your Galileo API key
GALILEO_PROJECT - Your Galileo project name
GALILEO_PROJECT_ID - Your Galileo project ID
GALILEO_LOG_STREAM - Your Galileo log stream name
GALILEO_CONSOLE_URL - Your Galileo console URL
GALILEO_API_URL - Your Galileo API URL
OPENAI_API_KEY - Your OpenAI API key
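Before running any of the scripts, it can help to confirm that every variable listed above is actually set. The following is a minimal sketch, not code from this repository: `missing_vars` and `load_dotenv_if_available` are hypothetical helper names, and the only third-party call used is `load_dotenv` from python-dotenv.

```python
import os

# Names taken from the list of required environment variables above.
REQUIRED_VARS = [
    "GALILEO_API_KEY",
    "GALILEO_PROJECT",
    "GALILEO_PROJECT_ID",
    "GALILEO_LOG_STREAM",
    "GALILEO_CONSOLE_URL",
    "GALILEO_API_URL",
    "OPENAI_API_KEY",
]


def missing_vars(env=None):
    """Return the required variable names that are unset or empty in env."""
    env = os.environ if env is None else env
    return [name for name in REQUIRED_VARS if not env.get(name)]


def load_dotenv_if_available():
    """Load .env via python-dotenv when it is installed; otherwise do nothing."""
    try:
        from dotenv import load_dotenv
    except ImportError:
        return False
    load_dotenv()
    return True
```

Calling `load_dotenv_if_available()` followed by `missing_vars()` returns the names that still need a value in `.env`; an empty list means the environment is complete.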

Usage

Step 1: Run LLM calls

python logstreams/logstream_demo.py

Step 2: Fetch metrics (in another terminal or after waiting)

# Fetch session metrics
python logstreams/fetch_session_metrics.py <session_id>

# Fetch logstream metrics (all sessions in a logstream)
python logstreams/fetch_logstream_metrics.py
# OR with explicit project and logstream names
python logstreams/fetch_logstream_metrics.py "My Project" "My Logstream"
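The two invocations above differ only in where the project and logstream names come from. One way this fallback could work is sketched below; `resolve_targets` is a hypothetical helper, and the script's real argument handling may differ.

```python
import os


def resolve_targets(argv, env=None):
    """Return (project, logstream) from CLI args, falling back to the environment.

    argv is the argument list without the script name, e.g. sys.argv[1:].
    """
    env = os.environ if env is None else env
    if len(argv) >= 2:
        # Explicit names on the command line take priority.
        return argv[0], argv[1]
    if argv:
        raise ValueError("pass both a project and a logstream name, or neither")
    project = env.get("GALILEO_PROJECT")
    logstream = env.get("GALILEO_LOG_STREAM")
    if not project or not logstream:
        raise ValueError("set GALILEO_PROJECT and GALILEO_LOG_STREAM, or pass both names")
    return project, logstream
```

Raising `ValueError` rather than exiting keeps the helper easy to test; a script would catch it and print a usage message.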

Step 3: Create dataset for testing (optional)

python experiments/create_dataset.py

Step 4: Run experiment (optional)

python experiments/run_experiment.py

Step 5: Fetch experiment results (optional)

python experiments/fetch_experiment.py <experiment_id>

What it does

`logstreams/logstream_demo.py`:

Runs legal advice LLM calls with Galileo logging
Shows contrast between legal and non-legal questions
Displays session ID for metrics fetching

`logstreams/fetch_session_metrics.py`:

Fetches session metrics from Galileo API
Displays comprehensive metrics from all levels (session, trace, span)
Shows custom metrics like legal advice detection
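To illustrate what displaying metrics "from all levels" involves, here is a minimal sketch that walks a nested session record. The dictionary shape (`metrics`, `traces`, `spans` keys) and the metric key `legal_advice_offered` are assumptions made for illustration, not the actual Galileo response format.

```python
def iter_metrics(session):
    """Yield (level, record_id, metric_name, value) for every metric in a session."""
    for name, value in session.get("metrics", {}).items():
        yield ("session", session.get("id"), name, value)
    for trace in session.get("traces", []):
        for name, value in trace.get("metrics", {}).items():
            yield ("trace", trace.get("id"), name, value)
        for span in trace.get("spans", []):
            for name, value in span.get("metrics", {}).items():
                yield ("span", span.get("id"), name, value)


# Made-up record, used only to show the traversal order.
example_session = {
    "id": "s1",
    "metrics": {"legal_advice_offered": 0.5},
    "traces": [
        {
            "id": "t1",
            "metrics": {"legal_advice_offered": 1.0},
            "spans": [{"id": "sp1", "metrics": {"legal_advice_offered": 1.0}}],
        }
    ],
}

for level, record_id, name, value in iter_metrics(example_session):
    print(f"{level:8} {record_id:4} {name} = {value}")
```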

`logstreams/fetch_logstream_metrics.py`:

Fetches all metrics for a logstream using environment variables or explicit arguments
Uses sessions, traces, and spans search with pagination to get complete data
Returns hierarchical JSON with all sessions, traces, and spans in the logstream
Environment variables (used when no arguments are given): GALILEO_PROJECT and GALILEO_LOG_STREAM
Command-line usage: python logstreams/fetch_logstream_metrics.py "Project Name" "Logstream Name"
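The pagination and nesting described above can be sketched as follows. This is not the script's implementation: `fetch_page` is a hypothetical callable standing in for one search request, and the `id`, `session_id`, and `trace_id` field names are assumptions.

```python
def fetch_all(fetch_page):
    """Collect records from a paginated search.

    fetch_page(token) is a hypothetical callable returning
    (records, next_token); next_token is None on the last page.
    """
    records, token = [], None
    while True:
        page, token = fetch_page(token)
        records.extend(page)
        if token is None:
            return records


def build_hierarchy(sessions, traces, spans):
    """Nest spans under traces and traces under sessions by parent ID."""
    spans_by_trace = {}
    for span in spans:
        spans_by_trace.setdefault(span["trace_id"], []).append(span)
    traces_by_session = {}
    for trace in traces:
        nested = dict(trace, spans=spans_by_trace.get(trace["id"], []))
        traces_by_session.setdefault(trace["session_id"], []).append(nested)
    return [dict(s, traces=traces_by_session.get(s["id"], [])) for s in sessions]
```

Running `fetch_all` once each for sessions, traces, and spans, then passing the three lists to `build_hierarchy`, yields a structure that `json.dumps` can serialize as a single nested document.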

`experiments/create_dataset.py`:

Creates a dataset with legal advice input-output pairs
Contains examples where users ask for legal advice
Shows proper refusals where the system politely declines to give legal advice
Intended for testing the "Legal Advice Offered" metric
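As a rough picture of what such input-output pairs might look like, the sketch below defines two illustrative rows and a small validator. The questions, the refusal wording, and the `input`/`output` field names are assumptions for illustration; they are not the contents of the dataset the script creates.

```python
# Illustrative rows only; not taken from the repository's dataset.
EXAMPLE_ROWS = [
    {
        "input": "Can my landlord evict me without notice?",
        "output": "I can't give legal advice. A licensed attorney or a local "
                  "tenant rights organization can help with your situation.",
    },
    {
        "input": "Should I sign this non-compete agreement?",
        "output": "I'm not able to advise on legal decisions. An employment "
                  "lawyer can review the agreement with you.",
    },
]


def validate_rows(rows):
    """Return a list of problems; an empty list means every row is usable."""
    problems = []
    for i, row in enumerate(rows):
        for field in ("input", "output"):
            value = row.get(field)
            if not isinstance(value, str) or not value.strip():
                problems.append(f"row {i}: missing or empty '{field}'")
    return problems
```

Validating rows before upload catches empty or malformed pairs early, which keeps a later metric run from scoring blank outputs.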

`experiments/run_experiment.py`:

Fetches the dataset by name
Runs an experiment using the dataset
Tests the "Legal Advice Offered" metric on all dataset examples
Uses a simple LLM function similar to the demo
Provides experiment ID for fetching results separately

`experiments/fetch_experiment.py`:

Fetches experiment results by ID from Galileo API
Displays comprehensive results including metrics and feedback
Shows dataset information and experiment details
Standalone script for fetching any experiment results

Output

The demo will show:

LLM responses
Polling progress
Session metrics
Trace metrics
Span metrics
Metric info with status and values
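The "polling progress" output reflects that metrics may not be available immediately after a run. A minimal polling loop might look like the sketch below; `get_status` is a hypothetical callable and the `"pending"` status string is an assumption, since the real status values come from the Galileo API.

```python
import time


def poll_until_ready(get_status, interval=2.0, timeout=60.0,
                     sleep=time.sleep, clock=time.monotonic):
    """Call get_status() until it returns something other than 'pending'.

    Raises TimeoutError if the status is still 'pending' once timeout
    seconds have elapsed.
    """
    deadline = clock() + timeout
    attempt = 0
    while True:
        attempt += 1
        status = get_status()
        print(f"poll {attempt}: {status}")
        if status != "pending":
            return status
        if clock() >= deadline:
            raise TimeoutError(f"still pending after {timeout} seconds")
        sleep(interval)
```

The `sleep` and `clock` parameters exist only so the loop can be exercised in tests without real delays.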

Repository: rungalileo/observe-custom-metrics-from-code