Save and load UQ Ensemble by virenbajaj · Pull Request #61 · cvs-health/uqlm

virenbajaj · 2025-06-17T03:51:19Z

Add Save/Load Configuration Functionality to UQEnsemble

Summary

This PR adds the ability to save and load UQEnsemble configurations, enabling users to persist tuned weights, thresholds, and component configurations for reproducible experiments and model deployment.

Changes Made

New Methods Added

save_llm_config(): Helper function to extract and serialize LLM configuration parameters
load_llm_config(): Helper function to recreate LLM instances from saved configuration
save_config(): Instance method to save ensemble configuration to JSON file
load_config(): Class method to create UQEnsemble instance from saved configuration

Key Features

Minimal Configuration Storage: Saves only essential parameters (weights, threshold, components, LLM configs)
Mixed Component Support: Handles both named string scorers and LLM instance scorers
LLM Serialization: Automatically extracts and saves LLM parameters (model, temperature, etc.)
Flexible Loading: Supports loading with or without providing new LLM instances
Secure Parameter Handling: Excludes internal LangChain attributes and endpoint parameters from saved configs

Parameter Exclusion

The system excludes the following from saved configurations:

Internal attributes: Defined in internal_attrs set (e.g., config_specs, lc_attributes, model_config)
Endpoint parameters: Defined in endpoint_attrs set (e.g., base_url, endpoint, azure_endpoint, api_base)

Environment Variables

API credentials must be set in environment variables for the load to work properly. The LLM modules automatically read standard environment variable names such as:

AZURE_OPENAI_API_KEY, AZURE_OPENAI_ENDPOINT for Azure OpenAI
GOOGLE_APPLICATION_CREDENTIALS for Google Vertex AI

Configuration Format

The saved JSON includes:

weights: Ensemble component weights (typically from tuning)
thresh: Decision threshold
components: List of scorer components (named strings or serialized LLM references)
llm_config: Main LLM configuration for recreation
llm_scorers: Configurations for LLM-based scorer components

Example Configuration

{
  "weights": [
    0.22517161424882978,
    0.5256472050407895,
    0.20544462057286958,
    0.04343460765924121,
    0.00030195247826969915
  ],
  "thresh": 0.85,
  "components": [
    "exact_match",
    "noncontradiction",
    "normalized_probability",
    "judge_1",
    "judge_2"
  ],
  "llm_config": {
    "class_name": "ChatVertexAI",
    "module": "langchain_google_vertexai.chat_models",
    "convert_system_message_to_human": false,
    "default_metadata": [],
    "endpoint_version": "v1beta1",
    "full_model_name": "projects/anbc-dev-csr-va/locations/us-central1/publishers/google/models/gemini-1.5-flash",
    "location": "us-central1",
    "logprobs": true,
    "max_retries": 6,
    "model_family": "2",
    "model_name": "gemini-1.5-flash",
    "n": 1,
    "perform_literal_eval_on_string_raw_content": true,
    "project": "anbc-dev-csr-va",
    "request_parallelism": 5,
    "streaming": false,
    "verbose": false
  },
  "llm_scorers": {
    "judge_1": {
      "class_name": "ChatVertexAI",
      "module": "langchain_google_vertexai.chat_models",
      "convert_system_message_to_human": false,
      "default_metadata": [],
      "endpoint_version": "v1beta1",
      "full_model_name": "projects/anbc-dev-csr-va/locations/us-central1/publishers/google/models/gemini-1.5-flash",
      "location": "us-central1",
      "logprobs": true,
      "max_retries": 6,
      "model_family": "2",
      "model_name": "gemini-1.5-flash",
      "n": 1,
      "perform_literal_eval_on_string_raw_content": true,
      "project": "anbc-dev-csr-va",
      "request_parallelism": 5,
      "streaming": false,
      "verbose": false
    },
    "judge_2": {
      "class_name": "AzureChatOpenAI",
      "module": "langchain_openai.chat_models.azure",
      "deployment_name": "gpt-4o",
      "model_version": "",
      "openai_api_type": "azure",
      "openai_api_version": "2024-02-15-preview",
      "streaming": false,
      "temperature": 1.0,
      "verbose": false
    }
  }
}

Usage

# Save configuration
ensemble.save_config("my_ensemble_config.json")

# Load configuration (requires environment variables to be set)
loaded_ensemble = UQEnsemble.load_config("my_ensemble_config.json")

# Load with provided LLM instance
loaded_ensemble = UQEnsemble.load_config("my_ensemble_config.json", llm=new_llm)

Benefits

Security: Sensitive credentials are never stored in configuration files
Reproducibility: Save tuned configurations for consistent results across experiments
Deployment: Easy model deployment with optimized weights and thresholds
Collaboration: Share tuned ensemble configurations between team members
Version Control: Track ensemble configurations alongside code changes

Technical Notes

Only serializes parameters that can be reconstructed (no internal state)
Requires same LLM libraries to be available when loading
Validates component compatibility during loading
Uses dynamic imports to recreate LLM instances from class information
Security: API keys and endpoint parameters are excluded from saved configs
Error Handling: Clear error messages when credentials are missing

Dependencies

Existing langchain dependencies for LLM recreation

dylanbouchard · 2025-06-22T20:38:05Z

+                            llm_params[param_name] = env_value
+                        else:
+                            # Try common environment variable names as fallback
+                            common_env_vars = {


As discussed, let's remove storing/loading of sensitive credentials and rely on proper naming conventions in environment variables.

dylanbouchard · 2025-06-22T20:39:09Z

-    "    azure_endpoint=os.getenv(\"API_BASE\"),\n",
-    "    openai_api_type=os.getenv(\"API_TYPE\"),\n",
-    "    openai_api_version=os.getenv(\"API_VERSION\"),\n",
+    "    deployment_name=os.getenv(\"AZURECHATOPENAI_DEPLOYMENT_NAME\"),\n",


Let's update the naming convention for .env files so we don't have to directly specify these in the constructor. Ideally we can have something like this in all notebooks instead:

# Use of AzureChatOpenAI requires .env file be populated with AZURE_OPENAI_API_KEY, AZURE_OPENAI_ENDPOINT from dotenv import load_dotenv, find_dotenv load_dotenv(find_dotenv()) from langchain_openai import AzureChatOpenAI llm = AzureChatOpenAI( deployment_name="gpt-4o", openai_api_version='2024-02-15-preview', openai_api_type='azure', temperature=1, # User to set temperature )

dylanbouchard

Amazing work, Viren!

Viren Bajaj added 2 commits June 17, 2025 03:49

WIP save and load uqe

9a590d5

with tests

0e85671

dylanbouchard linked an issue Jun 18, 2025 that may be closed by this pull request

Improved UQEnsemble storing #2

Closed

dylanbouchard marked this pull request as ready for review June 18, 2025 14:02

mohitcek approved these changes Jun 18, 2025

View reviewed changes

Viren Bajaj added 6 commits June 18, 2025 16:34

Merge branch 'develop'

9bcf23a

format with ruff

053175c

add env variable support and test

f8cf174

update env var names

000f9f8

add dot env to dependencies

bf0b2fd

ruff format

25dd47a

dylanbouchard self-requested a review June 22, 2025 20:37

dylanbouchard requested changes Jun 22, 2025

View reviewed changes

Viren Bajaj and others added 10 commits June 25, 2025 15:05

simplified llm config save and load

061c3b9

reset to old commit

52ba5e0

Merge branch 'main' of github.com:virenbajaj/uqlm

138a496

Merge branch 'vb/save_load_config'

fbd00a9

change llm env args

75c348f

Merge branch 'develop' into main

381ee7a

remove dotenv dependency

7a8cbc3

remove endpoints from

933023b

remove endpoints from config - tests

bc7fb5e

format

7c06535

dylanbouchard approved these changes Jun 25, 2025

View reviewed changes

dylanbouchard merged commit c1089e0 into cvs-health:develop Jun 25, 2025
24 of 25 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Save and load UQ Ensemble#61

Save and load UQ Ensemble#61
dylanbouchard merged 18 commits into
cvs-health:developfrom
virenbajaj:main

virenbajaj commented Jun 17, 2025 •

edited

Loading

Uh oh!

dylanbouchard Jun 22, 2025

Uh oh!

dylanbouchard Jun 22, 2025

Uh oh!

dylanbouchard left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

virenbajaj commented Jun 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Add Save/Load Configuration Functionality to UQEnsemble

Summary

Changes Made

New Methods Added

Key Features

Parameter Exclusion

Environment Variables

Configuration Format

Example Configuration

Usage

Benefits

Technical Notes

Dependencies

Uh oh!

dylanbouchard Jun 22, 2025

Choose a reason for hiding this comment

Uh oh!

dylanbouchard Jun 22, 2025

Choose a reason for hiding this comment

Uh oh!

dylanbouchard left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

virenbajaj commented Jun 17, 2025 •

edited

Loading