Save and load UQ Ensemble [Viren] by dylanbouchard · Pull Request #72 · cvs-health/uqlm

dylanbouchard · 2025-06-25T18:35:50Z

Add Save/Load Configuration Functionality to UQEnsemble

Summary

This PR adds the ability to save and load UQEnsemble configurations, enabling users to persist tuned weights, thresholds, and component configurations for reproducible experiments and model deployment.

Changes Made

New Methods Added

save_llm_config(): Helper function to extract and serialize LLM configuration parameters
load_llm_config(): Helper function to recreate LLM instances from saved configuration
save_config(): Instance method to save ensemble configuration to JSON file
load_config(): Class method to create UQEnsemble instance from saved configuration

Key Features

Minimal Configuration Storage: Saves only essential parameters (weights, threshold, components, LLM configs)
Mixed Component Support: Handles both named string scorers and LLM instance scorers
LLM Serialization: Automatically extracts and saves LLM parameters (model, temperature, etc.)
Flexible Loading: Supports loading with or without providing new LLM instances
Secure Parameter Handling: Excludes internal LangChain attributes and endpoint parameters from saved configs

Parameter Exclusion

The system excludes the following from saved configurations:

Internal attributes: Defined in internal_attrs set (e.g., config_specs, lc_attributes, model_config)
Endpoint parameters: Defined in endpoint_attrs set (e.g., base_url, endpoint, azure_endpoint, api_base)

Environment Variables

API credentials must be set in environment variables for the load to work properly. The LLM modules automatically read standard environment variable names such as:

AZURE_OPENAI_API_KEY, AZURE_OPENAI_ENDPOINT for Azure OpenAI
GOOGLE_APPLICATION_CREDENTIALS for Google Vertex AI

Configuration Format

The saved JSON includes:

weights: Ensemble component weights (typically from tuning)
thresh: Decision threshold
components: List of scorer components (named strings or serialized LLM references)
llm_config: Main LLM configuration for recreation
llm_scorers: Configurations for LLM-based scorer components

Example Configuration

{
  "weights": [
    0.22517161424882978,
    0.5256472050407895,
    0.20544462057286958,
    0.04343460765924121,
    0.00030195247826969915
  ],
  "thresh": 0.85,
  "components": [
    "exact_match",
    "noncontradiction",
    "normalized_probability",
    "judge_1",
    "judge_2"
  ],
  "llm_config": {
    "class_name": "ChatVertexAI",
    "module": "langchain_google_vertexai.chat_models",
    "convert_system_message_to_human": false,
    "default_metadata": [],
    "endpoint_version": "v1beta1",
    "full_model_name": "projects/anbc-dev-csr-va/locations/us-central1/publishers/google/models/gemini-1.5-flash",
    "location": "us-central1",
    "logprobs": true,
    "max_retries": 6,
    "model_family": "2",
    "model_name": "gemini-1.5-flash",
    "n": 1,
    "perform_literal_eval_on_string_raw_content": true,
    "project": "anbc-dev-csr-va",
    "request_parallelism": 5,
    "streaming": false,
    "verbose": false
  },
  "llm_scorers": {
    "judge_1": {
      "class_name": "ChatVertexAI",
      "module": "langchain_google_vertexai.chat_models",
      "convert_system_message_to_human": false,
      "default_metadata": [],
      "endpoint_version": "v1beta1",
      "full_model_name": "projects/anbc-dev-csr-va/locations/us-central1/publishers/google/models/gemini-1.5-flash",
      "location": "us-central1",
      "logprobs": true,
      "max_retries": 6,
      "model_family": "2",
      "model_name": "gemini-1.5-flash",
      "n": 1,
      "perform_literal_eval_on_string_raw_content": true,
      "project": "anbc-dev-csr-va",
      "request_parallelism": 5,
      "streaming": false,
      "verbose": false
    },
    "judge_2": {
      "class_name": "AzureChatOpenAI",
      "module": "langchain_openai.chat_models.azure",
      "deployment_name": "gpt-4o",
      "model_version": "",
      "openai_api_type": "azure",
      "openai_api_version": "2024-02-15-preview",
      "streaming": false,
      "temperature": 1.0,
      "verbose": false
    }
  }
}

Usage

# Save configuration
ensemble.save_config("my_ensemble_config.json")

# Load configuration (requires environment variables to be set)
loaded_ensemble = UQEnsemble.load_config("my_ensemble_config.json")

# Load with provided LLM instance
loaded_ensemble = UQEnsemble.load_config("my_ensemble_config.json", llm=new_llm)

Benefits

Security: Sensitive credentials are never stored in configuration files
Reproducibility: Save tuned configurations for consistent results across experiments
Deployment: Easy model deployment with optimized weights and thresholds
Collaboration: Share tuned ensemble configurations between team members
Version Control: Track ensemble configurations alongside code changes

Technical Notes

Only serializes parameters that can be reconstructed (no internal state)
Requires same LLM libraries to be available when loading
Validates component compatibility during loading
Uses dynamic imports to recreate LLM instances from class information
Security: API keys and endpoint parameters are excluded from saved configs
Error Handling: Clear error messages when credentials are missing

Dependencies

Existing langchain dependencies for LLM recreation

…ren virenbajaj4@gmail.com * WIP save and load uqe * with tests * format with ruff * add env variable support and test * update env var names * add dot env to dependencies * ruff format * simplified llm config save and load * reset to old commit * change llm env args * remove dotenv dependency * remove endpoints from * remove endpoints from config - tests * format --------- Co-authored-by: Viren Bajaj <bajajv@aetna.com> Co-authored-by: Viren <virenbajaj4@gmail.com>

Viren Bajaj and others added 18 commits June 17, 2025 03:49

WIP save and load uqe

9a590d5

with tests

0e85671

Merge branch 'develop'

9bcf23a

format with ruff

053175c

add env variable support and test

f8cf174

update env var names

000f9f8

add dot env to dependencies

bf0b2fd

ruff format

25dd47a

simplified llm config save and load

061c3b9

reset to old commit

52ba5e0

Merge branch 'main' of github.com:virenbajaj/uqlm

138a496

Merge branch 'vb/save_load_config'

fbd00a9

change llm env args

75c348f

Merge branch 'develop' into main

381ee7a

remove dotenv dependency

7a8cbc3

remove endpoints from

933023b

remove endpoints from config - tests

bc7fb5e

format

7c06535

dylanbouchard merged commit 295380e into develop Jun 25, 2025
49 of 50 checks passed

dylanbouchard deleted the save_load_uqensemble branch June 25, 2025 18:49

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Save and load UQ Ensemble [Viren]#72

Save and load UQ Ensemble [Viren]#72
dylanbouchard merged 18 commits into
developfrom
save_load_uqensemble

dylanbouchard commented Jun 25, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

dylanbouchard commented Jun 25, 2025

Add Save/Load Configuration Functionality to UQEnsemble

Summary

Changes Made

New Methods Added

Key Features

Parameter Exclusion

Environment Variables

Configuration Format

Example Configuration

Usage

Benefits

Technical Notes

Dependencies

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants