LILO hash computation and stamping for data freshness (#4992) by ItsMrLin · Pull Request #4992 · facebook/Ax

ItsMrLin · 2026-03-06T21:26:11Z

Summary:

Add hash-based data freshness tracking for LILO (Language-in-the-Loop)
pairwise preference labels.

When LILOPairwiseMetric produces labels, it now stamps a SHA-256 hash of the
experiment's LILO inputs (metric data for input_metric_names + LLM messages)
onto the trial's _properties. If any of these inputs change (new data arrives,
data is updated, or the user modifies LLM messages), the hash changes,
indicating that existing LILO labels are stale.
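
To make the staleness idea concrete, here is a minimal sketch of how such an input hash could be computed. It is not the code from `ax/utils/common/hash_utils.py`: the function name `sketch_lilo_input_hash`, the row dictionaries, and the JSON-based canonicalization are all assumptions chosen for illustration. The only detail taken from the PR is that a SHA-256 digest covers the metric data and the LLM messages.

```python
import hashlib
import json


def sketch_lilo_input_hash(metric_rows, llm_messages):
    """Return a SHA-256 hex digest over metric rows and LLM messages.

    Hypothetical stand-in for illustration; not the Ax implementation.
    """
    payload = {
        # Sort rows by their canonical JSON form so the digest does not
        # depend on the order in which rows happen to arrive.
        "data": sorted(metric_rows, key=lambda r: json.dumps(r, sort_keys=True)),
        # Message order is kept, since reordering a conversation changes it.
        "messages": list(llm_messages),
    }
    encoded = json.dumps(payload, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(encoded.encode("utf-8")).hexdigest()


# Made-up example inputs (assumed shape, not Ax's data format).
rows = [
    {"trial_index": 0, "arm_name": "0_0", "metric_name": "latency", "mean": 1.5},
    {"trial_index": 1, "arm_name": "1_0", "metric_name": "latency", "mean": 1.2},
]
messages = ["Prefer lower latency."]

baseline = sketch_lilo_input_hash(rows, messages)
# Same inputs in a different row order: digest is unchanged.
same_inputs = sketch_lilo_input_hash(list(reversed(rows)), messages)
# An added LLM message changes the digest.
new_message = sketch_lilo_input_hash(rows, messages + ["Ignore outliers."])
# An updated metric value changes the digest.
updated_rows = [dict(rows[0], mean=1.6), rows[1]]
new_data = sketch_lilo_input_hash(updated_rows, messages)
```

Serializing with `sort_keys=True` and fixed separators is one way to get the determinism that the PR's tests check for; the actual helper may canonicalize its inputs differently.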

Changes:

- Add `LILO_INPUT_HASH` key to `Keys` enum in `constants.py`
- Create `ax/utils/common/hash_utils.py` with `compute_lilo_input_hash` (standalone hash function) and `get_current_lilo_hash` (convenience helper that looks up the pairwise `DerivedMetric` on an experiment, extracts `input_metric_names`, and computes the hash; returns `None` if no pairwise metric is registered)
- Stamp hash in `LILOPairwiseMetric._compute_derived_values` after producing labels
- Add tests for hash determinism, sensitivity to data/message changes, stamping, and the `get_current_lilo_hash` helper
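
The stamping step and a later freshness check might look roughly like the sketch below. Everything here is hypothetical: `SketchTrial`, `stamp_hash`, `labels_are_stale`, and the key string are illustrative stand-ins, and the PR description does not say how consumers compare the stamped hash with the current one. Treating a `None` current hash as "nothing to compare" is likewise an assumption, based only on the note that the helper returns `None` when no pairwise metric is registered.

```python
from typing import Optional

# Hypothetical key string; the real value of Keys.LILO_INPUT_HASH is not
# given in the PR description.
LILO_INPUT_HASH = "lilo_input_hash"


class SketchTrial:
    """Toy stand-in for a trial that carries a _properties dict."""

    def __init__(self) -> None:
        self._properties = {}


def stamp_hash(trial: SketchTrial, current_hash: str) -> None:
    """Record the hash of the inputs that the labels were produced from."""
    trial._properties[LILO_INPUT_HASH] = current_hash


def labels_are_stale(trial: SketchTrial, current_hash: Optional[str]) -> bool:
    """Return True when the stamped hash differs from the current one."""
    if current_hash is None:
        # Assumption: without a pairwise metric there is nothing to compare.
        return False
    return trial._properties.get(LILO_INPUT_HASH) != current_hash


trial = SketchTrial()
stamp_hash(trial, "abc123")  # labels produced from inputs hashing to "abc123"

fresh = labels_are_stale(trial, "abc123")  # inputs unchanged
stale = labels_are_stale(trial, "def456")  # inputs changed since stamping
never_stamped = labels_are_stale(SketchTrial(), "abc123")  # no stamp present
```

In this sketch a trial that was never stamped counts as stale, which errs toward relabeling; whether Ax makes the same choice is not stated in the PR.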

Reviewed By: saitcakmak

Differential Revision: D95284287

meta-codesync · 2026-03-06T21:26:21Z

@ItsMrLin has exported this pull request. If you are a Meta employee, you can view the originating Diff in D95284287.

codecov-commenter · 2026-03-06T21:59:27Z

Codecov Report

❌ Patch coverage is 3.70370% with 26 lines in your changes missing coverage. Please review.
✅ Project coverage is 96.80%. Comparing base (1e48d0f) to head (4bd8cb2).

| Files with missing lines | Patch % | Lines |
| --- | --- | --- |
| ax/utils/common/hash_utils.py | 0.00% | 26 Missing ⚠️ |

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #4992      +/-   ##
==========================================
- Coverage   96.84%   96.80%   -0.04%     
==========================================
  Files         604      605       +1     
  Lines       65022    65049      +27     
==========================================
+ Hits        62971    62973       +2     
- Misses       2051     2076      +25

meta-codesync · 2026-03-13T20:36:40Z

This pull request has been merged in 72d145c.

meta-cla Bot added the CLA Signed label Mar 6, 2026

meta-codesync Bot added the fb-exported and meta-exported labels Mar 6, 2026

ItsMrLin force-pushed the export-D95284287 branch from 3dab843 to 83ae087 Compare March 9, 2026 04:10

ItsMrLin force-pushed the export-D95284287 branch from 83ae087 to a3a52f3 Compare March 9, 2026 04:12

ItsMrLin force-pushed the export-D95284287 branch from a3a52f3 to 942b0f9 Compare March 9, 2026 04:14

ItsMrLin force-pushed the export-D95284287 branch from 942b0f9 to 03cb43d Compare March 9, 2026 16:54

meta-codesync Bot changed the title ~~LILO hash computation and stamping for data freshness~~ LILO hash computation and stamping for data freshness (#4992) Mar 13, 2026

ItsMrLin force-pushed the export-D95284287 branch from 03cb43d to 7c3c852 Compare March 13, 2026 02:05

ItsMrLin force-pushed the export-D95284287 branch from 7c3c852 to 836fa17 Compare March 13, 2026 02:08

ItsMrLin force-pushed the export-D95284287 branch from 836fa17 to 53dc52d Compare March 13, 2026 17:13

ItsMrLin force-pushed the export-D95284287 branch from 53dc52d to b2487a0 Compare March 13, 2026 17:13

ItsMrLin force-pushed the export-D95284287 branch from b2487a0 to 4bd8cb2 Compare March 13, 2026 17:16

meta-codesync Bot closed this in 72d145c Mar 13, 2026

facebook-github-tools Bot added the Merged label Mar 13, 2026

ItsMrLin wants to merge 1 commit into facebook:main from ItsMrLin:export-D95284287