feat(sdk): speculative cloud loader decision layer by theGlenn · Pull Request #250 · xybrid-ai/xybrid

theGlenn · 2026-06-09T18:34:30Z

Summary

Adds the loader-side decision layer for speculative cloud fallback: a process-global set_speculative_cloud / is_speculative_cloud_enabled toggle, a per-load ModelLoader::with_speculative_cloud override, and will_speculate, which gates on speculation being enabled, a resolvable cloud API key, and the model not already being cached locally (via the pure speculative_gate helper). A per-load override beats the global default, and non-registry sources (bundle/directory/HuggingFace) never speculate. This is the loader half of the feature; the cloud-execution routing and background download are tracked separately.

Type of Change

New feature

Checklist

Tests pass (cargo test)
Code follows project style (see RUST_GUIDE.md)
Documentation updated (if applicable)
No breaking changes (or breaking changes documented)

Add set_speculative_cloud / is_speculative_cloud_enabled global toggle and per-load ModelLoader::with_speculative_cloud override. will_speculate gates on enabled + resolvable cloud key + model not already cached locally, via a pure speculative_gate helper. Override beats the process-global default; non-registry sources never speculate.

…e-cloud-fallback

vercel · 2026-06-09T18:34:36Z

The latest updates on your projects. Learn more about Vercel for GitHub.

Project	Deployment	Actions	Updated (UTC)
xybrid	Ready	Preview, Comment	Jun 13, 2026 1:39pm

gemini-code-assist

Code Review

This pull request introduces a speculative cloud fallback feature to the SDK, allowing models to be served from the cloud gateway while downloading locally. The reviewer provided valuable feedback on optimizing performance: short-circuiting evaluation in will_speculate to avoid unnecessary disk I/O and environment lookups, using CacheManager directly instead of instantiating a heavy RegistryClient in is_extracted_locally, and leveraging crate::has_api_key() instead of constructing a full CloudConfig in cloud_api_key_present.

Important

The consumer version of Gemini Code Assist on GitHub is being sunset. Starting June 18, 2026, new organization installations will be blocked, and all code review activity will officially cease on July 17, 2026.
For more details on the timeline and next steps, please review the Help Documentation.

gemini-code-assist · 2026-06-09T18:35:38Z

+    pub fn will_speculate(&self) -> bool {
+        speculative_gate(
+            self.speculative_enabled(),
+            cloud_api_key_present(),
+            self.is_extracted_locally(),
+        )
+    }


The current implementation of will_speculate passes the results of self.speculative_enabled(), cloud_api_key_present(), and self.is_extracted_locally() to speculative_gate. Because Rust uses eager evaluation for function arguments, this means all three checks are always executed, even when speculation is disabled (which is the default).

This results in unnecessary environment variable lookups and disk I/O (via is_extracted_locally checking the local cache directory) on every call to will_speculate.

Using short-circuiting boolean operators (&&) ensures that we only perform the API key check and disk I/O when speculation is actually enabled.

pub fn will_speculate(&self) -> bool { self.speculative_enabled() && cloud_api_key_present() && !self.is_extracted_locally() }

Done in 2a3e3ef — will_speculate now short-circuits with &&, so the key lookup and the local-cache disk check only run when speculation is enabled. Inlined the gate and dropped the speculative_gate helper.

gemini-code-assist · 2026-06-09T18:35:38Z

+    fn is_extracted_locally(&self) -> bool {
+        match &self.source {
+            ModelSource::Registry { id, .. } => {
+                RegistryClient::from_env().is_ok_and(|client| client.is_extracted(id))
+            }
+            _ => true,
+        }
+    }


Instantiating RegistryClient::from_env() just to check if a model is extracted locally is extremely heavy. RegistryClient::from_env() creates a new HTTP agent (ureq::Agent) with connection/thread pools, allocates circuit breakers, and queries environment variables, none of which are needed for a local cache check.

Since RegistryClient::is_extracted simply forwards the call to CacheManager::is_extracted, we can instantiate CacheManager directly to avoid the heavy overhead of creating an HTTP client.

Suggested change

fn is_extracted_locally(&self) -> bool {

match &self.source {

ModelSource::Registry { id, .. } => {

RegistryClient::from_env().is_ok_and(|client| client.is_extracted(id))

}

_ => true,

}

}

fn is_extracted_locally(&self) -> bool {

match &self.source {

ModelSource::Registry { id, .. } => {

crate::cache::CacheManager::new().is_ok_and(|cache| cache.is_extracted(id))

}

_ => true,

}

}

Done in 2a3e3ef — is_extracted_locally now uses CacheManager::new() directly instead of RegistryClient::from_env(), avoiding the HTTP agent and circuit-breaker allocation for a pure local cache check.

gemini-code-assist · 2026-06-09T18:35:38Z

+fn cloud_api_key_present() -> bool {
+    xybrid_core::cloud::CloudConfig::default()
+        .resolve_api_key()
+        .is_some()
 }


Using xybrid_core::cloud::CloudConfig::default() to resolve the API key is inefficient because CloudConfig::default() queries unrelated environment variables (like XYBRID_GATEWAY_URL and XYBRID_PLATFORM_URL) to build the default gateway URL.

Since crate::has_api_key() is already defined in the SDK and performs the exact same check (checking the in-memory key and falling back to the XYBRID_API_KEY environment variable) without constructing a CloudConfig, we can use it here to avoid unnecessary environment lookups.

fn cloud_api_key_present() -> bool { crate::has_api_key() }

Done in 2a3e3ef — switched to crate::has_api_key(), which checks the in-memory key and XYBRID_API_KEY without constructing a CloudConfig or resolving the gateway URL.

Address review: resolve the cloud key via has_api_key() instead of constructing a CloudConfig; check the local cache with CacheManager directly instead of spinning up a RegistryClient HTTP agent; and short-circuit will_speculate so the key lookup and disk check are skipped when speculation is disabled. Inline the gate and drop the now-redundant speculative_gate helper.

…e-cloud-fallback

theGlenn added 2 commits June 9, 2026 19:06

Merge remote-tracking branch 'origin/master' into theGlenn/speculativ…

fdb8b25

…e-cloud-fallback

gemini-code-assist Bot reviewed Jun 9, 2026

View reviewed changes

theGlenn added 3 commits June 9, 2026 19:46

Merge remote-tracking branch 'origin/master' into theGlenn/speculativ…

39ffbe9

…e-cloud-fallback

Merge remote-tracking branch 'origin/master' into theGlenn/speculativ…

ca4af53

…e-cloud-fallback

vercel Bot deployed to Preview June 13, 2026 13:39 View deployment

theGlenn merged commit 62bcce5 into master Jun 14, 2026
22 checks passed

theGlenn deleted the theGlenn/speculative-cloud-fallback branch June 15, 2026 03:51

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(sdk): speculative cloud loader decision layer#250

feat(sdk): speculative cloud loader decision layer#250
theGlenn merged 5 commits into
masterfrom
theGlenn/speculative-cloud-fallback

theGlenn commented Jun 9, 2026

Uh oh!

vercel Bot commented Jun 9, 2026 •

edited

Loading

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot Jun 9, 2026

Uh oh!

theGlenn Jun 9, 2026

Uh oh!

gemini-code-assist Bot Jun 9, 2026

Uh oh!

theGlenn Jun 9, 2026

Uh oh!

gemini-code-assist Bot Jun 9, 2026

Uh oh!

theGlenn Jun 9, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

theGlenn commented Jun 9, 2026

Summary

Type of Change

Checklist

Uh oh!

vercel Bot commented Jun 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot Jun 9, 2026

Choose a reason for hiding this comment

Uh oh!

theGlenn Jun 9, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot Jun 9, 2026

Choose a reason for hiding this comment

Uh oh!

theGlenn Jun 9, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot Jun 9, 2026

Choose a reason for hiding this comment

Uh oh!

theGlenn Jun 9, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

vercel Bot commented Jun 9, 2026 •

edited

Loading