feat: add on-device LLM inference via LiteRT-LM by utzcoz · Pull Request #165 · aaif-goose/goose-mobile

utzcoz · 2026-04-30T14:32:05Z

Adds an on-device model provider that runs Gemma 4 E2B/E4B locally using Google's LiteRT-LM (NPU/GPU accelerated). Users can download models from a built-in registry, manage them in settings, and select on-device inference from onboarding or the provider dropdown — no API key required.

Changes:

- New ON_DEVICE_LITERT provider with LiteRTProviderHandler wired into Agent.callLlm and tool resolution
- Model registry loaded from assets/models_litert.json; downloads go through Android DownloadManager into app-private storage
- ModelManagementScreen for browsing, downloading, and deleting on-device models; integrated into onboarding LLM configuration
- OnDeviceModelManager scans for downloaded models on app startup (via GoslingApplication) so saved model identifiers resolve correctly across restarts/reinstalls
- On-device system prompt preserves installed apps, screen resolution, and user memories so models can use tools instead of guessing URLs
- Settings screen gracefully handles empty on-device model lists with a "No models downloaded" placeholder
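
For reviewers who want a picture of the startup scan before opening the diff, here is a minimal sketch of the idea in plain Kotlin. It is not the code in this PR: `RegistryEntry`, `scanDownloadedModels`, and `resolveSavedModel` are hypothetical names, the registry schema is assumed to carry only an identifier and a file name, and the directory is passed in rather than taken from an Android `Context`.

```kotlin
import java.io.File

// Hypothetical registry entry. The actual schema of assets/models_litert.json
// is not shown in this description; only an id and a file name are assumed.
data class RegistryEntry(val id: String, val fileName: String)

// Hypothetical stand-in for the scan run at app startup.
// Returns model id -> file for every registry entry whose file exists
// and is non-empty in modelsDir.
fun scanDownloadedModels(modelsDir: File, registry: List<RegistryEntry>): Map<String, File> {
    // listFiles() returns null when the directory is missing or unreadable.
    val present: Map<String, File> = modelsDir.listFiles()
        ?.filter { it.isFile && it.length() > 0 }
        ?.associateBy { it.name }
        ?: emptyMap()

    return registry.mapNotNull { entry ->
        present[entry.fileName]?.let { file -> entry.id to file }
    }.toMap()
}

// Resolves a saved model identifier against the scan result.
// Returns null when nothing was saved or the file is no longer on disk,
// which is the case where a "No models downloaded" placeholder would apply.
fun resolveSavedModel(savedId: String?, downloaded: Map<String, File>): File? =
    savedId?.let { downloaded[it] }
```

Scanning the directory on each startup, rather than trusting a persisted list, means a saved identifier can only resolve to a file that is actually present. Skipping zero-length files is an extra assumption of this sketch; it does not detect partially downloaded files.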

Inspired by the recent Google AI Edge Gallery and Gemma 4 E2B/E4B release.

utzcoz · 2026-04-30T15:36:54Z

Hi @michaelneale, PTAL.

utzcoz force-pushed the feat/on-device-model-support branch from 06343e4 to 2d1097d · April 30, 2026 15:16

utzcoz wants to merge 1 commit into aaif-goose:main from utzcoz:feat/on-device-model-support
