Add Local AI Integration Skill #70
Merged
danielholanda
requested changes
Jun 23, 2026
Co-authored-by: Daniel Holanda <holand.daniel@gmail.com>
danielholanda
approved these changes
Jun 23, 2026
Summary
This branch brings the local-ai-app-integration skill from an NPU-only prototype to a production-ready skill that works on any Windows x64
machine. The work spans the skill itself, its reference docs, the walkthrough, and a new behavioural test suite.
What changed and why
SKILL.md — major rewrite of the opinionated path
The original skill had two silent failure modes discovered during live execution:
the leading v. Fixed by querying the GitHub API for the asset by name pattern instead of hand-building the URL.
returns HTTP 200 with an empty result when weights aren't on disk — indistinguishable from a broken integration without logging. Fixed by making
POST /api/v1/pull an explicit required step before first inference, and explicitly documenting that /load should not be called at startup (its
request shape has changed across lemond versions).
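A minimal sketch of the two fixes above, not the skill's actual code. It assumes the release JSON has the shape returned by the GitHub REST API (an `assets` list whose entries carry `name` and `browser_download_url`). The sample release, the asset name pattern, and both function names are hypothetical. Network calls are left out so the logic can be read and tested on its own.

```python
import fnmatch

# Hypothetical release payload, trimmed to the fields the lookup needs.
SAMPLE_RELEASE = {
    "tag_name": "v1.2.3",
    "assets": [
        {"name": "example-server-1.2.3-x64.msi",
         "browser_download_url": "https://example.invalid/example-server-1.2.3-x64.msi"},
        {"name": "example-server-1.2.3.tar.gz",
         "browser_download_url": "https://example.invalid/example-server-1.2.3.tar.gz"},
    ],
}


def find_asset_url(release: dict, name_pattern: str) -> str:
    """Pick the asset by name pattern instead of building the URL from the tag."""
    for asset in release.get("assets", []):
        if fnmatch.fnmatchcase(asset["name"], name_pattern):
            return asset["browser_download_url"]
    raise LookupError(f"no release asset matches {name_pattern!r}")


def require_transcript(status_code: int, text: str) -> str:
    """Treat HTTP 200 with an empty result as a failure rather than a success."""
    if status_code != 200:
        raise RuntimeError(f"inference failed with HTTP {status_code}")
    if not text.strip():
        raise RuntimeError(
            "HTTP 200 with an empty result; were the weights pulled "
            "via POST /api/v1/pull before first inference?"
        )
    return text
```

Matching on the asset name keeps the lookup independent of how the tag is spelled, and the explicit empty-result check turns the silent case into an error that points at the missing pull step.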
Additional improvements:
install → model pull → first result) so silent failures are diagnosable
mid-transcription
can grab it in that window
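One way to make the install → model pull → first result stages diagnosable is to log the start and outcome of each stage. The sketch below is an assumption about how that could look, not the skill's actual logging; the logger name and the `stage` helper are hypothetical.

```python
import logging
from contextlib import contextmanager

logger = logging.getLogger("local_ai_skill")  # hypothetical logger name


@contextmanager
def stage(name: str):
    """Log when a stage starts, and whether it finished or failed."""
    logger.info("stage start: %s", name)
    try:
        yield
    except Exception:
        # Log with traceback, then re-raise so the caller still sees the error.
        logger.exception("stage failed: %s", name)
        raise
    else:
        logger.info("stage done: %s", name)
```

Wrapping each step, for example `with stage("model pull"): ...`, means a log that stops after a "stage start" line shows where execution stalled.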
reference.md — corrected STT backend matrix and pull vs. load clarification
The old table had these inverted and implied agents needed to manually pick between them.
is the only reliable signal.
speech-to-text.
walkthroughs/local-ai-app-integration.md — broadened hardware requirement
Removed the NPU-only prerequisite. The walkthrough now works on any Windows x64 PC, with a hardware priority table showing what each tier gets
(NPU → iGPU → CPU). All steps are identical regardless of hardware. Also replaced the vague "clone and move files" install instructions with
copy-pasteable bash and PowerShell commands.
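The hardware priority order (NPU → iGPU → CPU) can be sketched as a first-match selection. This is an illustration only: how the available hardware is detected is out of scope, and the `pick_tier` name and tier labels are hypothetical.

```python
from typing import Iterable

# Priority order taken from the walkthrough: NPU first, then iGPU, then CPU.
PRIORITY = ("NPU", "iGPU", "CPU")


def pick_tier(available: Iterable[str]) -> str:
    """Return the highest-priority tier present in `available`."""
    present = set(available)
    for tier in PRIORITY:
        if tier in present:
            return tier
    raise ValueError("no supported hardware tier available")
```

For example, `pick_tier({"CPU", "iGPU"})` returns `"iGPU"`, and a machine with only a CPU falls through to `"CPU"`.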
tests/test_local_ai_app_integration.py — new behavioural test suite
Added 4 behavioural tests that run the skill end-to-end against the dictate app and assert:
Plus sanity checks on skill structure (SKILL.md present, frontmatter valid, checklist items present).
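The structural sanity checks could look roughly like the sketch below. It is not the code in tests/test_local_ai_app_integration.py. It assumes that "frontmatter valid" means a block opened and closed by `---` lines and that checklist items use Markdown task-list syntax; a real check would likely also parse the frontmatter. The `check_skill_structure` name is hypothetical.

```python
from pathlib import Path
from typing import List


def check_skill_structure(skill_dir: Path) -> List[str]:
    """Return a list of structural problems; an empty list means all checks passed."""
    skill_md = skill_dir / "SKILL.md"
    if not skill_md.is_file():
        return ["SKILL.md is missing"]

    lines = skill_md.read_text(encoding="utf-8").splitlines()
    problems = []

    # Assumed frontmatter shape: first line is '---' and a later line closes it.
    if not lines or lines[0].strip() != "---":
        problems.append("frontmatter does not open with '---'")
    elif not any(line.strip() == "---" for line in lines[1:]):
        problems.append("frontmatter is never closed")

    # Assumed checklist syntax: Markdown task-list items.
    if not any(line.lstrip().startswith(("- [ ]", "- [x]")) for line in lines):
        problems.append("no checklist items found")

    return problems
```

Returning a list of problems rather than raising on the first one lets a single test run report every structural issue at once.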