feat(windows): add Windows cross-platform support by rodrigoluizs · Pull Request #401 · app-vox/vox

rodrigoluizs · 2026-03-14T12:18:13Z

Summary

Add cross-platform architecture with src/main/platform/ module (types, utils, dynamic loader)
Extract macOS paster and permissions into platform/darwin/
Implement Windows paster using user32.dll SendInput API via koffi FFI
Implement Windows permissions module (no permission gates on Windows)
Add finishWithPeriod support to platform PasteOptions
Platform-aware titleBarStyle and menu roles for Windows compatibility
Resolve whisper binary name on Windows (.exe extension)
Add postinstall-whisper.js script to download pre-built whisper.cpp binary for Windows
Add persistent whisper-server process on non-macOS to eliminate cold-start latency
Use 75% of CPU cores for whisper inference on CPU-only platforms
Skip language auto-detection on CPU-only platforms to halve transcription latency
Add Windows build job to release CI workflow
Rewire all consumers (app, ipc, shortcuts) to use platform modules

Test plan

App starts on Windows with no errors
Verify whisper transcription works end-to-end on Windows
Test paste via SendInput (Ctrl+V simulation) on Windows
Verify tray, HUD, and shortcuts work on Windows
Confirm macOS functionality is unchanged (regression)
Run npm run typecheck, npm run lint, npx vitest run
Test Windows release build via CI

github-actions · 2026-03-14T12:22:08Z

CI Summary

Check	Status
Typecheck	✅ Passed
Lint	✅ Passed
Lint CSS	✅ Passed
Design Tokens	✅ Passed
Test	✅ Passed
Build	✅ Passed

Run #962

github-actions · 2026-03-14T12:25:05Z

✅MegaLinter analysis: Success

Descriptor	Linter	Files	Errors	Warnings	Elapsed time
✅ JSON	jsonlint	1	0	0	0.1s
✅ JSON	npm-package-json-lint	yes	no	no	1.18s
✅ JSON	prettier	1	0	0	0.32s
✅ JSON	v8r	1	0	0	7.75s
✅ REPOSITORY	checkov	yes	no	no	25.42s
✅ REPOSITORY	devskim	yes	no	no	2.3s
✅ REPOSITORY	dustilock	yes	no	no	1.62s
✅ REPOSITORY	gitleaks	yes	no	no	6.77s
✅ REPOSITORY	git_diff	yes	no	no	0.32s
✅ REPOSITORY	grype	yes	no	no	45.58s
✅ REPOSITORY	kics	yes	no	no	3.52s
✅ REPOSITORY	kingfisher	yes	no	no	5.45s
✅ REPOSITORY	secretlint	yes	no	no	4.08s
✅ REPOSITORY	syft	yes	no	no	5.93s
✅ REPOSITORY	trivy	yes	no	no	9.88s
✅ REPOSITORY	trivy-sbom	yes	no	no	3.23s
✅ REPOSITORY	trufflehog	yes	no	no	4.16s
✅ YAML	prettier	1	0	0	0.41s
✅ YAML	v8r	1	0	0	3.25s
✅ YAML	yamllint	1	0	0	0.51s

See detailed reports in MegaLinter artifacts
Set VALIDATE_ALL_CODEBASE: true in mega-linter.yml to validate all sources, not only the diff

Show us your support by starring ⭐ the repository

…, and loader

…issions.ts

- app.ts, ipc.ts, shortcuts/manager.ts now import from platform/ - Removed darwin-only guard from launch-at-login - Deleted src/main/input/paster.ts (replaced by platform modules) - Platform loader uses lazy Proxy to avoid test-time require issues

Dynamic template-string require (`./\${os}/paster`) can't be resolved by Vite/Rollup in bundled builds. Use conditional static string paths so the bundler includes both platform modules in the bundle.

Replace dynamic require() with static ES imports so Vite/Rollup can resolve both platform modules at build time. Both modules defer native library loads to function call time, making it safe to bundle both.

- Add electron-builder win/nsis config to package.json - Add build-windows job on windows-latest runner - Generate icon.ico from logo.png via ImageMagick in CI - Update release job to download and publish both platforms - Windows code signing via CSC_LINK/CSC_KEY_PASSWORD (optional)

MinGW GCC 15.2 generates AVX code with incorrect stack alignment on Windows, causing segfaults during whisper inference. The official MSVC-compiled release binaries handle AVX correctly. - Add cross-platform postinstall script that downloads pre-built whisper.cpp v1.8.3 binary on Windows (macOS still compiles from source) - Update whisper.ts to use whisper-cli.exe on Windows (the v1.8.3 binary name) - Add scripts/ to ESLint ignores (CJS postinstall script) Performance: 7.4s for 11s audio (vs 200s without AVX, vs segfault with MinGW AVX)

On macOS Metal GPU handles compute so thread count is less critical. On Windows (CPU-only), whisper defaults to 4 threads which wastes available cores. Using 75% of available cores nearly halves inference time on a 16-core system (7.6s -> 4.0s for 11s of audio).

Whisper auto-detection runs the encoder twice (detect + transcribe), doubling latency on CPU. On macOS Metal GPU this is negligible but on Windows it adds ~3s to every transcription. When the user has speech languages configured, use the first one directly instead of auto-detect. The LLM correction layer handles any cross-language artifacts. Before: 13.3s (auto, 4t) → After: ~4s (fixed lang, 12t)

Replace the spawn-per-transcription approach (whisper-cli.exe) with a persistent whisper-server.exe process on non-macOS platforms. The server loads the model once at app launch and accepts audio via HTTP POST on localhost, eliminating the ~500ms model-loading overhead on every transcription. Additionally switches from beam-search decoding (--best-of 5 --beam-size 5) to greedy decoding, saving ~600ms per transcription. The LLM correction layer compensates for any quality difference. - Add WhisperServer class with lifecycle management and HTTP-based readiness polling - Server auto-restarts when the user changes whisper models - Falls back to CLI mode if the server fails to start - macOS keeps the existing CLI approach (Metal GPU makes cold-start negligible) Benchmark (JFK 11s audio, small model, 12 threads): Before: ~4,500ms (CLI + beam search) After: ~3,300ms (server + greedy)

…d default Prefix unused httpRequestHandler variable with underscore to satisfy ESLint no-unused-vars rule. Update win32 paster test to account for finishWithPeriod defaulting to true (period kept unless explicitly false).

Replace stale input/paster mock with platform module mock so tests pass on Linux CI where no platform-specific paster is available.

codecov · 2026-03-14T19:59:23Z

Codecov Report

❌ Patch coverage is 41.03586% with 148 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
src/main/audio/whisper-server.ts	55.04%	43 Missing and 6 partials ⚠️
src/main/platform/win32/paster.ts	35.71%	42 Missing and 3 partials ⚠️
src/main/audio/whisper.ts	16.66%	18 Missing and 2 partials ⚠️
src/main/app.ts	0.00%	10 Missing ⚠️
src/main/platform/darwin/permissions.ts	0.00%	10 Missing ⚠️
src/main/ipc.ts	0.00%	5 Missing ⚠️
src/main/platform/index.ts	0.00%	4 Missing ⚠️
src/main/shortcuts/manager.ts	40.00%	3 Missing ⚠️
src/main/windows/home.ts	0.00%	0 Missing and 2 partials ⚠️

📢 Thoughts on this report? Let us know!

rodrigoluizs added feature New feature implementation platform:windows Windows specific labels Mar 14, 2026

rodrigoluizs self-assigned this Mar 14, 2026

rodrigoluizs force-pushed the feat/windows-cross-platform branch from 1c42f7a to 3d86011 Compare March 14, 2026 12:20

rodrigoluizs added 20 commits March 14, 2026 19:50

docs(plans): add Windows cross-platform support design

ce8da24

docs(plans): add Windows cross-platform implementation plan

7a6b0a4

chore: remove spec docs (moved to specs repo)

ee7c8e5

fix(whisper): resolve binary name on Windows (.exe extension)

8d6851b

refactor(platform): add platform module scaffolding with types, utils…

889641a

…, and loader

refactor(platform): extract macOS paster to platform/darwin/paster.ts

3923e16

refactor(platform): extract macOS permissions to platform/darwin/perm…

4fb8590

…issions.ts

feat(platform): add Windows paster with user32.dll keyboard simulation

03a584c

feat(platform): add Windows permissions module

426963f

fix(windows): platform-aware titleBarStyle and menu roles

2f4af03

fix(platform): remove unused shell import from app.ts

857e512

fix(platform): use static require paths for bundler compatibility

345f44c

Dynamic template-string require (`./\${os}/paster`) can't be resolved by Vite/Rollup in bundled builds. Use conditional static string paths so the bundler includes both platform modules in the bundle.

fix(platform): use static imports for Vite bundler compatibility

602dd67

Replace dynamic require() with static ES imports so Vite/Rollup can resolve both platform modules at build time. Both modules defer native library loads to function call time, making it safe to bundle both.

rodrigoluizs force-pushed the feat/windows-cross-platform branch from a6c6607 to 14ef49d Compare March 14, 2026 18:54

fix(tests): mock platform module in shortcut manager tests

7b74a4c

Replace stale input/paster mock with platform module mock so tests pass on Linux CI where no platform-specific paster is available.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(windows): add Windows cross-platform support#401

feat(windows): add Windows cross-platform support#401
rodrigoluizs wants to merge 21 commits intomainfrom
feat/windows-cross-platform

rodrigoluizs commented Mar 14, 2026

Uh oh!

github-actions bot commented Mar 14, 2026 •

edited

Loading

Uh oh!

github-actions bot commented Mar 14, 2026 •

edited

Loading

Uh oh!

codecov bot commented Mar 14, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

rodrigoluizs commented Mar 14, 2026

Summary

Test plan

Uh oh!

github-actions bot commented Mar 14, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

CI Summary

Uh oh!

github-actions bot commented Mar 14, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✅MegaLinter analysis: Success

Uh oh!

codecov bot commented Mar 14, 2026

Codecov Report

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

github-actions bot commented Mar 14, 2026 •

edited

Loading

github-actions bot commented Mar 14, 2026 •

edited

Loading