This sample demonstrates how to use Foundry Local for speech-to-text (audio transcription) using the Whisper model — entirely on-device, with no cloud services required.
## What this sample shows

- Loading the `whisper-tiny` model via the Foundry Local SDK
- Transcribing an audio file (`.wav`, `.mp3`, etc.) to text
- Both standard and streaming transcription modes
- Automatic hardware acceleration (NPU > GPU > CPU)
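The request itself can be sketched with nothing but Node 18 built-ins, since Foundry Local exposes an OpenAI-compatible REST API. This is a sketch, not the sample's actual source: the base URL, port, and model id below are placeholders (in the real sample they come from the SDK at runtime), and `pickAudioFile` is a small helper invented here to choose an input file.

```javascript
import { readFileSync, readdirSync } from "node:fs";

// Pick the first supported audio file from a list of names.
// (Pure helper, invented for this sketch.)
function pickAudioFile(names) {
  const supported = [".wav", ".mp3", ".m4a", ".flac"];
  const match = names.find((n) =>
    supported.some((ext) => n.toLowerCase().endsWith(ext))
  );
  return match ?? null;
}

// POST the audio to an OpenAI-compatible /audio/transcriptions route.
// baseUrl and modelId are placeholders: in the real sample both come from
// the Foundry Local SDK after it has discovered and loaded the model.
async function transcribe(baseUrl, modelId, audioPath) {
  const form = new FormData(); // FormData, Blob, fetch are global in Node 18+
  form.append("model", modelId);
  form.append("file", new Blob([readFileSync(audioPath)]), audioPath);
  const res = await fetch(`${baseUrl}/audio/transcriptions`, {
    method: "POST",
    body: form,
  });
  if (!res.ok) throw new Error(`Transcription failed: HTTP ${res.status}`);
  const { text } = await res.json();
  return text;
}

// Guarded so the file can be loaded without a running Foundry Local service.
if (process.argv.includes("--run")) {
  const file = pickAudioFile(readdirSync("."));
  if (!file) throw new Error("No audio file found in the project directory.");
  // The URL here is a placeholder; Foundry Local picks its port at runtime.
  transcribe("http://localhost:5273/v1", "whisper-tiny", file)
    .then((text) => console.log(text))
    .catch((err) => console.error(err));
}
```

In the sample itself the SDK supplies the endpoint and model id, so none of the placeholders need to be hard-coded.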
## Prerequisites

- Foundry Local installed on your machine
- Node.js 18+
## Get started

Install the Foundry Local SDK:

```bash
npm install foundry-local-sdk
```

Place an audio file (e.g., `recording.wav` or `recording.mp3`) in the project directory, then run:

```bash
node src/app.js
```

## How it works

The Foundry Local SDK handles everything:
- Model discovery — finds the best `whisper-tiny` variant for your hardware
- Model download — downloads the model if not already cached
- Model loading — loads the model into memory with optimized hardware acceleration
- Transcription — runs Whisper inference entirely on-device
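Condensed, the four steps above map onto a short script. This is a hedged sketch, not the sample's actual `src/app.js`: `FoundryLocalManager` with `init()`, `endpoint`, and `apiKey` follows the published `foundry-local-sdk` surface, but the `whisper-tiny` alias, the audio transcription route, and the `bestVariant` helper are assumptions made here. The npm packages are imported lazily so the file parses even without them installed.

```javascript
import { createReadStream } from "node:fs";

// Execution-provider preference illustrating "best variant" selection
// (NPU > GPU > CPU). Invented for this sketch; the SDK does this
// matching internally during model discovery.
function bestVariant(available) {
  for (const ep of ["NPU", "GPU", "CPU"]) {
    if (available.includes(ep)) return ep;
  }
  return null;
}

async function main() {
  // Lazy imports so this sketch parses without the packages installed.
  const { FoundryLocalManager } = await import("foundry-local-sdk");
  const { default: OpenAI } = await import("openai");

  // Discovery + download + load: init() resolves the alias to the variant
  // best suited to this machine, fetches it if uncached, and loads it.
  const manager = new FoundryLocalManager();
  const modelInfo = await manager.init("whisper-tiny"); // alias is an assumption

  // The local endpoint is OpenAI-compatible, so the stock OpenAI client works.
  const openai = new OpenAI({
    baseURL: manager.endpoint,
    apiKey: manager.apiKey,
  });

  // Transcription: assumes the Whisper model is served on the standard
  // audio transcription route.
  const result = await openai.audio.transcriptions.create({
    model: modelInfo.id,
    file: createReadStream("recording.wav"),
  });
  console.log(result.text);
}

// Guarded so the sketch can be loaded without a running service.
if (process.argv.includes("--run")) {
  main().catch((err) => console.error(err));
}
```

Because `init()` covers discovery, download, and loading in one call, the application code only has to name the model alias and hand the audio to the client.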
No need for `whisper.cpp`, `@huggingface/transformers`, or any other separate STT tool.