This page provides detailed hardware, software, platform requirements, and supported models to help you set up and run the application efficiently.
- OS: Windows 11
- Recommended processor: Intel® Core Ultra Series 1, 2, and 3 Processors (with integrated GPU support)
- Memory: 32 GB RAM (minimum recommended)
- Storage: At least 50 GB free (for models and logs)
- GPU/Accelerator: Intel® iGPU (Core Ultra Series 1, Arc GPU, or higher) for summarization acceleration
- NPU: Intel® NPU (Core Ultra Series 1 or higher) for Video pipelines
- NPU Driver: Please download and install the latest version from Intel NPU Driver Download Page
- Python: 3.12
- Node.js: v18+ (for frontend)
- Whisper (all models supported)
- Recommended:
whisper-smallor lower for CPU efficiency - Runs on CPU (Whisper is CPU-centric)
- Recommended:
- FunASR (Paraformer)
- Recommended for Chinese transcription (
paraformer-zh)
- Recommended for Chinese transcription (
- Supports transcription of .mp3/.wav audio files up to 45 minutes long.
- Qwen Models (OpenVINO / IPEX)
Qwen2.0-7B-InstructQwen2.5-7B-Instruct
- Summarization supports up to 7,500 tokens (≈ 45 minutes of audio) on GPU
- int8 → Recommended for lower-end CPUs (fast + efficient)
- fp16 → Recommended for higher-end systems (better accuracy, GPU acceleration)
- int4 → Supported, but may reduce accuracy (use only if memory-constrained)
- Supports 3 Video pipelines (front, back and board) up to 45 minutes
- Supports .mp4 format
Run summarization on GPU (Intel® iGPU / Arc GPU) for faster performance.