Skip to content

Latest commit

 

History

History
43 lines (33 loc) · 1.87 KB

File metadata and controls

43 lines (33 loc) · 1.87 KB

System Requirements

This page provides detailed hardware, software, platform requirements, and supported models to help you set up and run the application efficiently.

Software and Hardware Requirements

  • OS: Windows 11
  • Recommended processor: Intel® Core Ultra Series 1, 2, and 3 Processors (with integrated GPU support)
  • Memory: 32 GB RAM (minimum recommended)
  • Storage: At least 50 GB free (for models and logs)
  • GPU/Accelerator: Intel® iGPU (Core Ultra Series 1, Arc GPU, or higher) for summarization acceleration
  • NPU: Intel® NPU (Core Ultra Series 1 or higher) for Video pipelines
  • NPU Driver: Please download and install the latest version from Intel NPU Driver Download Page
  • Python: 3.12
  • Node.js: v18+ (for frontend)

Supported Models

ASR (Automatic Speech Recognition)

  • Whisper (all models supported)
    • Recommended: whisper-small or lower for CPU efficiency
    • Runs on CPU (Whisper is CPU-centric)
  • FunASR (Paraformer)
    • Recommended for Chinese transcription (paraformer-zh)
  • Supports transcription of .mp3/.wav audio files up to 45 minutes long.

Summarization (LLMs)

  • Qwen Models (OpenVINO / IPEX)
    • Qwen2.0-7B-Instruct
    • Qwen2.5-7B-Instruct
  • Summarization supports up to 7,500 tokens (≈ 45 minutes of audio) on GPU

Supported Weight Formats

  • int8 → Recommended for lower-end CPUs (fast + efficient)
  • fp16 → Recommended for higher-end systems (better accuracy, GPU acceleration)
  • int4 → Supported, but may reduce accuracy (use only if memory-constrained)

Video Pipeline

  • Supports 3 Video pipelines (front, back and board) up to 45 minutes
  • Supports .mp4 format

Run summarization on GPU (Intel® iGPU / Arc GPU) for faster performance.