Problem
The AionUi web chat currently supports text and image messages from AI assistants, but has no way to display or play video and audio content inline.
Many AI coding tools (Claude Code, Gemini CLI, Codex, OpenCode, etc.) and MCP servers can produce video and audio outputs:
- ComfyUI MCP can generate videos (LTXV, AnimateDiff, SVD)
- Video analysis MCP tools return frame grids and scene clips
- Audio MCP tools can generate and play back audio
- Agents may share video recordings, screen captures, or audio samples
Today these outputs fail to render, appear as raw base64 strings or links, or are silently dropped.
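To make the failure mode concrete, here is a minimal sketch of how a chat renderer might decide whether a message payload is an image, video, or audio before falling back to raw text. Everything in it is an assumption for illustration: `MediaKind`, `classifyMedia`, `kindFromMime`, and the extension table are hypothetical names, not existing AionUi APIs, and the extension list is deliberately short.

```typescript
// Hypothetical helper; none of these names come from the AionUi codebase.
type MediaKind = "image" | "video" | "audio" | "unknown";

// Fallback for plain links that carry no MIME type (illustrative subset only).
const EXTENSION_KINDS: Record<string, MediaKind> = {
  png: "image", jpg: "image", jpeg: "image", gif: "image", webp: "image",
  mp4: "video", webm: "video",
  mp3: "audio", wav: "audio", flac: "audio",
};

// Map a MIME type such as "video/mp4" to its top-level media kind.
function kindFromMime(mime: string): MediaKind {
  const top = mime.trim().toLowerCase().split("/")[0];
  return top === "image" || top === "video" || top === "audio" ? top : "unknown";
}

// Classify either a data URI (data:<mediatype>[;base64],<data>) or a link.
function classifyMedia(source: string): MediaKind {
  const dataUri = /^data:([^;,]+)[;,]/i.exec(source);
  if (dataUri) return kindFromMime(dataUri[1] ?? "");

  // Drop any query string or fragment before reading the file extension.
  const path = source.split(/[?#]/)[0] ?? "";
  const dot = path.lastIndexOf(".");
  const ext = dot >= 0 ? path.slice(dot + 1).toLowerCase() : "";
  return EXTENSION_KINDS[ext] ?? "unknown";
}
```

With a classifier along these lines, a payload identified as `"video"` or `"audio"` could be routed to an inline player instead of being printed as base64 or dropped. How the result maps onto actual chat components is left open here.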
Use Cases
- ComfyUI video generation — view generated videos (LTXV, AnimateDiff) directly in chat without downloading
- Video frame analysis — see extracted frames and scene-change comparisons inline
- Audio generation — hear TTS output, music generation, or audio-reactive results
- Screen recordings — share/review agent screen captures
- Multimodal model outputs — display rich media from vision-language models that return video/audio