ai-audio

Here are 90 public repositories matching this topic...

Enemyx-net / VibeVoice-ComfyUI

A comprehensive ComfyUI integration for Microsoft's VibeVoice text-to-speech model, enabling high-quality single and multi-speaker voice synthesis directly within your ComfyUI workflows.

text-to-speech tts voice-cloning ai-voice voice-generation ai-audio t2s ai-tts ai-voice-clone ai-voice-clonining voice-generator comfyui-nodes comfyui-custom-node comfyui-custom-nodes-text-to-speech vibevoice vibevoice-microsoft

Updated Feb 18, 2026
Python

diodiogod / TTS-Audio-Suite

A ComfyUI custom node integration for local multi-engine multi-language Text-to-Speech and Voice Conversion. Supports: RVC, Echo-TTS, Qwen3-TTS, Cozy Voice 3, Step Audio EditX, IndexTTS-2, Chatterbox (classic and multilingual), F5-TTS, Higgs Audio 2, 3, and VibeVoice with unlimited text length, SRT timing, Character support, and many audio tools

Updated Jun 24, 2026
Python

gantasmo / theDAW

Full-featured DAW, DJ, and VJ app interoperable w/Ableton, Reaper, Resolume & more. Stable Audio 3, Magenta RT2, Suno API, Chimera track fusion, Demucs stems, MIDI generate/notate, img > spectrogram > music, drawing > music, VST3 & .gan plugins, automix & key-lock, GLSL shaders, volumetric video, Quest 3 XR interface, MIDI auto-map, RAG assistant

Updated Jul 8, 2026
TypeScript

JaySpiffy / draft-to-take

Draft to Take beta: local-first AI audio production studio powered by IndexTTS2, Docker, Qwen, OmniVoice, SFX, ambience, and music sidecars.

docker text-to-speech gpu self-hosted tts speech-synthesis multi-speaker voice-cloning fastapi ai-audio timeline-editor local-ai indextts index-tts indextts2 speaker-prep draft-to-take

Updated Jul 5, 2026
Batchfile

dan-k-k / vocal-gate

Free real-time AI Noise Gate VST3/AU plugin. Removes coughs, sneezes, and other artifacts from your live streams, podcasts, and videos.

Updated Apr 5, 2026
C++

gantasmo / StableDAW

ARCHIVED: use theDAW at https://github.com/gantasmo/theDAW instead. Browser-based AI audio DAW for Stable Audio 3 with text-to-audio, inpainting, LoRA training, FFmpeg effects, waveform editing, sequencer, piano roll, and persistent library.

react python ffmpeg cuda daw lora music-generation audio-gene vite pythorch fastapi text-to-audio music-ai audio-inpainting ai-audio generative-ai waveform-editor stable-audio stable-audio-3

Updated Jun 13, 2026
TypeScript

Ali-Shariati-Najafabadi / Real-Time-Deepfake-Pipeline

Real-Time Deepfake Pipeline

audio real-time video ai skype realtime faceswap gan webcam zoom microsoft-teams audio-processing deepfake deepfake-detection ai-audio real-time-deepfake

Updated Jun 5, 2025
Python

RhythrosaLabs / soundstorm

AI-powered audio manipulation studio — sample pack creation, algorithmic composition, text-to-audio generation, and ChatGPT on one screen

Updated Jul 3, 2026
Python

aman179102 / podvoice

Local-first CLI that turns Markdown scripts into multi-speaker podcast-style audio using Coqui XTTS v2.

python cli text-to-speech automation opensource podcast tts developer-tools content-creation local-first ai-audio coqui-tts xtts offline-ai open-source-cli markdown-to-audio local-first-ai

Updated Mar 29, 2026
Python

soumya997 / Music-Generation-Using-Deep-Learning

Music Generation Using Deep Learning🎶🎵

nlp machine-learning deep-learning tensorflow2 musicgeneration ai-audio

Updated Jun 26, 2021
Jupyter Notebook

cliffbackerhope / AI-Audio-Content-Creator

AI Audio Content Creation Platform for Podcasts, Narration, Voice Generation and Audio Production. Create Professional Audio Content from Text with Modern Audio Workflows.

Updated Jun 13, 2026
Python

theelderemo / ai-audio-tools

Community list of AI tools for audio and music

Updated Feb 28, 2026

Yuan-ManX / ai-voice-agents

AI Voice Agents: Exploring the Next Generation of Human-Machine Interaction! 🎙️🤖🎧

ai deep-learning ai-agents ai-voice meachine-learning ai-audio ai-agents-framework

Updated Aug 30, 2024

gabrielsenadev / audioinsight

AudioInsight is a web application that processes audio, generates transcriptions, and allows users to ask questions about the related audio.

full-stack webdev whisper audio-processing audio-to-text ai-audio cloudflare-ai

Updated Jan 18, 2026
TypeScript

sezer-muhammed / EBookReaderFullStack

A local-first EPUB reader with high-fidelity neural text-to-speech, word-level synchronization, and Next.js/FastAPI/ONNX stack.

nextjs tts epub-reader onnx fastapi neural-tts ai-audio

Updated Feb 26, 2026
TypeScript

richardr1126 / KittenTTS-FastAPI

High-performance KittenTTS API server with a built-in web UI, OpenAI-compatible routes, long-form text support, and optional CUDA acceleration.

Updated Apr 6, 2026
Python

ALucek / companion-guide-challenge

An approach to Andrej Karpathy's LLM challenge, as outlined here: https://twitter.com/karpathy/status/1760740503614836917

audio blogs video-to-text ai-audio ai-video

Updated Mar 13, 2024
Jupyter Notebook

liushafeiniao / aiwave

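AI sound effects generation platform: describe a scene in one sentence and get professional-grade sound effects in seconds. For video creators, game developers, and podcast hosts. 🎵 aiwave.art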

sfx content-creator audio-tools indie-game-dev text-to-audio ai-audio sound-generation ai-sound-effects

Updated Jun 26, 2026

DynamicDevices / meta-dynamicdevices

Professional Yocto BSP Layer for Dynamic Devices Edge Computing Platforms - AI Audio Processing, E-Ink Displays, Power Management, Wireless Connectivity, i.MX8MM/i.MX93 Support

Updated Jun 3, 2026
Shell

Dineshkumar-Ponnusamy / maya-voice-ai

Maya Voice AI is an open-source project that demonstrates the Maya1 model, capable of generating realistic voice audio from text input with rich emotional and descriptive control. This repository provides a demo for text-to-speech synthesis using advanced language models and the SNAC codec, focusing on high-quality audio at 24kHz.

python open-source text-to-speech deep-learning speech-synthesis maya voice-ai emotional-voice-conversion ai-audio audio-generation snac-codec

Updated Nov 10, 2025
Python
