ML/AI Repos 🤖
Official implementations for paper: Anydoor: zero-shot object-level image customization
Instant voice cloning by MIT and MyShell. Audio foundation model.
Generative Agents: Interactive Simulacra of Human Behavior
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
Platform to experiment with the AI Software Engineer. Terminal based. NOTE: Very different from https://gptengineer.app
A realtime sketch to image demo using LCM and the gradio library.
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
🤖 Scrape data from HTML websites automatically by just providing examples
Single Image to 3D using Cross-Domain Diffusion for 3D Generation
⚡️ OpenAI PHP is a supercharged community-maintained PHP API client that allows you to interact with OpenAI API.
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
Official implementation of "Neuralangelo: High-Fidelity Neural Surface Reconstruction" (CVPR 2023)
Generative Models by Stability AI
Open-source AI Landing page generator for everyone!
Official code for "Towards An End-to-End Framework for Flow-Guided Video Inpainting" (CVPR2022)
Official Code for DragGAN (SIGGRAPH 2023)
Run the official Stable Diffusion releases in a Docker container with txt2img, img2img, depth2img, pix2pix, upscale4x, and inpaint.
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Easy Docker setup for Stable Diffusion with user-friendly UI
Stable Diffusion web UI
Pythonic AI generation of images and videos
A multi-voice TTS system trained with an emphasis on quality
Robust Speech Recognition via Large-Scale Weak Supervision
A latent text-to-image diffusion model
A repository for storing models that have been inter-converted between various frameworks. Supported frameworks are TensorFlow, PyTorch, ONNX, OpenVINO, TFJS, TFTRT, TensorFlowLite (Float32/16/INT8…