-
SafeEar Public
Forked from LetterLiGo/SafeEar
The Official Code Repo of SafeEar (Accepted by CCS 2024)
-
IIANet Public
This is the demo of our paper "IIANet: An Intra- and Inter-Modality Attention Network for Audio-Visual Speech Separation".
-
Apollo-data-preprocess Public
Apollo training data preprocessing scripts
-
Apollo Public
Music repair method to convert lossy MP3 compressed music to lossless music.
-
TIGER Public
TIGER: Time-frequency Interleaved Gain Extraction and Reconstruction for Efficient Speech Separation
-
SonicSim Public
SonicSim: A customizable simulation platform for speech processing in moving sound source scenarios
-
RSTnet Public
Forked from yangdongchao/RSTnet
Real-time Speech-Text Foundation Model Toolkit
-
speechbrain Public
Forked from speechbrain/speechbrain
A PyTorch-based Speech Toolkit
-
CTCNet Public
An Audio-Visual Speech Separation Model Inspired by Cortico-Thalamo-Cortical Circuits
-
orbit Public
Forked from isaac-sim/IsaacLab
Unified framework for robot learning built on NVIDIA Isaac Sim
Python Other Updated Apr 27, 2024
-
torchmetrics Public
Forked from Lightning-AI/torchmetrics
Torchmetrics - Machine learning metrics for distributed, scalable PyTorch applications.
Python Apache License 2.0 Updated Apr 24, 2024
-
TDANet Public
An efficient speech separation method
-
LRS3-For-Speech-Separation Public
Data generation script for the multi-modal speech separation task on the LRS3 dataset.
-
Look2hear Public
A toolkit for researchers in multimodal sound separation.
16 Updated Oct 20, 2023
-
sdx-submissions Public
Forked from sdx-workshop/sdx-submissions
Sound Demixing Challenge Submission Repo
-
S4M Public
Official implementation of Efficient Speech Separation Framework Based on Neural State-Space Models
-
avlit Public
Forked from hmartelb/avlit
Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Model" (AVLIT)
-
Conv-TasNet Public
PyTorch implementation of "Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation"
-
AV-ConvTasNet Public
Unofficial Time Domain Audio Visual Speech Separation Implementation
-
Dual-Path-RNN-Pytorch Public
Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation, implemented in PyTorch
-
GenerSpeech Public
Forked from Rongjiehuang/GenerSpeech
PyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model towards zero-shot style transfer of OOD custom voice.
-
Executable code based on Google articles
-
asteroid Public
Forked from asteroid-team/asteroid
PyTorch-based audio source separation toolkit. Current highlight: the new recipe for Microsoft's DNS Challenge!