Repositories list
33 repositories
Mel-McNet (Public)
FS-EEND (Public): The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors". [ICASSP 20…
ATST-SED (Public): This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".
VING (Public)
FN-SSL (Public): The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization [INTERSPEECH2023 & TASLP2024]
VINP (Public)
CleanMel (Public)
Rec-RIR (Public)
audiossl (Public): A library built for easier audio self-supervised training, downstream tasks evaluation
RealMAN (Public): A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurIPS 2024]
RVAE-EM (Public): Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutive transfer function" [I…
NBSS (Public): The official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberation
UMA-ASR (Public): This repository is the official implementation of unimodal aggregation (UMA) for automatic speech recognition (ASR).
SAR-SSL (Public): A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Multi-Channel Conformer” [T…
ATST-RCT (Public)
FullSubNet (Public): PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."
McNet (Public): The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement", ICASSP 2023
RCT (Public)
Narrowband_DeepFiltering (Public)
RTF_InterFrameSpecSub (Public)
RS_noisePSD (Public)
DP_RTF_SSL (Public)
bss_ctf_lasso (Public)
dereverb_ctf_nonneg (Public)
BSS_CTF_EM (Public)
LSTM-noisePSD (Public)