Skip to content
View haoheliu's full-sized avatar
🚩
Focusing
🚩
Focusing
  • UoSurrey, Centre for Vision, Speech and Signal Processing (CVSSP)
  • Guildford GU2 7XH Stag Hill, UK
  • 16:13 (UTC -12:00)

Block or report haoheliu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
haoheliu/README.md

Haohe's GitHub stats

Pinned Loading

  1. AudioLDM AudioLDM Public

    AudioLDM: Generate speech, sound effects, music and beyond, with text.

    Python 2.8k 249

  2. AudioLDM2 AudioLDM2 Public

    Text-to-Audio/Music Generation

    Python 2.5k 202

  3. versatile_audio_super_resolution versatile_audio_super_resolution Public

    Versatile audio super resolution (any -> 48kHz) with AudioSR.

    Python 1.6k 176

  4. SemantiCodec-inference SemantiCodec-inference Public

    Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with a better semantic in the latent space.

    Python 230 20

  5. audioldm_eval audioldm_eval Public

    This toolbox aims to unify audio generation model evaluation for easier comparison.

    Python 364 37

  6. AudioLDM-training-finetuning AudioLDM-training-finetuning Public

    AudioLDM training, finetuning, evaluation and inference.

    Python 282 55