forked from thorwhalen/hum
-
Notifications
You must be signed in to change notification settings - Fork 1
Notes
Thor Whalen edited this page Mar 2, 2023
·
2 revisions
Generates audio from words. Think Dall-E (or Craiyon or Midjourne) but for sound.
Written in python (but readme has only CLI examples). (To install: pip install audioldm)
huggingface hosts a GUI to try it out. I tried "" and got this sound (could only figure out how to download as video).
How can we use this? To generate and transform (therefore expand, enhance, etc.) targeted audio data. This can help when we don't have any, or not enough, data to get good models, or a good sense of how robust our models are.