Training on non-vocal data, for noise removal? #554
Replies: 7 comments 4 replies
-
| 
         Very interested about this too, I am a sound designer as well  | 
  
Beta Was this translation helpful? Give feedback.
-
| 
         +1 from me  | 
  
Beta Was this translation helpful? Give feedback.
-
| 
         +1 for me  | 
  
Beta Was this translation helpful? Give feedback.
-
| 
         @Landeplage @g-frega Yes this is possible. To create a dataset for separating bird sounds you need two sounds, first you need the bird sound(s) with zero to as little background noise as possible. Second you need sounds that you want to separate the bird sounds from, the second sound can not contain bird sounds. I would be interested in collaborating on something like this, I can train the models if someone can provide the data, I have no idea how well this would perform but if anyone is interested my discord username is kimberleyjsn or email [email protected]  | 
  
Beta Was this translation helpful? Give feedback.
-
| 
         Hi, I have already created exactly such a model. This model will be added soon in one of the next UVR GUI updates. In the meantime, you can use it on my UVR Online website. You can test the premium version for 7 days for free: https://www.patreon.com/uvronline  | 
  
Beta Was this translation helpful? Give feedback.
-
| 
         Hello all Izotope RX 11 is based on the Demucs model for stem separation. Steinberg Spectralayers also uses Demucs, but the developer indicates that he trained the models to achieve better results than the original Demucs and to separate more stems. Training models seems complex to me in many ways, due to my limited mathematical, acoustic, english and AI knowledge. It also appears that to obtain good results, you need powerful hardware; a RTX 3090 seems to be a minimum, or you have to pay for AI cloud computing rentals. I may be wrong, but only a specialist like Aufr could confirm my hypothesis. It would indeed be great to be able to train models ourselves to remove noise from vocals (speech). I tried the Gabox Fv7z model, which gives the best results for my personal case, but there are still some noises that none of the other models can remove without deleting vocal elements. Even Spectralayers 11 separator cannot do it (plus, the demo version is ending). It’s clear that if I had the possibility to train a model closer to the conditions I encounter, that would be really great. For that, given my limited knowledge, I would need a step-by-step procedure and to know if it’s possible with an RTX 2070? Happy development and training to everyone. The work done by Anjok, Aufr, Jarredou, and many others is truly remarkable. I take this opportunity to thank them for their time and for sharing their work.  | 
  
Beta Was this translation helpful? Give feedback.
-
| 
         I'm from MVsep and discovered the denoise model by aufr33 there. I can't put into words how amazing it is and how much it's helped me with audio restoration since it was introduced. I'm wondering if it's possible for an updated training data to be uploaded for it? The last update was about a year ago now. It's great, but there's still areas for improvement. It would also be amazing to train other denoising options for other situations like heavy hiss, different kinds of hiss, etc.  | 
  
Beta Was this translation helpful? Give feedback.

Uh oh!
There was an error while loading. Please reload this page.
-
I've been testing UVR for a few days, and it's really amazing what the MDX-Net process can do.
I'm a sound designer and recordist, and so immediately I got the idea to use this method for non-musical audio as well. Is it possible to train it to remove noise from recordings? I'm thinking sounds like birds, preamp noise, traffic, talking. If the issue would be gathering data for this, I know about 2700 recordists who could be interested in lending their recordings to use.
I'm not sure what the training data requirements are, but I assume that one would need both a mixed recording along with already separated recordings?
A similar tool exists for removing the sound of birds from recordings:
https://www.boomlibrary.com/sound-effects/debird/
iZotope RX is another industry-standard noise removal tool, although I'm not sure whether they use machine learning for their algorithms, but they definitely moving in that direction:
https://www.izotope.com/en/products/rx.html
Beta Was this translation helpful? Give feedback.
All reactions