Open
Description
Hi,
The performance of the model are really good when the voice is clean, however if the background is not clean with some noisy or room reverb, the recall rate is really low. is it possible to add some background noise or reverb into keyword audio sample to increase the detect rate under complex scene, Will it affect the recognition success rate of the model? Is such data enhancement done during training?
Metadata
Assignees
Labels
No labels