Description
I have used the old version of GOP provided by @jimbozhang, and I'm now using the new version (gop_speechocean762).
When I test one audio file against a different text, the model returns high scores for phones that do not exist in the audio.
E.g.: the text is "kick", but the audio contains only one word, "change".
Result:
K: 1.98
IH: 1.27
K: 1.05
I think the problem is that the dataset is not balanced:
0: 1339
1: 1828
2: 44079
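
To make the imbalance concrete, here is a small sketch that turns the counts above into per-label shares. It assumes the three lines are `label: count` pairs. The inverse-frequency weights are just one common heuristic for compensating for imbalance; I am not claiming the gop_speechocean762 recipe uses or supports them.

```python
# Counts copied from the list above, assumed to be label -> number of samples.
counts = {0: 1339, 1: 1828, 2: 44079}

total = sum(counts.values())

# Fraction of the data that falls under each label.
shares = {label: n / total for label, n in counts.items()}

# "Balanced" inverse-frequency weights: total / (num_labels * count).
# Illustrative heuristic only, not taken from the recipe.
weights = {label: total / (len(counts) * n) for label, n in counts.items()}

for label in sorted(counts):
    print(f"label {label}: share={shares[label]:.2%}, weight={weights[label]:.2f}")
```

With these counts, label 2 accounts for about 93% of the samples, so a model trained on them sees very few low-score examples.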