Hi, thanks for developing this great toolkit. I have two questions about the conformer model:
- For the conformer model in `examples/conformer`, I think almost all the parameters are similar to Conformer (S) from https://arxiv.org/pdf/2005.08100.pdf. However, the performance gap between the paper and the conformer model in `examples/conformer` seems quite large (2.7 vs. 6.44 on test-clean). What do you think might be the reason for this?
One reason I can see is that 2.7 is obtained with beam search whereas 6.44 is obtained without it, but I don't think beam search alone can account for that difference. Can you give me some pointers on how I can reduce this gap? Also, did you try decoding with beam search for `examples/conformer`?
- I was trying to decode `examples/conformer` with beam search, running `test_subword_conformer.py` with the pre-trained model provided via Drive. For this I only modified the `beam-width` parameter in `config.yml`. But decoding takes a very long time (about 30 min per batch, with ~650 batches in test-clean) on an Nvidia P40 with 24 GB of memory.
Is this the expected behaviour, or do I need to do something more than changing `beam-width` from 0 to 4/8? What was the decoding time for you?
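
For scale, here is a back-of-the-envelope estimate using the figures above (about 30 minutes per batch, roughly 650 batches). It assumes the per-batch time stays constant over the whole set, which I have not verified, and the function name is only illustrative:

```python
def estimate_total_decode_hours(minutes_per_batch: float, num_batches: int) -> float:
    """Total decoding time in hours, assuming every batch takes the same time."""
    return minutes_per_batch * num_batches / 60.0


# Figures from this issue: ~30 min per batch, ~650 batches in test-clean.
total_hours = estimate_total_decode_hours(30, 650)
print(f"{total_hours:.0f} hours (~{total_hours / 24:.1f} days)")
```

At that rate a single pass over test-clean would take about two weeks, which is why I suspect something is misconfigured on my side.
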
Thanks,
Abhinav