State of the Art for conformer and beam decoding #106

Hi, thanks for developing this great toolkit. I have two questions about the conformer model:

1. For the conformer model in ```examples/conformer```, I think almost all the parameters are similar to Conformer (S) from https://arxiv.org/pdf/2005.08100.pdf. However, the performance gap between the paper and the conformer model in ```examples/conformer``` seems quite large (2.7 vs. 6.44 WER on test-clean). What do you think might be the reason for this?

One reason I can see is that 2.7 is obtained with beam search, whereas 6.44 is obtained without it. But I don't think beam search alone can account for that difference. Can you give me some pointers on how I can reduce this gap? Also, did you try decoding with beam search for ```examples/conformer```?

2. I was trying to decode ```examples/conformer``` with beam search using ```test_subword_conformer.py``` and the pre-trained model provided via [drive](https://drive.google.com/drive/folders/1VAihgSB5vGXwIVTl3hkUk95joxY1YbfW?usp=sharing). For this I only modified the ```beam-width``` parameter in config.yml. But decoding is taking a very long time (about 30 minutes per batch, with roughly 650 batches in test-clean) on an NVIDIA P40 with 24 GB of memory.

Is this the expected behaviour, or do I need to do something more than changing ```beam-width``` from 0 to 4/8? What was the decoding time for you?
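
For scale, here is a rough back-of-the-envelope estimate of the full test-clean run. It assumes the approximate figures above (about 30 minutes per batch, about 650 batches) hold for every batch, which I have not verified:

```python
# Rough estimate of total beam-search decoding time on test-clean.
# Both inputs are the approximate figures quoted above, not measurements.
minutes_per_batch = 30   # approximate time observed per batch
num_batches = 650        # approximate number of batches in test-clean

total_minutes = minutes_per_batch * num_batches
total_hours = total_minutes / 60
total_days = total_hours / 24

print(f"{total_hours:.0f} hours (~{total_days:.1f} days)")
# prints: 325 hours (~13.5 days)
```

At that rate a single pass over test-clean would take close to two weeks, which is why I suspect something is wrong with my setup.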

Thanks,
Abhinav
