Skip to content

State of the Art for conformer and beam decoding #106

Open
@abhinavg4

Description

@abhinavg4

Hi, Thanks for developing this great tool kit. I had 2 questions about the conformer model :-

  1. For the conformer model in examples/conformer , I think almost all the parameter are similar to conformer(S) of https://arxiv.org/pdf/2005.08100.pdf . However, the performance gap between the paper and conformer model in examples/conformer seems to be quite big (2.7 v/s 6.44 for test-clean). What do you think might be the reason for this?

One reason I can see is that 2.7 is obtained with beam-search whereas 6.44 without. But I don't think just beam search can bring that difference. Can you give me some pointers on how can I reduce this gap? Also, Did you try decoding with beam search for examples/conformer ?

  1. I was trying to decode examples/conformer with beam search test_subword_conformer.py using the pre-trained model provided via drive. For this I just modified beam-width parameter in config.yml. But the decoding is taking very large time (about 30 min per batch, the total number of batches in test clean ~650) on Nvidia p40 with 24GB memory.

Is this the expected behaviour or do I need to something more than changing beam-width from 0 to 4/8. What was the decoding time for you?

Thanks,
Abhinav

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions