Question regarding LogSoftmax+nn.CrossEntropyLoss

Hi, doesn't nn.CrossEntropyLoss already apply a log-softmax internally? You also use a separate LogSoftmax in your decoder. Did the model reach a Levenshtein distance of 10 despite this? Kindly clarify.
Also, are the default hyperparameters in your main.py the ones you used to get a Levenshtein distance of 10 on WSJ? With the defaults, convergence seems too slow for me. After how many epochs did you start seeing proper words, and did you use mel spectrograms as input features?
TIA!
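
For reference, here is a minimal sketch of the double log-softmax situation asked about above. It uses plain Python rather than PyTorch so it runs anywhere; `log_softmax` and `cross_entropy` are hypothetical stand-ins written for this illustration (not code from the repository), with `cross_entropy` mimicking the documented behavior of nn.CrossEntropyLoss on a single sample (log-softmax followed by negative log-likelihood). The logits and target are made-up values.

```python
import math


def log_softmax(logits):
    """Numerically stable log-softmax over a list of floats."""
    m = max(logits)
    log_sum_exp = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_sum_exp for x in logits]


def cross_entropy(logits, target):
    """Stand-in for nn.CrossEntropyLoss on one sample:
    apply log-softmax, then take the negative log-likelihood of the target."""
    return -log_softmax(logits)[target]


# Illustrative values only (assumption, not data from the model).
logits = [2.0, -1.0, 0.5, 0.1]
target = 2

# Case 1: raw decoder scores go straight into the loss.
loss_raw = cross_entropy(logits, target)

# Case 2: the decoder applies LogSoftmax first, then the loss applies it again.
log_probs = log_softmax(logits)
loss_double = cross_entropy(log_probs, target)

print(loss_raw, loss_double)
```

The two losses agree up to floating-point error, because log-probabilities already sum to one after exponentiation, so a second log-softmax subtracts log(1) = 0. Under these assumptions the extra LogSoftmax looks redundant rather than harmful, which may be why the model could still train.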
