Thank you. The problem is just with convergence speed rather than accuracy then. I will try to replicate with Julia 1.0.

…

On Tue, Oct 16, 2018 at 8:09 AM erenay dayanik ***@***.***> wrote: I run a couple of experiments by using exactly the same script and code in the repository. (The environment: Julia 0.6.2, Knet v0.9.1). I share a chart below to share the results I obtained. As you said, they’re not same as the one shared in README. However, the model did not get stuck around ~60%. At the end of the training, I obtained accuracy around ~91% on dev set in general. I remembered that I trained this model (and obtained the corresponding learning curve) on the old cluster (somon & kuacctest) meaning that I might have used even older versions of Knet & Autograd. One possible reason might be the change in the dropout usage. Forget gate bias values of the LSTM might also affect the results. As far as I remember, I was setting them to 1.0 manually on the old cluster (by changing the source code of Knet). If one of these is the problem, playing with hyperparameters and the seed might be enough to recover the loss in the performance which is currently I am doing. If I get improvement, I'll post it here too. I don’t think something serious happened since we’re still able to achieve 91% performance. The saved models can be found here <https://goo.gl/e4Y3WZ> . [image: validation accuracy chart] <https://user-images.githubusercontent.com/13196191/47014337-462dec80-d14a-11e8-93f6-d8014bfd48b1.png> — You are receiving this because you were assigned. Reply to this email directly, view it on GitHub <#3 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/ABvNpguhFA6qBAJZVfCq70cGaetfJ8QTks5ulcxygaJpZM4XX__8> .

cannot replicate convergence #3

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions