Thanks for sharing your code. However, I noticed that your reproduction seems to ignore the last dropout layer. Does this have any impact on the model?