Hi, thanks for your great work.
I'm trying to reproduce your model, but somehow my model performance always deteriorates around 1 point of HR@10.
The code is actually not that complex, where my preprocessing logic and model architecture follows your paper exactly.
My env is tf2.0, any idea why this happens?