
Low scores on repo's HSTU on MovieLens #10

@kzhirong


Hi, thanks for maintaining this repo. I'm running its HSTU implementation on MovieLens-1M and getting numbers substantially below what Meta reports in the original HSTU paper (NDCG@10 ≈ 0.1720). My setup follows the paper's protocol: leave-one-out split, multi-epoch full-shuffle training, and full-vocabulary ranking at eval. I tried both full-vocabulary cross-entropy (CE) and sampled softmax (SS, 128 negatives): CE gave NDCG@10 = 0.1140 and SS gave 0.0948. I wanted to check with you before drawing any conclusions, since the README cites the Meta paper as the source of this implementation. Can you clarify whether the HSTU here is expected to reproduce Meta's MovieLens results, or has it only been validated on the Amazon datasets shown in the README benchmarks?
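For reference, here is a minimal sketch of NDCG@10 under the leave-one-out, full-vocabulary ranking protocol described above. It assumes exactly one held-out item per user and breaks ties in the target's favor. The function names and the input layout are illustrative assumptions, not taken from this repo or from Meta's code, and the sketch does not cover whether previously seen items are filtered from the ranking.

```python
import math


def ndcg_at_k(scores, target, k=10):
    """NDCG@k for one user with a single held-out target item.

    scores: model scores, one per item in the full vocabulary (assumed layout).
    target: index of the held-out item in `scores`.
    """
    target_score = scores[target]
    # 0-based rank: number of items scored strictly higher than the target.
    rank = sum(1 for s in scores if s > target_score)
    if rank >= k:
        return 0.0
    # With one relevant item the ideal DCG is 1, so NDCG = 1 / log2(rank + 2).
    return 1.0 / math.log2(rank + 2)


def mean_ndcg_at_k(all_scores, targets, k=10):
    """Average NDCG@k over users (one score list and one target per user)."""
    values = [ndcg_at_k(s, t, k) for s, t in zip(all_scores, targets)]
    return sum(values) / len(values)
```

Differences in tie handling, or in whether a user's history items are excluded before ranking, can shift this metric, so those details are worth comparing when two implementations disagree.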
