Skip to content

[FEATURE] Absolute-Timestamp Splitting #689

@lthoang

Description

@lthoang

Description

A data-splitting strategy to use specific absolute timestamps to cut item sequences for training and evaluation, respectively. This strategy aligns with real-world scenarios but is not widely used in research. Researchers are encouraged to experiment with this splitting strategy.

Expected behavior with the suggested feature

Specially, given a chronological user interaction sequence of length N:

  • Training part: item interactions with timestamp range (-∞, t_1);
  • Validation part: item interactions with timestamp range [t_1, t_2);
  • Testing part: item interactions with timestamp range [t_2, +∞).

Reference: https://dl.acm.org/doi/pdf/10.1145/3569930

Other Comments

The Amazon Reviews 2023 dataset has already been split here: https://amazon-reviews-2023.github.io/data_processing/5core.html#absolute-timestamp-splitting

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions