I've noticed an inconsistency in the use of different folds for training (fold 1) and testing (fold 5), leading to videos being shared between the two sets. This inconsistency raises concerns about the reproducibility and reliability of the reported results in the paper, leading to potential issues in comparing and evaluating different models. Moreover, when I attempted to use the same fold for both training and testing with the provided parameter configuration, I obtained lower performance results than those mentioned in the paper. Can you please let me know if there is a specific reason for the current fold selection strategy?
I've noticed an inconsistency in the use of different folds for training (fold 1) and testing (fold 5), leading to videos being shared between the two sets. This inconsistency raises concerns about the reproducibility and reliability of the reported results in the paper, leading to potential issues in comparing and evaluating different models. Moreover, when I attempted to use the same fold for both training and testing with the provided parameter configuration, I obtained lower performance results than those mentioned in the paper. Can you please let me know if there is a specific reason for the current fold selection strategy?