There could be one script for training and one script for generating results. The training script should point to the version of the dataset, etc.
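One way this split could look is sketched below. It is only an illustration: the names `TrainConfig`, `train`, `generate_results`, `manifest.json`, and the dataset name and version values are all hypothetical, and the two functions stand in for what would be two separate scripts. The idea is that the training side records the dataset version it used, and the results side reads that record back rather than assuming it.

```python
import json
import tempfile
from dataclasses import dataclass, asdict
from pathlib import Path


@dataclass(frozen=True)
class TrainConfig:
    # Hypothetical placeholder values; replace with the real dataset details.
    dataset_name: str = "my_dataset"
    dataset_version: str = "v1.0"
    seed: int = 0


def train(config: TrainConfig, run_dir: Path) -> Path:
    """Stand-in for the training script: records which dataset version was used."""
    run_dir.mkdir(parents=True, exist_ok=True)
    manifest_path = run_dir / "manifest.json"
    manifest_path.write_text(json.dumps(asdict(config), indent=2))
    # ... the actual training would happen here ...
    return manifest_path


def generate_results(run_dir: Path) -> dict:
    """Stand-in for the results script: reads the manifest written by train()."""
    manifest = json.loads((run_dir / "manifest.json").read_text())
    return {
        "dataset": f"{manifest['dataset_name']}@{manifest['dataset_version']}",
        "seed": manifest["seed"],
    }


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        run_dir = Path(tmp) / "run_001"
        train(TrainConfig(), run_dir)
        print(generate_results(run_dir))
```

Keeping the dataset version in a small manifest next to the training outputs means the results script never has to guess which data a given run was trained on.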