1. Add 512, 1k, 2k and 4k versions. For now, we will only use the 512 version. 2. Investigate if we can keep the same split for all the 4 cases. 3. The test set will be checked with the Kaggle platform.