What‘s the organized format of the dataset in the code, and is there a need for an additional annotation format?