You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
OMAR-RQ models are offered in different configurations, each with its own strengths and weaknesses.
119
119
Models based on mel spectrogram (**base** and **multicodebook**) tend to perform better on semantic tasks such as auto-tagging, structure recognition, and difficulty estimation.
@@ -147,8 +147,8 @@ Check the default [config file](../cfg/rq_single_view/config.gin) to see the dif
147
147
148
148
At least the following parameters should be modified:
149
149
150
-
-`DiscotubeMultiViewAudioDataModule.data_dir` -> This should point to your base data folder.
151
-
-`DiscotubeMultiViewAudioDataModule.filelist_train` -> This should point to a filelist of training audio paths relative to the `data_dir` (one audio file per line).
150
+
-`DiscotubeMultiViewAudioDataModule.data_dir` -> Your base data folder.
151
+
-`DiscotubeMultiViewAudioDataModule.filelist_train` -> Filelist of training audio paths relative to the `data_dir` (one file per line).
152
152
-`DiscotubeMultiViewAudioDataModule.filelist_val` -> Same for the tracks on the validation split.
0 commit comments