You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
> **Note:** Different models were trained with different sample rates.
138
140
> It is responsibility of the user to ensure that the input audio is sampled at the correct rate.
139
141
140
142
OMAR-RQ models are offered in different configurations, each with its own strengths and weaknesses.
141
143
Models based on mel spectrogram (**base** and **multicodebook**) tend to perform better on semantic tasks such as auto-tagging, structure recognition, and difficulty estimation.
142
144
On the other hand, **multifeature-24hz-fsq** offers the best performance in tonal and temporal tasks such as pitch and chord estimation, and beat tracking.
145
+
The **base-freesound-small** and **base-freesound-large** models were trained with [Freesound](https://freesound.org/) data.
143
146
144
147
### Hugging Face Model IDs
145
148
@@ -148,6 +151,8 @@ On the other hand, **multifeature-24hz-fsq** offers the best performance in tona
0 commit comments