You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
-[Model training](https://nvidia.github.io/NeMo-Skills/pipelines/training): Train models at speed-of-light using [NeMo-Aligner](https://github.com/NVIDIA/NeMo-Aligner/).
16
17
17
18
You can find the full documentation [here](https://nvidia.github.io/NeMo-Skills/).
@@ -24,12 +25,12 @@ commands and their options.
24
25
Using our pipelines we created [OpenMathReasoning dataset](https://huggingface.co/datasets/nvidia/OpenMathReasoning).
25
26
This dataset contains
26
27
27
-
* 306K unique mathematical problems sourced from [AoPS forums](https://artofproblemsolving.com/community) with:
28
+
* 306K unique mathematical problems sourced from [AoPS forums](https://artofproblemsolving.com/community) with:
28
29
* 3.2M long chain-of-thought (CoT) solutions
29
30
* 1.7M long tool-integrated reasoning (TIR) solutions
30
31
* 566K samples that select the most promising solution out of many candidates (GenSelect)
31
32
* Additional 193K problems sourced from AoPS forums (problems only, no solutions)
32
-
33
+
33
34
We used [Qwen2.5-32B-Instruct](https://huggingface.co/Qwen/Qwen2.5-32B-Instruct) to preprocess problems, and
34
35
[DeepSeek-R1](https://huggingface.co/deepseek-ai/DeepSeek-R1) and [QwQ-32B](https://huggingface.co/Qwen/QwQ-32B) to generate solutions.
0 commit comments