Add instructions for testing other models

tongyx361 · tongyx361 · commit 91fd960024f9 · 2024-08-18T05:05:16.000+08:00
diff --git a/README.md b/README.md
@@ -332,6 +332,14 @@ For other general inference settings, please modify the command or
 directly modify the
 [script](https://github.com/hkust-nlp/dart-math/blob/main/pipeline/gen.py).
 
+- To test **base** models, please add the corresponding **ID** to
+  `BASE_MODEL_IDS` from
+  [dart_math.utils](https://github.com/hkust-nlp/dart-math/blob/main/dart_math/utils.py).
+- To test **instruct** models, please add the corresponding **prompt
+  template** to `PROMPT_TEMPLATE_ID2DICT` from
+  [dart_math.utils](https://github.com/hkust-nlp/dart-math/blob/main/dart_math/utils.py)
+  and specify with `--prompt_template`.
+
 You can also add the `--gen_only` option to only generate responses
 without evaluation and use the
 [`EvaluatorMathBatch`](https://hkust-nlp.github.io/dart-math/eval.html#evaluatormathbatch)
diff --git a/nbs/index.ipynb b/nbs/index.ipynb
@@ -352,6 +352,9 @@
     "\n",
     "For other general inference settings, please modify the command or directly modify the [script](https://github.com/hkust-nlp/dart-math/blob/main/pipeline/gen.py).\n",
     "\n",
+    "- To test **base** models, please add the corresponding **ID** to `BASE_MODEL_IDS` from [dart_math.utils](https://github.com/hkust-nlp/dart-math/blob/main/dart_math/utils.py).\n",
+    "- To test **instruct** models, please add the corresponding **prompt template** to `PROMPT_TEMPLATE_ID2DICT` from [dart_math.utils](https://github.com/hkust-nlp/dart-math/blob/main/dart_math/utils.py) and specify with `--prompt_template`.\n",
+    "\n",
     "You can also add the `--gen_only` option to only generate responses without evaluation and use the `EvaluatorMathBatch` to grade the generations by yourself. Please check the [grading script](pipeline/grade.py) for example.\n"
    ]
   },
@@ -527,7 +530,7 @@
    "cell_type": "markdown",
    "metadata": {},
    "source": [
-    "## 🌟 Star History"
+    "## 🌟 Star History\n"
    ]
   },
   {
@@ -540,7 +543,7 @@
     "   <source media=\"(prefers-color-scheme: light)\" srcset=\"https://api.star-history.com/svg?repos=hkust-nlp/dart-math&type=Date\" />\n",
     "   <img alt=\"Star History Chart\" src=\"https://api.star-history.com/svg?repos=hkust-nlp/dart-math&type=Date\" />\n",
     " </picture>\n",
-    "</a>"
+    "</a>\n"
    ]
   },
   {
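
The two bullets this commit adds to the README can be illustrated with a minimal sketch. It does not import `dart_math`: `BASE_MODEL_IDS` and `PROMPT_TEMPLATE_ID2DICT` below are local stand-ins, and their shapes (a list of strings, and a dict mapping a template ID to a dict of strings) are assumptions, since the diff does not show how they are defined in `dart_math/utils.py`. The helper functions, the model ID, and the template keys are all hypothetical and serve only to show the idea.

```python
# Local stand-ins for the objects named in the README. The real ones live in
# dart_math/utils.py; their structure is NOT shown in this diff and is assumed here.
BASE_MODEL_IDS: list[str] = []
PROMPT_TEMPLATE_ID2DICT: dict[str, dict[str, str]] = {}


def register_base_model(model_id: str) -> None:
    """Hypothetical helper: add a base model ID once, ignoring duplicates."""
    if model_id not in BASE_MODEL_IDS:
        BASE_MODEL_IDS.append(model_id)


def register_prompt_template(template_id: str, template: dict[str, str]) -> None:
    """Hypothetical helper: store a copy of the template under its ID."""
    PROMPT_TEMPLATE_ID2DICT[template_id] = dict(template)


# Base model: only the ID needs to be listed (the ID is a placeholder).
register_base_model("my-org/my-base-model")

# Instruct model: a prompt template is registered under an ID, which would then
# be passed on the command line via `--prompt_template`. The keys are illustrative.
register_prompt_template(
    "my-instruct",
    {"query_prefix": "Question: ", "response_prefix": "Answer: "},
)
```

In the actual repository, the equivalent change would be made by editing the definitions in `dart_math/utils.py` directly, as the README text describes.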
