Skip to content

Commit 864127c

Browse files
committed
update tutorial
1 parent 8a6c159 commit 864127c

File tree

1 file changed

+4
-4
lines changed

1 file changed

+4
-4
lines changed

docs/tutorials/01_start.ipynb

+4-4
Original file line numberDiff line numberDiff line change
@@ -813,9 +813,9 @@
813813
"source": [
814814
"Although we tried to chose default model parameters that work well in a wide range of scenarios, hyperparameter search will often find an emulator model with a better fit. Internally, `AutoEmulate` compares the performance of different models and hyperparameters using cross-validation on the training data, which can be computationally expensive and time-consuming for larger datasets. To speed it up, we can parallelise the process with `n_jobs`.\n",
815815
"\n",
816-
"For each model, we've pre-defined a search space for hyperparameters. When setting up `AutoEmulate` with `param_search=True`, we default to using random search with `param_search_iters = 20` iterations. We plan to add other hyperparameter search methods in the future. \n",
816+
"For each model, we've pre-defined a search space for hyperparameters. When setting up `AutoEmulate` with `param_search=True`, we default to using random search with `param_search_iters = 20` iterations. This means that 20 hyperparameter combinations from the search space are sampled and evaluated. We plan to add other hyperparameter search methods in the future. \n",
817817
"\n",
818-
"Let's do a hyperparameter search for the Gaussian Process and Random Forest models."
818+
"Let's do a hyperparameter search for the Support Vector Machines and Random Forest models."
819819
]
820820
},
821821
{
@@ -1352,7 +1352,7 @@
13521352
],
13531353
"source": [
13541354
"em = AutoEmulate()\n",
1355-
"em.setup(X, y, param_search=True, param_search_type=\"random\", param_search_iters=20, models=[\"GaussianProcess\", \"RandomForest\"], n_jobs=-2) # n_jobs=-2 uses all cores but one\n",
1355+
"em.setup(X, y, param_search=True, param_search_type=\"random\", param_search_iters=10, models=[\"SupportVectorMachines\", \"RandomForest\"], n_jobs=-2) # n_jobs=-2 uses all cores but one\n",
13561356
"em.compare()"
13571357
]
13581358
},
@@ -1427,7 +1427,7 @@
14271427
"metadata": {},
14281428
"source": [
14291429
"**Notes**: \n",
1430-
"* Some models, such as `GaussianProcess` can be slow to run hyperparameter search on larger datasets (say n > 1500). \n",
1430+
"* Some models, such as `GaussianProcess` can be slow when conducting hyperparameter search on larger datasets (say n > 1000). \n",
14311431
"* Use the `models` argument to only run hyperparameter search on a subset of models to speed up the process.\n",
14321432
"* When possible, use `n_jobs` to parallelise the hyperparameter search. With larger datasets, we recommend setting `param_search_iters` to a lower number, such as 5, to see how long it takes to run and then increase it if necessary.\n",
14331433
"* all models can be specified with short names too, such as `rf` for `RandomForest`, `gp` for `GaussianProcess`, `svm` for `SupportVectorMachines`, etc"

0 commit comments

Comments
 (0)