We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
1 parent 890e8b3 commit 59d0488Copy full SHA for 59d0488
_modules/lzero/policy/unizero.html
@@ -376,7 +376,7 @@ <h1>Source code for lzero.policy.unizero</h1><div class="highlight"><pre>
376
<span class="c1"># ****** Explore by random collect ******</span>
377
<span class="c1"># (int) The number of episodes to collect data randomly before training.</span>
378
<span class="n">random_collect_episode_num</span><span class="o">=</span><span class="mi">0</span><span class="p">,</span>
379
-
+
380
<span class="c1"># ****** Explore by eps greedy ******</span>
381
<span class="n">eps</span><span class="o">=</span><span class="nb">dict</span><span class="p">(</span>
382
<span class="c1"># (bool) Whether to use eps greedy exploration in collecting data.</span>
0 commit comments