
Conversation

@sawa3030 (Collaborator) commented Oct 21, 2025

Contributor Agreements

Please read the contributor agreements and if you agree, please click the checkbox below.

  • I agree to the contributor agreements.

Tip

Please follow the Quick TODO list to smoothly merge your PR.

Motivation

Add a trust-region Bayesian optimizer (TuRBO) to OptunaHub.

Description of the changes

Implements TuRBOSampler based on GPSampler. Note that categorical parameters are currently unsupported, and multi-objective optimization is not available.

Please refer to the paper Scalable Global Optimization via Local Bayesian Optimization for more information.
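
For reference, a minimal usage sketch (the OptunaHub package path samplers/turbo below is a placeholder, not necessarily the final registry name):

import optuna
import optunahub

# Load the sampler from OptunaHub (package path is hypothetical).
module = optunahub.load_module(package="samplers/turbo")

def objective(trial: optuna.Trial) -> float:
    x = trial.suggest_float("x", -5.0, 5.0)
    y = trial.suggest_float("y", -5.0, 5.0)
    return x**2 + y**2

study = optuna.create_study(direction="minimize", sampler=module.TuRBOSampler())
study.optimize(objective, n_trials=100)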

TODO List towards PR Merge

Please remove this section if this PR is not an addition of a new package.
Otherwise, please check the following TODO list:

  • Copy ./template/ to create your package
  • Replace <COPYRIGHT HOLDER> in LICENSE of your package with your name
  • Fill out README.md in your package
  • Add import statements of your function or class names to be used in __init__.py
  • (Optional) Add from __future__ import annotations at the head of any Python files that include typing to support older Python versions
  • Apply the formatter based on the tips in README.md
  • Check whether your module works as intended based on the tips in README.md

@sawa3030 (Collaborator, Author) commented:

Performance

The performance was measured by comparing TuRBOSampler with GPSampler using the Black-Box Optimization Benchmarking (BBOB) test suite on OptunaHub. Each sampler was run for 200 iterations, and the experiment was repeated 10 times.

function_id = 1:

| Sampler | Best value (mean ± std) | Runtime (mean ± std, s) |
|---------|-------------------------|-------------------------|
| TuRBO   | 80.2633 ± 0.2363        | 8.62 ± 0.49             |
| GP      | 79.4833 ± 0.0018        | 85.25 ± 10.12           |

function_id = 22:

| Sampler | Best value (mean ± std) | Runtime (mean ± std, s) |
|---------|-------------------------|-------------------------|
| TuRBO   | −987.1081 ± 5.4540      | 9.38 ± 0.58             |
| GP      | −982.0005 ± 23.2204     | 51.39 ± 2.15            |

Although the best values are comparable, the runtime is substantially reduced, especially at larger iteration counts, as expected for this approach.
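
For reference, a rough sketch of this kind of timing comparison (using a toy 10-dimensional sphere objective in place of the actual BBOB problems, so it will not reproduce the numbers above; the samplers/turbo package path is again a placeholder):

import time
import optuna
import optunahub

def objective(trial: optuna.Trial) -> float:
    # Toy sphere function standing in for a BBOB problem.
    xs = [trial.suggest_float(f"x{i}", -5.0, 5.0) for i in range(10)]
    return sum(x * x for x in xs)

samplers = {
    # "samplers/turbo" is a hypothetical package path for the sampler in this PR.
    "TuRBO": optunahub.load_module(package="samplers/turbo").TuRBOSampler(),
    "GP": optuna.samplers.GPSampler(seed=0),
}
for name, sampler in samplers.items():
    study = optuna.create_study(direction="minimize", sampler=sampler)
    start = time.perf_counter()
    study.optimize(objective, n_trials=200)
    print(f"{name}: best={study.best_value:.4f}, runtime={time.perf_counter() - start:.2f}s")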

@sawa3030 (Collaborator, Author) commented:

For ease of review: Given that this sampler mainly extends GPSampler, focusing on the diff starting from c574e68 should be sufficient to verify the logic. Thank you in advance!

@sawa3030 sawa3030 marked this pull request as ready for review October 24, 2025 09:40
@c-bata c-bata added the new-package New packages label Oct 31, 2025
@gen740 (Member) left a comment


Thank you for the PR! I've left a few comments.

Comment on lines 123 to 125
self._init_length = 0.8
self._max_length = 1.6
self._min_length = 0.5**7
Member

How about setting these values via constructor arguments?
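
For illustration, a minimal sketch of what exposing them might look like (argument names are hypothetical, not the PR's final API):

def __init__(
    self,
    *,
    init_length: float = 0.8,
    max_length: float = 1.6,
    min_length: float = 0.5**7,
    # ... remaining GPSampler-style arguments ...
) -> None:
    # Hypothetical constructor arguments: the trust-region side lengths
    # become configurable instead of hard-coded.
    self._init_length = init_length
    self._max_length = max_length
    self._min_length = min_length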

Comment on lines 146 to 151
def reset_trust_region(self, delete_trust_region_id: int) -> None:
    self._trial_ids_for_trust_region[delete_trust_region_id] = []
    self._length[delete_trust_region_id] = self._init_length
    self._n_consecutive_success[delete_trust_region_id] = 0
    self._n_consecutive_failure[delete_trust_region_id] = 0
    self._best_value_in_current_trust_region[delete_trust_region_id] = None
Member

I think this function is internal, so it should start with an underscore.
Alternatively, it can be removed because it is used only once.

Comment on lines 195 to 207
def _get_best_params_for_multi_objective(
    self,
    normalized_params: np.ndarray,
    standardized_score_vals: np.ndarray,
) -> np.ndarray:
    pareto_params = normalized_params[
        _is_pareto_front(-standardized_score_vals, assume_unique_lexsorted=False)
    ]
    n_pareto_sols = len(pareto_params)
    # TODO(nabenabe): Verify the validity of this choice.
    size = min(self._n_local_search // 2, n_pareto_sols)
    chosen_indices = self._rng.rng.choice(n_pareto_sols, size=size, replace=False)
    return pareto_params[chosen_indices]
Member

This function is not used; how about removing it?

def sample_relative(
    self, study: Study, trial: FrozenTrial, search_space: dict[str, BaseDistribution]
) -> dict[str, Any]:
    if search_space == {}:
Member

Suggested change
-    if search_space == {}:
+    if study._is_multi_objective():
+        raise ValueError("TurboSampler does not support multi-objective optimization.")
+    if search_space == {}:

How about checking whether this is multi-objective or not, as in the following code?

if study._is_multi_objective():
    raise ValueError("CARBOSampler does not support multi-objective optimization.")

Comment on lines 215 to 218
for id in range(self._n_trust_region):
    if len(self._trial_ids_for_trust_region[id]) < self._n_startup_trials:
        self._trial_ids_for_trust_region[id].append(trial._trial_id)
        return {}
Member

Suggested change
-    for id in range(self._n_trust_region):
-        if len(self._trial_ids_for_trust_region[id]) < self._n_startup_trials:
-            self._trial_ids_for_trust_region[id].append(trial._trial_id)
-            return {}
+    if any(len(ids) < self._n_startup_trials for ids in self._trial_ids_for_trust_region):
+        return {}

These lines can be simplified.

Collaborator Author

Thank you for the suggestion. While your approach simplifies the code significantly, in this case I need to append the trial_id to self._trial_ids_for_trust_region[id]. As you pointed out, the original code was somewhat hard to follow, so I have refactored it to make it clearer. I would appreciate any further suggestions or feedback you may have.
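
For concreteness, one way the refactored logic might read (names follow the snippet above; this is a sketch rather than the exact committed code):

# Sketch: find trust regions that still need startup trials.
needs_startup = [
    id
    for id in range(self._n_trust_region)
    if len(self._trial_ids_for_trust_region[id]) < self._n_startup_trials
]
if needs_startup:
    # Assign this trial to the first such trust region, then fall back to
    # random sampling by returning an empty dict.
    self._trial_ids_for_trust_region[needs_startup[0]].append(trial._trial_id)
    return {}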

Comment on lines 232 to 234
if len(trials) < self._n_startup_trials:
    self._trial_ids_for_trust_region[id].append(trial._trial_id)
    return {}
Member

These lines may be redundant.

Comment on lines 239 to 244
_sign = np.array(
    [-1.0 if d == StudyDirection.MINIMIZE else 1.0 for d in study.directions]
)
standardized_score_vals, _, _ = _standardize_values(
    _sign * np.array([trial.values for trial in trials])
)
Member

TuRBO is single-objective only, so we can simplify this part:

sign = -1.0 if study.direction == StudyDirection.MINIMIZE else 1.0

Comment on lines 348 to 363
if direction == StudyDirection.MINIMIZE:
    if values[0] < best_value:
        self._n_consecutive_success[trust_region_id] += 1
        self._n_consecutive_failure[trust_region_id] = 0
        self._best_value_in_current_trust_region[trust_region_id] = values[0]
    else:
        self._n_consecutive_success[trust_region_id] = 0
        self._n_consecutive_failure[trust_region_id] += 1
else:
    if values[0] > best_value:
        self._n_consecutive_success[trust_region_id] += 1
        self._n_consecutive_failure[trust_region_id] = 0
        self._best_value_in_current_trust_region[trust_region_id] = values[0]
    else:
        self._n_consecutive_success[trust_region_id] = 0
        self._n_consecutive_failure[trust_region_id] += 1
Member

This branch can be simplified by checking whether (direction == StudyDirection.MINIMIZE) != (values[0] < best_value).

Collaborator Author

Although this is not a major issue, using (direction == StudyDirection.MINIMIZE) != (values[0] < best_value) would count the case where direction == StudyDirection.MAXIMIZE and values[0] == best_value as a success, which is not ideal.
To make the logic clearer, I instead compute whether the value has improved first:

is_better = (
    values[0] < best_value
    if direction == StudyDirection.MINIMIZE
    else values[0] > best_value
)
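
The counter update then collapses to something like this (a sketch based on the original snippet above, not necessarily the exact committed code):

if is_better:
    self._n_consecutive_success[trust_region_id] += 1
    self._n_consecutive_failure[trust_region_id] = 0
    self._best_value_in_current_trust_region[trust_region_id] = values[0]
else:
    self._n_consecutive_success[trust_region_id] = 0
    self._n_consecutive_failure[trust_region_id] += 1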

Comment on lines 378 to 379
if self._length[trust_region_id] < self._min_length:
    self.reset_trust_region(trust_region_id)
Member

How about checking this condition first?
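
One possible reading of this suggestion, as a sketch (the surrounding method body is assumed, not shown in the diff):

if self._length[trust_region_id] < self._min_length:
    # Sketch: restart the trust region first and skip the remaining
    # success/failure length bookkeeping for this trial.
    self.reset_trust_region(trust_region_id)
    return
# ... otherwise proceed with the usual length update ...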

@sawa3030 (Collaborator, Author) commented:

Thank you for the suggestions on this large PR. I’ve incorporated some changes based on your feedback. PTAL!
