
Conversation

Collaborator

@nickfraser nickfraser commented Aug 22, 2025

Builds on #1356. Introduces a BenchmarkSearchMixin, which determines how experiments are generated with the benchmark utils. The first concrete mixin, GridSearchMixin, replicates the behaviour of the benchmark utils before this PR.

The second is RandomSearchMixin, which allows a sampling strategy to be specified per parameter, for example:

act_equalization:
  rand_type: choices
  rand_values: [null, "layerwise", "fx"]
act_equalization_alpha:
  rand_type: linear
  rand_values: [0.05, 0.95]
gptq:
  rand_type: const
  rand_values: true
learned_round_lr:
  rand_type: log2
  rand_values: [0.0001, 0.1]
learned_round_scale_lr:
  rand_type: exp2
  rand_values: [0.0001, 0.1]
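
For reference, a minimal sketch of how a spec like the one above could be sampled (hypothetical helper names; the actual sampling lives in RandomSearchMixin, and the exact log2/exp2 semantics may differ from what is shown here):

import math
import random
from typing import Any

def sample_param(rand_type: str, rand_values: Any, rng: random.Random) -> Any:
    if rand_type == "const":
        # Fixed value, e.g. gptq: true
        return rand_values
    if rand_type == "choices":
        # Uniform choice from a discrete set, e.g. act_equalization
        return rng.choice(rand_values)
    if rand_type == "linear":
        # Uniform float in [min, max], e.g. act_equalization_alpha
        lo, hi = rand_values
        return rng.uniform(lo, hi)
    if rand_type in ("log2", "exp2"):
        # Illustrated here as log-uniform sampling over [min, max],
        # e.g. learned_round_lr / learned_round_scale_lr
        lo, hi = rand_values
        return 2.0 ** rng.uniform(math.log2(lo), math.log2(hi))
    raise ValueError(f"Unknown rand_type: {rand_type}")

rng = random.Random(1)  # matches --seed 1
print(sample_param("choices", [None, "layerwise", "fx"], rng))
print(sample_param("linear", [0.05, 0.95], rng))
print(sample_param("log2", [0.0001, 0.1], rng))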

Running as follows:

python llm_rand_benchmark.py --config benchmark_rand_template.yaml --dry-run --seed 1 --num-experiments 10

Will give the following output:

Num. experiments: 10
Benchmark args.:
        config: benchmark_rand_template.yaml
        results_folder: ./
        gpus: 0
        num_gpus_per_process: 1
        max_num_retries: 1
        dry_run: True
        num_experiments: 10
        max_experimental_configs: 100000
        seed: 1
Non-default args.:
        --act-equalization: type: choices, values: [None, 'layerwise', 'fx']
        --act-equalization-alpha: type: linear, min: 0.05, max: 0.95
        --gptq: type: const, value: True
        --learned-round-lr: type: log2, min: 0.0001, max: 0.1
        --learned-round-scale-lr: type: exp2, min: 0.0001, max: 0.1

Note: a limitation of the current approach is that worker queue generation (i.e., building the random search space) and execution (i.e., running the experiments) are two separate steps, which makes it difficult to feed experiment results back into the search-space selection (e.g., for Bayesian optimization or simulated annealing).
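
To make the distinction concrete, here is an illustrative sketch (hypothetical stubs, not the PR's code) contrasting the current two-step flow with a feedback-driven loop:

import random
from typing import Any, Dict, List, Tuple

# Stubs standing in for the real benchmark machinery (hypothetical).
def run_experiment(cfg: Dict[str, Any]) -> float:
    return random.random()  # pretend metric

def sample_config(rng: random.Random) -> Dict[str, Any]:
    return {"learned_round_lr": 2.0 ** rng.uniform(-13, -3)}

rng = random.Random(1)

# Current approach: the whole worker queue is generated up front, then executed.
queue: List[Dict[str, Any]] = [sample_config(rng) for _ in range(10)]
results = [(cfg, run_experiment(cfg)) for cfg in queue]

# A feedback-driven search (e.g. Bayesian optimization) would instead interleave
# proposal and execution, using past results to pick the next config, which the
# current two-step split does not support:
history: List[Tuple[Dict[str, Any], float]] = []
for _ in range(10):
    cfg = sample_config(rng)  # a real method would condition on `history`
    history.append((cfg, run_experiment(cfg)))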

@nickfraser nickfraser marked this pull request as ready for review December 2, 2025 12:43
Collaborator Author

@nickfraser nickfraser left a comment

Switch to Mixin.

@nickfraser nickfraser requested a review from pablomlago December 2, 2025 13:07
pass


class BenchmarkSearchMixin(ABC):
Collaborator

@pablomlago pablomlago Dec 2, 2025

Indicate that the concrete classes need to provide the abstract class variable argument_parser, e.g.

    @property
    @abstractmethod
    def argument_parser(self) -> ArgumentParser:
        pass

Collaborator Author

Not 100% sure about this... It's already specified in class BenchmarkUtils(ABC):. Do you want it in class BenchmarkSearchMixin(ABC): as well, or instead of, in BenchmarkUtils?
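
For illustration, the two options under discussion would look roughly like this (a sketch, not code from the PR):

from abc import ABC, abstractmethod
from argparse import ArgumentParser

class BenchmarkUtils(ABC):
    # Currently declares the abstract property.
    @property
    @abstractmethod
    def argument_parser(self) -> ArgumentParser:
        pass

class BenchmarkSearchMixin(ABC):
    # Suggestion: also (or instead) declare it here, so the mixin documents
    # its own dependency on argument_parser.
    @property
    @abstractmethod
    def argument_parser(self) -> ArgumentParser:
        pass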

return id_str


class RandomSearchMixin(BenchmarkSearchMixin):
Collaborator

@pablomlago pablomlago Dec 2, 2025

There is a subset of the functionality of standardize_args which is shared between GridSearchMixin and RandomSearchMixin, e.g. YAML reading. Is it possible to extract the common functionality to the abstract parent class?
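
For illustration, the extraction could look roughly like this (a sketch with hypothetical method names, not the PR's code):

from abc import ABC, abstractmethod

import yaml

class BenchmarkSearchMixin(ABC):
    def read_config(self, config_path: str) -> dict:
        # Shared YAML reading, reused by GridSearchMixin and RandomSearchMixin.
        with open(config_path, "r") as f:
            return yaml.safe_load(f)

    @abstractmethod
    def standardize_args(self, args_dict: dict) -> dict:
        # Search-strategy-specific normalization stays in the subclasses.
        pass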

return args_dict

@staticmethod
def parse_config_args(args: List[str]) -> Namespace:
Collaborator

Most of the arguments are shared between GridSearchMixin and RandomSearchMixin; I would consider extracting the common logic.
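
For illustration, the shared options could be built once and extended per mixin, roughly as follows (a sketch; argument names are taken from the dry-run output above, types and defaults are guesses):

from argparse import ArgumentParser

def common_benchmark_parser() -> ArgumentParser:
    # Options shared by GridSearchMixin and RandomSearchMixin.
    parser = ArgumentParser()
    parser.add_argument("--config", type=str, required=True)
    parser.add_argument("--results-folder", type=str, default="./")
    parser.add_argument("--gpus", type=int, default=0)
    parser.add_argument("--num-gpus-per-process", type=int, default=1)
    parser.add_argument("--max-num-retries", type=int, default=1)
    parser.add_argument("--dry-run", action="store_true")
    parser.add_argument("--seed", type=int, default=None)
    return parser

# RandomSearchMixin would then add its extra options on top, e.g.
# --num-experiments and --max-experimental-configs.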

q = q[start_index:end_index]
args_dict = entrypoint_utils.standardize_args(script_args)
# Generate a list of experiments
q = entrypoint_utils.gen_search_space(args_dict, script_args)
Collaborator

Maybe it is worth renaming q to something more self-explanatory.
