Add sparsity to benchmarking #1917

jainapurva · 2025-03-18T02:34:18Z

Add sparsity support for benchmarking. The following support has been added

Sparsity Techniques, with Sparsify_
Sparsity + Quantize
Support filter functions, will need to be added later

pytorch-bot · 2025-03-18T02:34:21Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/1917

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure

As of commit ef4cf36 with merge base 09c2760 ():

NEW FAILURE - The following job has failed:

.github/workflows/float8nocompile_test.yaml (gh)

This comment was automatically generated by Dr. CI and updates every 15 minutes.

jcaip

thanks for working on this! just a couple questions but otherwise looks good

jcaip · 2025-03-24T23:00:50Z

benchmarks/microbenchmarks/benchmark_inference.py

@@ -44,11 +45,33 @@ def run(config: BenchmarkConfig) -> BenchmarkResult:

    # Use quantize_ to apply each quantization function to the model
    m_copy = deepcopy(base_model).eval().to(config.device)
-    quantization_config = string_to_config(
-        config.quantization, high_precision_dtype=config.high_precision_dtype
+    aoBaseConfig = string_to_config(


probably camel_case is better here?

jcaip · 2025-03-24T23:08:23Z

benchmarks/microbenchmarks/test/benchmark_config.yml

@@ -1,9 +1,13 @@
 # Sample configuration for inference benchmarks
 benchmark_mode: "inference"
 quantization_config_recipe_names:
-  - "baseline"
+  # - "baseline" Will always run a baseline instatance


should this be commented out?

We're running baseline case as default for any benchmarking param. the reason I listed it here as a comment is because I wanted to let users know that this will always run. Maybe I can simply add it to readme, and write the comment like
# Will run a baseline inference for model by default, without quantization for comparison

nit: yeah I think that's better, I would just make it clear that it's not some commented out code.

jcaip · 2025-03-24T23:08:30Z

benchmarks/microbenchmarks/test/benchmark_config.yml

-  - "int4wo-128"
+  - "marlin"
+sparsity_config_recipe_names:
+  # - "none" Will always run a without sparsity instance


Same as above

jcaip · 2025-03-24T23:09:13Z

benchmarks/microbenchmarks/test/test_benchmark_inference.py

+        # Mock string_to_config to return valid configs
+        from torchao.quantization import Int4WeightOnlyConfig
+        from torchao.sparsity.sparse_api import (
+            BlockSparseWeightConfig,


I don't think we need BlockSparseWeightConfig here - should be semi-structured sparsity no?

jcaip · 2025-03-24T23:10:09Z

benchmarks/microbenchmarks/test/test_benchmark_inference.py

+        self.assertIsInstance(result, BenchmarkResult)
+        self.assertTrue(hasattr(result, "model_inference_time_in_ms"))
+
+        # Test with block sparsity


Oh, I see - can we split this into two tests then, one for int4+2:4 marlin, and one for block sparsity?

Let me try that

ghstack-source-id: c9e900d ghstack-comment-id: 2752688588 Pull Request resolved: #1955

ghstack-source-id: ac08cd2 ghstack-comment-id: 2752688653 Pull Request resolved: #1956

ghstack-source-id: 4cf77f8 ghstack-comment-id: 2752688734 Pull Request resolved: #1957

ghstack-source-id: 8b459e2 ghstack-comment-id: 2752688801 Pull Request resolved: #1958

ghstack-source-id: ded0179 ghstack-comment-id: 2752688871 Pull Request resolved: #1959

ghstack-source-id: df82466 ghstack-comment-id: 2752688926 Pull Request resolved: #1960

This reverts commit d71baa3.

jcaip · 2025-03-26T17:01:37Z

benchmarks/microbenchmarks/test/test_utils.py

@@ -139,6 +191,7 @@ def test_generate_results_csv(self):
            BenchmarkResult(
                BenchmarkConfig(
                    quantization="int8wo",
+                    sparsity="None",


super nit: why string None and not just None here?

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Mar 18, 2025

jainapurva requested review from jcaip, jerryzh168 and HDCharles March 24, 2025 22:45

jainapurva marked this pull request as ready for review March 24, 2025 22:46

jcaip reviewed Mar 24, 2025

View reviewed changes

jainapurva requested a review from jcaip March 25, 2025 21:02

jainapurva and others added 6 commits March 25, 2025 15:28

Add sparsity support in benchmarking

c49d512

ghstack-source-id: c9e900d ghstack-comment-id: 2752688588 Pull Request resolved: #1955

Sparsity tests

407ea1d

ghstack-source-id: ac08cd2 ghstack-comment-id: 2752688653 Pull Request resolved: #1956

Add sparsify_ to benchmarking

65d4065

ghstack-source-id: 4cf77f8 ghstack-comment-id: 2752688734 Pull Request resolved: #1957

Support sparsify_ run

cea30b1

ghstack-source-id: 8b459e2 ghstack-comment-id: 2752688801 Pull Request resolved: #1958

Review updates

ae70fec

ghstack-source-id: ded0179 ghstack-comment-id: 2752688871 Pull Request resolved: #1959

Add profile support

d71baa3

ghstack-source-id: df82466 ghstack-comment-id: 2752688926 Pull Request resolved: #1960

jainapurva force-pushed the bench-sparsity branch from 6e00835 to d71baa3 Compare March 25, 2025 22:30

Revert "Add profile support"

f2d24cd

This reverts commit d71baa3.

jcaip reviewed Mar 26, 2025

View reviewed changes

jcaip approved these changes Mar 26, 2025

View reviewed changes

jainapurva added 2 commits March 27, 2025 11:02

nit updates

2850389

Merge remote-tracking branch 'origin/main' into bench-sparsity

ef4cf36

jainapurva merged commit 3766ed7 into main Mar 27, 2025
18 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add sparsity to benchmarking #1917

Add sparsity to benchmarking #1917

Uh oh!

jainapurva commented Mar 18, 2025 •

edited

Loading

Uh oh!

pytorch-bot bot commented Mar 18, 2025 •

edited

Loading

Uh oh!

jcaip left a comment

Uh oh!

jcaip Mar 24, 2025

Uh oh!

jcaip Mar 24, 2025

Uh oh!

jainapurva Mar 25, 2025 •

edited

Loading

Uh oh!

jcaip Mar 26, 2025

Uh oh!

jcaip Mar 24, 2025

Uh oh!

jainapurva Mar 25, 2025

Uh oh!

jcaip Mar 24, 2025

Uh oh!

jcaip Mar 24, 2025

Uh oh!

jainapurva Mar 25, 2025

Uh oh!

jainapurva Mar 25, 2025

Uh oh!

jcaip Mar 26, 2025

Uh oh!

Uh oh!

Uh oh!

Add sparsity to benchmarking #1917

Add sparsity to benchmarking #1917

Uh oh!

Conversation

jainapurva commented Mar 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Mar 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/1917

❌ 1 New Failure

Uh oh!

jcaip left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jainapurva Mar 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

jainapurva commented Mar 18, 2025 •

edited

Loading

pytorch-bot bot commented Mar 18, 2025 •

edited

Loading

jainapurva Mar 25, 2025 •

edited

Loading