integration-vllm-test #2258
Conversation
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/2258
Note: Links to docs will display an error until the docs builds have been completed.
✅ No Failures as of commit d152dac with merge base 1017c7e.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
stack-info: PR: #2258, branch: drisspg/stack/58
test/integration/test_vllm.py
Outdated
os.environ["VLLM_USE_V1"] = "1" | ||
os.environ["VLLM_ENABLE_V1_MULTIPROCESSING"] = "0" | ||
os.environ["VLLM_TEST_STANDALONE_COMPILE"] = "1" |
Are you missing VLLM_DISABLE_COMPILE_CACHE? Can you write down, for each flag, whether it is always required vs. temporary, etc.?
I made a note here: #2239
You can see that VLLM_TEST_STANDALONE_COMPILE, which will ultimately be the default, makes it so that we can compile subclasses.
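To spell out the ask above, here is one possible annotated version of the flag setup; the required-vs-temporary notes are assumptions drawn from this thread and #2239, not confirmed vLLM documentation:

```python
import os

# Sketch only: annotations are assumptions from the review discussion.
os.environ["VLLM_USE_V1"] = "1"  # target the V1 engine for this test
os.environ["VLLM_ENABLE_V1_MULTIPROCESSING"] = "0"  # keep the engine in-process so the test stays deterministic
os.environ["VLLM_TEST_STANDALONE_COMPILE"] = "1"  # temporary: lets tensor subclasses compile; expected to become the default
os.environ["VLLM_DISABLE_COMPILE_CACHE"] = "1"  # suggested in review: skip vLLM's compile cache for the test run
```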
This is a good point; I think we should actually just set this in the AO integration upstream.
You mean modify these flags when people use AO?
Yeah, in the AO integration, if someone is loading an AO model we should just set this flag. I will make a PR.
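A rough sketch of what that upstream change might look like; the helper name and the config check are illustrative placeholders, not the actual PR:

```python
import os

def maybe_enable_standalone_compile(model_config) -> None:
    # Hypothetical helper: if the loaded model is torchao-quantized, set the
    # flag needed to compile tensor subclasses. Names here are illustrative.
    if getattr(model_config, "quantization", None) == "torchao":
        # Temporary until standalone compile becomes the vLLM default.
        os.environ.setdefault("VLLM_TEST_STANDALONE_COMPILE", "1")
```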
test/integration/test_vllm.py
Outdated
print(f"Quick test - Input: {test_input}, Output: {decoded}") | ||
|
||
# Save quantized model | ||
print(f"Saving quantized model to {output_dir}...") |
Should we delete these debug prints in the end?
Stacked PRs:
integration-vllm-test