
Adding test_group to lm-eval configs#2623

Open
debroy-rh wants to merge 7 commits into vllm-project:main from debroy-rh:lmeval_conggrp

Conversation

@debroy-rh

Adding test_group to the following lm-eval configs:

  • fp8_dynamic_per_token.yaml
  • w4a4_nvfp4.yaml
  • w4a16_actorder_none.yaml
  • vl_w4a16_actorder_weight.yaml

This is to test the rhaiis model-opt image for lm-eval accuracy.
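As a sanity check, the presence of the new field can be confirmed with a one-liner (a sketch; the file paths come from the list above, and the expected output is illustrative, assuming the field sits right under cadence: as in the diff hunks below):

    grep -n '^test_group:' \
      tests/lmeval/configs/fp8_dynamic_per_token.yaml \
      tests/lmeval/configs/w4a4_nvfp4.yaml \
      tests/lmeval/configs/w4a16_actorder_none.yaml \
      tests/lmeval/vl_configs/vl_w4a16_actorder_weight.yaml
    # Expected (illustrative): one line per file, e.g.
    #   tests/lmeval/configs/fp8_dynamic_per_token.yaml:2:test_group: "rhaiis"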

@github-actions

👋 Hi! Thank you for contributing to llm-compressor. Please add the ready label when the PR is ready for review.

Note: This is required to complete the testing suite; please only add the label once the PR is code complete and local testing has been performed.

@coderabbitai
Contributor

coderabbitai Bot commented Apr 16, 2026

Important

Review skipped

Auto incremental reviews are disabled on this repository.

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

⚙️ Run configuration

Configuration used: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Run ID: 9124ff09-509b-4d07-aa1d-dddecde62d2c

You can disable this status message by setting reviews.review_status to false in the CodeRabbit configuration file.


Walkthrough

Added a new top-level configuration field test_group: "rhaiis" to four test configuration YAML files across the tests/lmeval/configs/ and tests/lmeval/vl_configs/ directories without modifying any other existing fields or logic.

Changes

Cohort / File(s): Test Configuration Files
tests/lmeval/configs/fp8_dynamic_per_token.yaml, tests/lmeval/configs/w4a16_actorder_none.yaml, tests/lmeval/configs/w4a4_nvfp4.yaml, tests/lmeval/vl_configs/vl_w4a16_actorder_weight.yaml
Summary: Added the test_group: "rhaiis" top-level configuration field to each YAML file.

Estimated code review effort

🎯 1 (Trivial) | ⏱️ ~2 minutes

Suggested labels

enhancement, fp8, w4a16, nvfp4

🚥 Pre-merge checks | ✅ 3

✅ Passed checks (3 passed)

  • Title check: ✅ Passed. The title clearly and accurately describes the main change: adding a test_group field to lm-eval configuration files.
  • Description check: ✅ Passed. The description is directly related to the changeset, listing the specific files modified and explaining the purpose of adding the test_group field.
  • Docstring Coverage: ✅ Passed. No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check.


Contributor

@gemini-code-assist Bot left a comment


Code Review

This pull request adds the "test_group: rhaiis" field to several YAML configuration files in the tests/lmeval directory. Feedback highlights that the run_tests_in_rhaiis.sh script requires updates to correctly resolve the paths for these configuration files and to refine the model name extraction logic to handle nested keys in Vision-Language configurations.

@@ -1,4 +1,5 @@
cadence: "weekly"
test_group: "rhaiis"
Contributor


high

The run_tests_in_rhaiis.sh script (line 53) contains a hardcoded path prefix ${script_path}/configs/ which resolves to tests/e2e/vLLM/configs/. Since this configuration file is located in tests/lmeval/configs/, the script will fail to find it even if the correct directory is passed via the -c flag. The script should be updated to use the provided $CONFIG path instead of the hardcoded one: for MODEL_CONFIG in $(echo -e "$CONFIGS" | sed "s|^|${CONFIG}/|").
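A minimal sketch of the suggested fix, assuming $CONFIGS holds newline-separated config file names and $CONFIG is the directory passed via -c (variable names follow this comment, not verified against the actual script):

    #!/usr/bin/env bash
    # Hypothetical stand-ins for the script's variables.
    CONFIG="tests/lmeval/configs"                # directory passed via -c
    CONFIGS=$'fp8_dynamic_per_token.yaml\nw4a4_nvfp4.yaml'

    # Before: the hardcoded prefix resolves relative to the script's own
    # directory, i.e. tests/e2e/vLLM/configs/, so lmeval configs are never found.
    #   for MODEL_CONFIG in $(echo -e "$CONFIGS" | sed "s|^|${script_path}/configs/|")

    # After: prefix each config name with the caller-supplied directory instead.
    for MODEL_CONFIG in $(echo -e "$CONFIGS" | sed "s|^|${CONFIG}/|"); do
      echo "would run tests against ${MODEL_CONFIG}"  # placeholder for the real invocation
    done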

@@ -1,4 +1,5 @@
cadence: "weekly"
test_group: "rhaiis"
Contributor


high

The run_tests_in_rhaiis.sh script (line 59) uses grep 'model:' to extract the model name. For Vision-Language (VL) configurations like this one, there is typically a nested model key under the lmeval section (e.g., model: "hf-multimodal"). This causes the grep command to return multiple lines, which breaks the $model variable and the subsequent save_dir construction. The script should be updated to use a more specific pattern, such as grep '^model:', to only match the top-level key.
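The over-match is easy to reproduce; here is a sketch against a hypothetical VL config (the top-level model name is a placeholder; only the nested model: "hf-multimodal" key comes from this comment):

    #!/usr/bin/env bash
    # Hypothetical VL config with two lines containing "model:".
    cat > /tmp/vl_config.yaml <<'EOF'
    cadence: "weekly"
    test_group: "rhaiis"
    model: "some-org/some-vl-model"
    lmeval:
      model: "hf-multimodal"
    EOF

    grep 'model:' /tmp/vl_config.yaml    # matches both lines, so $model gets two values
    grep '^model:' /tmp/vl_config.yaml   # anchored to the line start: top-level key only

    # With the anchored pattern, extraction yields a single clean value.
    model=$(grep '^model:' /tmp/vl_config.yaml | awk -F'"' '{print $2}')
    echo "model=${model}"                # -> model=some-org/some-vl-model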

@coderabbitai Bot added the enhancement (New feature or request), fp8 (For any issue / PR related to FP8 support), nvfp4 (For any PR / issue related to NVFP4 support), and w4a16 labels Apr 16, 2026
Collaborator

@dsikka left a comment


I think RHAIIS testing should use vLLM lm-eval testing, not HF as is currently used. This is something we are going to update upstream soon, but it is definitely what we should use for the product release.
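For context, the two backends differ roughly like this in the lm-evaluation-harness CLI (a sketch; the model name and task are placeholders, not from this PR):

    # HF backend, as the lmeval tests currently use:
    lm_eval --model hf \
      --model_args pretrained=some-org/some-model \
      --tasks gsm8k

    # vLLM backend, which exercises the serving stack that RHAIIS actually ships:
    lm_eval --model vllm \
      --model_args pretrained=some-org/some-model \
      --tasks gsm8k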

Collaborator

@dhuangnm left a comment


LGTM in general, just one question about a model config.

I think this PR needs to land along with the corresponding changes in the llm-compressor-testing repo. @debroy-rh can you please create the PR for the llm-compressor-testing repo as well so we can test the two PRs together before landing? Thanks.

Outdated review thread on tests/lmeval/configs/w4a16_actorder_none.yaml (collapsed)
Signed-off-by: Debolina Roy <debroy@redhat.com> (all 7 commits)
@debroy-rh
Author

@dhuangnm, @dsikka - I was trying to run lm-eval for the vl_configs file vl_w4a16_actorder_weight.yaml, and it was failing for two reasons:

  1. Parsing error: the config file has two "model" tags, the second one under lmeval, so tests/e2e/vLLM/run_tests_in_rhaiis.sh needed to be updated.
  2. torchvision added to install_requires, so the same default install that already pulls in torch also pulls in torchvision, because the VLM / processor path needs it. Also added an lmeval conftest bootstrap so pytest loads torchvision before Transformers.

Please take a look.
Run - https://github.com/neuralmagic/llm-compressor-testing/actions/runs/24686238622/job/72196254120

@mergify Bot added the two-reviews (When a PR requires two reviews) label Apr 22, 2026
@mergify
Contributor

mergify Bot commented Apr 22, 2026

Merge Protections

Your pull request matches the following merge protections and will not be merged until they are valid.

🔴 Require two reviews

Waiting for:

  • #approved-reviews-by >= 2
  • #changes-requested-reviews-by = 0
This rule is failing.

PRs labelled "two-reviews" must have at least two approving reviews before merging.



Labels

  • enhancement: New feature or request
  • fp8: For any issue / PR related to FP8 support
  • nvfp4: For any PR / issue related to NVFP4 support
  • two-reviews: When a PR requires two reviews
  • w4a16
