skip gdn core_attn_out check for 8k len due to random numeric error #264

jikunshang merged 1 commit into vllm-project:main from
Conversation
Signed-off-by: yangqun <qun.yang@intel.com>
Pull request overview
Updates the GDN attention kernel test to skip validation of `core_attn_out` for the 8192-token case due to intermittent numeric mismatches, aiming to reduce flaky failures in the XPU test suite.

Changes:
- Broadens the 8k-length skip condition for `core_attn_out` validation in `test_gdn_attention`.
Comments suppressed due to low confidence (1)
tests/gdn_attn/test_gdn_attn.py:412
`pytest.skip(...)` here skips the remainder of the test, not just the `core_attn_out` assertion (it also prevents the later `conv_state`/`ssm_state` checks from running). If the intent is only to bypass validating `core_attn_out` for the 8192-token case, prefer conditionally skipping just that assertion (e.g., guard the `assert_close(core_attn_out, ...)` with `if num_actual_tokens != 8192:`) so other validations still execute.

if num_actual_tokens == 8192:
    pytest.skip("FIXME, skip core_attn_out test because of random error")
torch.testing.assert_close(core_attn_out,
                           ref_core_attn_out,
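The reviewer's suggestion can be sketched without the real test harness: guard only the flaky comparison so the remaining checks still run. The helper below is purely illustrative (the names `run_checks`, `outputs`, and `refs` are not from the vLLM test) and stands in for the torch assertions.

```python
def run_checks(num_actual_tokens, outputs, refs):
    """Illustrative stand-in for the test body: skip only the
    core_attn_out comparison for the flaky 8192-token case, so the
    conv_state/ssm_state validations still execute."""
    results = {}
    if num_actual_tokens != 8192:
        # In the real test this would be torch.testing.assert_close(...)
        results["core_attn_out"] = outputs["core_attn_out"] == refs["core_attn_out"]
    # State checks run for every sequence length, including 8192.
    results["conv_state"] = outputs["conv_state"] == refs["conv_state"]
    results["ssm_state"] = outputs["ssm_state"] == refs["ssm_state"]
    return results
```

With this shape, an 8k run still validates the convolutional and SSM states instead of skipping the whole test.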
if num_actual_tokens == 8192:
    pytest.skip("FIXME, skip core_attn_out test because of random error")
This change broadens the skip condition from `ssm_state_is_fp32 and num_actual_tokens == 8192` to `num_actual_tokens == 8192`, which significantly reduces coverage for the 8k-token case across all dtypes/state configurations. If the mismatch is only present for specific configurations, keep the skip as narrow as possible; otherwise add a brief note (and ideally a tracking issue/bug ID) explaining why the 8k case is universally skipped.
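To make the coverage impact concrete, here is a small sketch (the condition names mirror the diff; `is_skipped` and the `narrow` flag are illustrative, not part of the vLLM test) comparing what the narrow and broadened conditions leave covered at 8k.

```python
def is_skipped(ssm_state_is_fp32: bool, num_actual_tokens: int,
               narrow: bool = True) -> bool:
    """Illustrative predicate mirroring the two skip conditions
    discussed above; not actual vLLM test code."""
    if narrow:
        # original condition: only fp32 ssm_state at 8k is skipped
        return ssm_state_is_fp32 and num_actual_tokens == 8192
    # broadened condition from this PR: every 8k config is skipped
    return num_actual_tokens == 8192

# Under the narrow condition, non-fp32 state configs at 8k stay covered:
still_covered = [fp32 for fp32 in (True, False)
                 if not is_skipped(fp32, 8192, narrow=True)]
```

Under the broadened condition that list would be empty, which is the coverage loss the comment is pointing at.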
-if ssm_state_is_fp32 and num_actual_tokens == 8192:
+if num_actual_tokens == 8192:
     pytest.skip("FIXME, skip core_attn_out test because of random error")
The skip message is very generic ("random error") and doesn’t provide enough context to act on. Consider including the failing symptom (e.g., max diff / which output diverges), the platform (PVC/XPU), and a link to a tracking issue so this doesn’t become a permanent skip.
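One way to act on this feedback is to compose the skip message from the observed symptom, the platform, and a tracking link. The helper below is a hypothetical sketch (none of these names exist in the test, and the placeholder values would come from an actual failing run), not vLLM code.

```python
def build_skip_reason(symptom: str, platform: str, issue_url: str) -> str:
    """Compose an actionable pytest.skip message (illustrative only)."""
    return f"FIXME({issue_url}): {symptom} on {platform}; skip until root-caused"

# Example usage with placeholder values; the real max diff and issue
# link would be filled in from an actual failing run:
reason = build_skip_reason(
    "core_attn_out max diff exceeds tolerance at 8192 tokens",
    "XPU (PVC)",
    "<tracking-issue-url>",
)
```

A message in this shape answers the reviewer's three questions (what diverges, where, and who is tracking it) directly at the skip site.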
torch.testing.assert_close(z, ref_z, atol=atol, rtol=rtol)

-if ssm_state_is_fp32 and num_actual_tokens == 8192:
+if num_actual_tokens == 8192:
The PR description template is still unfilled (Purpose/Test Plan/Test Result). Please add at least a short purpose statement and a test command/result, especially since this change relaxes test validation for the 8k-token path.
please help root cause later, thanks!

sure
Essential Elements of an Effective PR Description Checklist

- Update `supported_models.md` and `examples` for a new model.

PLEASE FILL IN THE PR DESCRIPTION HERE ENSURING ALL CHECKLIST ITEMS ABOVE HAVE BEEN CONSIDERED.
Purpose
Test Plan
Test Result
(Optional) Documentation Update