
other(torch_compile): parametrize e2e test over RBLN_WEIGHT_FREE (wf_on xfail)#647

Open
rebel-jinhwan wants to merge 1 commit into dev from test/weight-free-e2e-parametrize



@rebel-jinhwan rebel-jinhwan commented Jun 5, 2026

Contributor

Add an autouse, parametrized fixture in the e2e conftest so every e2e test runs once with weight-free compilation off and once on, without touching the individual test files.

RBLN_WEIGHT_FREE is read by the rebel compiler at compile time inside the spawned EngineCore worker, so setting it before the LLM is built is enough; no separate test process is needed. Because the compile-cache hash does not include the weight-free mode, the cache is disabled so that the wf_on leg cannot silently hit the artifact compiled for the off leg.
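The cache concern can be illustrated with a toy model. This is only a sketch: `cache_key` and `compile_cached` are hypothetical stand-ins, not the rebel compiler's API, and the `"0"`/`"1"` values for RBLN_WEIGHT_FREE are assumed for illustration. The key is derived from the graph alone, so the second compile returns the artifact built under the first mode unless the cache is bypassed.

```python
import hashlib
import os


def cache_key(graph_src: str) -> str:
    # Hypothetical key: hashes only the graph, NOT the weight-free mode.
    return hashlib.sha256(graph_src.encode()).hexdigest()


def compile_cached(graph_src: str, cache: dict, use_cache: bool = True) -> str:
    # Toy "compiler": reads the mode from the environment at compile time.
    mode = os.environ.get("RBLN_WEIGHT_FREE", "0")  # "0"/"1" values are assumed
    key = cache_key(graph_src)
    if use_cache and key in cache:
        # Cache hit: the stored artifact is returned even if the mode changed.
        return cache[key]
    artifact = f"artifact(weight_free={mode})"
    if use_cache:
        cache[key] = artifact
    return artifact
```

With a shared `cache` dict, compiling the same graph first under `RBLN_WEIGHT_FREE=0` and then under `RBLN_WEIGHT_FREE=1` returns the off artifact both times; passing `use_cache=False` on the second call yields the weight-free artifact, which is the effect that disabling the cache has in this toy model.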

The wf_on leg is marked xfail (non-strict) because the weight-free path is not yet correctness-complete across all models and features; flip it to strict=True once the path is fully supported.
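For reviewers who have not opened the diff, the fixture described above might look roughly like the following. This is a minimal sketch, not the PR's actual code: the fixture name `weight_free_mode`, the `wf_off` id, and the `"0"`/`"1"` values are assumptions. The cache-disabling step is left out because this description does not name the switch used for it.

```python
import pytest

WEIGHT_FREE_ENV = "RBLN_WEIGHT_FREE"

# One parameter set per leg. The "0"/"1" values and the wf_off id are assumed.
WEIGHT_FREE_PARAMS = [
    pytest.param("0", id="wf_off"),
    pytest.param(
        "1",
        id="wf_on",
        # Non-strict xfail: a failure is reported as xfail, a pass as xpass,
        # and neither fails the run. Change to strict=True once supported.
        marks=pytest.mark.xfail(
            strict=False,
            reason="weight-free path not yet correctness-complete",
        ),
    ),
]


@pytest.fixture(autouse=True, params=WEIGHT_FREE_PARAMS)
def weight_free_mode(request, monkeypatch):
    # autouse + params: every test in this conftest's scope runs once per leg
    # without the test files referencing the fixture.
    monkeypatch.setenv(WEIGHT_FREE_ENV, request.param)
    # The real fixture would also disable the compile cache here (omitted).
    yield request.param
    # monkeypatch restores the previous environment after each test.
```

Because the variable is set through `monkeypatch` before the test body runs, any LLM built inside the test sees it, and the environment is restored once the test finishes.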

🚀 Summary of Changes

What does this PR do? What feature, fix, or improvement does it bring?


📌 Related Issues / Tickets

  • Resolves #
  • Related to #

✅ Type of Change

  • 🚀 Release (release)
  • ✨ Feature (feature)
  • 🧠 Model support (model)
  • 🧬 Core engine changes (core)
  • 🛠 Bug fix (fix)
  • ⚙️ Performance improvement (perf)
  • 🔁 Refactor or code cleanup (refactor)
  • 📄 Documentation (docs)
  • ❓ Other (other): please describe

🧪 How to Test

  1. Run ...
  2. Verify output: ...
  3. Edge case tested: ...

📸 Screenshots / Logs (if applicable)


📋 Checklist

  • PR title follows Conventional Commits format
  • This PR is linked to an existing issue
  • The test method is described, and the expected result is clearly stated
  • Relevant documentation has been updated (if applicable)

💬 Notes


Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
@rebel-jinhwan rebel-jinhwan changed the title from "test(torch_compile): parametrize e2e over RBLN_WEIGHT_FREE (wf_on xfail)" to "other(torch_compile): parametrize e2e over RBLN_WEIGHT_FREE (wf_on xfail)" Jun 5, 2026
@rebel-jinhwan rebel-jinhwan changed the title from "other(torch_compile): parametrize e2e over RBLN_WEIGHT_FREE (wf_on xfail)" to "other(torch_compile): parametrize e2e test over RBLN_WEIGHT_FREE (wf_on xfail)" Jun 5, 2026
@rebel-jinhwan rebel-jinhwan marked this pull request as ready for review June 5, 2026 06:01
@rebel-jinhwan rebel-jinhwan self-assigned this Jun 5, 2026
@rebel-jinhwan rebel-jinhwan added the "torch.compile" (torch.compile based implementation) label Jun 5, 2026

codecov Bot commented Jun 5, 2026


Codecov Report

✅ All modified and coverable lines are covered by tests.


Labels

torch.compile torch.compile based implementation
