Skip to content

[MXFP8] Add marlin compressed-tensors MXFP8 support#151

Open
dsikka wants to merge 4 commits intomxfp8-marlinfrom
ct-mxfp8-marlin
Open

[MXFP8] Add marlin compressed-tensors MXFP8 support#151
dsikka wants to merge 4 commits intomxfp8-marlinfrom
ct-mxfp8-marlin

Conversation

@dsikka
Copy link
Copy Markdown

@dsikka dsikka commented Mar 20, 2026

Purpose

  • Add marlin support for MXFP8 CT models

Test Plan

  • Smoke test with the following models:
  1. nm-testing/Qwen3-0.6B-MXFP8
  2. nm-testing/TinyLlama-1.1B-Chat-v1.0-MXFP8

Test Result

  • Smoke tests pass and models produce coherent outputs

dsikka added 3 commits March 20, 2026 00:23
Signed-off-by: Dipika <dipikasikka1@gmail.com>
Signed-off-by: Dipika <dipikasikka1@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant