[Tuner][Codegen] Add iree_codegen.constraints op and supporting infra #23687
kuhar merged 2 commits into iree-org:main
Conversation
Today, the tuner has to know a lot about the IREE compiler and its compilation pipelines, including what inputs they can compile, their configuration space, their lowering_config format, etc. This is partially encoded directly as Python logic and partially exposed through Python bindings. This is the first step towards moving the constraint generation responsibility from the tuner to the compiler.

The constraints op encodes pipeline constraints over one or more root ops and can later be lowered to SMT-LIB for solving by the tuner, or verified against the selected lowering_config after dispatch configuration. See the discussion for the full proposal: iree-org#23521

Key design decisions:
- The knobs dict structurally mirrors the lowering_config / translation_info attributes, with tunable leaves as `#iree_codegen.int_knob<"name">`. This makes it possible for the tuner to mechanically substitute solved knob values back into concrete attributes without understanding their structure (see the sketch below).
- Problem dimensions come in as index operands (from tensor.dim / constants), so that static sizes get constant-folded into the SMT body and dynamic shapes remain symbolic.
- The body uses upstream SMT dialect ops, so that we can directly export to SMT-LIB without a custom lowering.
- Pipeline attr accepts both DispatchLoweringPassPipelineAttr and the new PipelineAttrInterface (iree-org#23590) to support custom pipelines.

Issue: #23535

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
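To make the first design point concrete, here is a hypothetical tuner-side sketch (not part of this PR): the nested-dict representation and the `int_knob:` string placeholders stand in for the real attributes, but the mechanical-substitution idea is the same.

```python
# Hypothetical sketch only: a tuner-side substitution pass over a
# lowering_config-shaped structure. The nested-dict representation and the
# "int_knob:<name>" placeholder encoding are illustrative, not the actual
# attribute format added by this PR.
from typing import Any, Dict


def substitute_knobs(node: Any, solution: Dict[str, int]) -> Any:
    """Recursively replace int_knob placeholders with solved integer values."""
    if isinstance(node, str) and node.startswith("int_knob:"):
        # Tunable leaf, e.g. "int_knob:tile_m" -> solution["tile_m"].
        return solution[node.removeprefix("int_knob:")]
    if isinstance(node, dict):
        return {key: substitute_knobs(value, solution) for key, value in node.items()}
    if isinstance(node, list):
        return [substitute_knobs(value, solution) for value in node]
    # Anything else (already-concrete ints, enum strings, ...) passes through.
    return node


# Example: a knobs dict mirroring a lowering_config, with two tunable leaves.
knobs = {"workgroup": ["int_knob:tile_m", "int_knob:tile_n", 0], "reduction": [0, 0, 64]}
solved = {"tile_m": 128, "tile_n": 256}
print(substitute_knobs(knobs, solved))
# {'workgroup': [128, 256, 0], 'reduction': [0, 0, 64]}
```

Because the knobs dict mirrors the structure of the concrete attribute, a generic tree walk like this is all the tuner needs once the solver has produced values; it never has to interpret what any particular field means.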
Is part of the implementation still missing? I know that none of these ops will be lowered; instead, they are used by the tuner. But I'm a little curious where the tuner code that uses these knobs lives. Is it in a different repo?
I'm landing this piecewise to limit the PR size. There are no consumers as of now. Check the discussion in the PR description for the e2e prototype.
amd-eochoalo left a comment
I still need a bit more time to review the overall design, but this PR looks good.
I was thinking about suggesting a different target that includes the SMT dialect (since it is only useful for the tuner), but I think keeping the build system simple, where the cost is only linking against the SMT dialect, is a good trade-off.
The plan is to also use it for verification after the usual compilation path decides the lowering config. But even if we did move it to a new target, I'm not sure what the benefit would be, since everything is statically linked anyway.
…anslationInfoAttr (#23868)

Context: some changes landed on the IREE side:
- #23590
- #23687
- #23816

and a tuner CI error: https://github.com/nod-ai/amd-shark-ai/actions/runs/23314739415/job/67811065632?pr=2865#step:8:135

This PR fixes the C API assertion in `TranslationInfoAttr.get()` to accept `PipelineAttr` in addition to `DispatchLoweringPassPipelineAttr`.

Assisted-by: [Claude Code](https://claude.ai/code)
Signed-off-by: Bangtian Liu <liubangtian@gmail.com>
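For reference, a hedged sketch of the kind of tuner-side binding call involved. Only `TranslationInfoAttr.get()`, `DispatchLoweringPassPipelineAttr`, and `PipelineAttr` come from the PR text; the import paths, enum member, and argument lists are assumptions and may differ from the actual bindings.

```python
# Hedged sketch only: based on the iree.compiler Python bindings used by the
# tuner; exact signatures, optional arguments, and enum members may differ.
from iree.compiler import ir
from iree.compiler.dialects import iree_codegen

with ir.Context() as ctx:
    # The enum-backed pipeline attribute that TranslationInfoAttr.get() has
    # always accepted.
    pipeline_attr = iree_codegen.DispatchLoweringPassPipelineAttr.get(
        iree_codegen.DispatchLoweringPassPipeline.LLVMGPUVectorDistribute, ctx
    )
    # Optional arguments (workgroup size, subgroup size, configuration) omitted.
    translation_info = iree_codegen.TranslationInfoAttr.get(pipeline_attr)
    # After #23868, passing a PipelineAttr (the custom-pipeline attribute from
    # #23590) here no longer trips the C API assertion; previously only
    # DispatchLoweringPassPipelineAttr was accepted.
    print(translation_info)
```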