[Torch] Support dynamic head_dim in attention conversion#23680

Closed
keshavvinayak01 wants to merge 1 commit into iree-org:main from keshavvinayak01:users/keshavvinayak01/sdpa-dynamic-head-dim

Conversation

@keshavvinayak01
Contributor

When head_dim is dynamic, scale is computed at runtime via index_cast + sitofp + rsqrt instead of being folded to a constant.

Static head_dim behavior is unchanged.

Removes the NYI error for dynamic head_dim and computes the scale at runtime via math.rsqrt when the dimension is not statically known. Adds lit tests for the dynamic head_dim cases.
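The runtime-scale semantics described above can be sketched as follows. This is an illustrative Python model, not IREE code: the function name is hypothetical, and it only mirrors what the index_cast + sitofp + rsqrt chain computes in the lowered IR.

```python
import math

def attention_scale(head_dim: int) -> float:
    # Mirrors the dynamic-head_dim lowering: the dimension value is
    # cast to a float (index_cast + sitofp in the IR) and the scale is
    # its reciprocal square root (math.rsqrt), i.e. 1/sqrt(head_dim).
    # With a static head_dim the compiler folds this to a constant;
    # the computed value is identical either way.
    return 1.0 / math.sqrt(float(head_dim))

# Example: the common head_dim of 64 yields the familiar 1/8 scale.
print(attention_scale(64))  # 0.125
```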

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Signed-off-by: Keshav Vinayak Jha <keshavvinayakjha@gmail.com>
@keshavvinayak01
Contributor Author

Closing, duplicate of #23636
