[fix(externalize)] Skip fully-specialised dims in submodule re-export by gokulkrishna98 · Pull Request #7 · apple/coreai-torch

gokulkrishna98 · 2026-06-11T18:32:04Z

Description:

Fixes issue #1: "externalize: SDPA submodule re-export drops the upper bound on the key-length dim with a static query + dynamic KV context".
TODO

Testing

TODO

When externalising an SDPA submodule, ``_dynamic_shapes_from_node`` requested a Dim for every SymInt in the upstream FakeTensor's shape, including SymInts whose ``.node.expr`` had been specialised to a literal int by the parent program. Those re-exported as an unbounded ``Dim(min=1)``, and ``torch.export`` rejected the submodule with ``L['key'].size()[2] <= IntInfinity()`` whenever a model used a static query length and a dynamic KV-context length (the prefill / decode shape used by hybrid linear-attention models).

Skip SymInts whose expr is already a number: they are static dims and should not appear in ``dynamic_shapes`` at all.

Signed-off-by: gokulkrishna98 <gokulkrishna98@users.noreply.github.com>
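The skip rule described in the commit message can be sketched roughly as follows. This is not the PR's actual diff, which is not shown here. The `Stub*` classes, the `sym` helper, and `dynamic_dim_indices` are hypothetical stand-ins that model only the `.node.expr` attribute chain named above, so the example runs without torch or sympy. The `is_number` flag mirrors the sympy attribute of the same name, which is an assumption about how the real check might be written.

```python
from dataclasses import dataclass
from typing import List, Union


@dataclass(frozen=True)
class StubExpr:
    """Hypothetical stand-in for a symbolic expression; models only `is_number`."""
    text: str
    is_number: bool


@dataclass(frozen=True)
class StubNode:
    """Hypothetical stand-in for the object behind `SymInt.node`."""
    expr: StubExpr


@dataclass(frozen=True)
class StubSymInt:
    """Hypothetical stand-in for a SymInt exposing `.node.expr`."""
    node: StubNode


def sym(text: str) -> StubSymInt:
    # An expression made only of digits stands for a SymInt that the
    # parent program has already specialised to a literal int.
    return StubSymInt(StubNode(StubExpr(text, text.isdigit())))


def dynamic_dim_indices(shape: List[Union[int, StubSymInt]]) -> List[int]:
    """Return the indices of dims that should still receive a Dim."""
    dynamic = []
    for index, size in enumerate(shape):
        if isinstance(size, int):
            continue  # plain int: static dim
        if size.node.expr.is_number:
            continue  # specialised SymInt: static dim, so skip it
        dynamic.append(index)
    return dynamic


# Illustrative shape only: one specialised SymInt, two plain ints, and one
# SymInt that is still symbolic.
shape = [sym("4"), 8, sym("s0"), 64]
print(dynamic_dim_indices(shape))
```

With this filter, only the still-symbolic dim would be listed in ``dynamic_shapes``; the specialised SymInt is treated the same as a plain int.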

gokulkrishna98 force-pushed the dev/gokul/fix_sdpa_externalization branch from 7ef0b8e to ac0e0de on June 11, 2026 18:42

gokulkrishna98 self-assigned this Jun 16, 2026

gokulkrishna98 wants to merge 1 commit into apple:main from gokulkrishna98:dev/gokul/fix_sdpa_externalization