[TORCH][MLIR] Added _sdpa_flash_attention op #4417
base: main
Conversation
Signed-off-by: Keshav Vinayak Jha <keshavvinayakjha@gmail.com>
2. Fixed op signature.
3. Added DecomposeComplexOps template for (flash_attn, flash_attn_for_cpu) -> sdpa rewrite.
4. Lit test to check correct decomposition.

Signed-off-by: Keshav Vinayak Jha <keshavvinayakjha@gmail.com>
Pull request overview
This PR adds support for the _scaled_dot_product_flash_attention and _scaled_dot_product_flash_attention_for_cpu operations in the Torch-MLIR dialect. These operations are decomposed into the existing scaled_dot_product_attention operation.
Key Changes:
- Added decomposition patterns to convert flash attention ops to standard scaled dot product attention
- Registered the new operations in the ODS generator
- Added comprehensive test coverage for both new operations
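Both flash-attention variants compute the same mathematical result as standard scaled dot product attention, which is what makes the decomposition valid. The sketch below is illustrative only, not the torch-mlir decomposition code: it shows the reference SDPA computation in NumPy (function name, shapes, and the default scale are my own choices for the example).

```python
# Illustrative sketch of the scaled dot product attention math that both
# flash-attention ops reduce to after decomposition. Not torch-mlir code.
import math
import numpy as np

def scaled_dot_product_attention(q, k, v, scale=None):
    # q, k, v: (seq_len, head_dim) arrays for a single attention head.
    if scale is None:
        scale = 1.0 / math.sqrt(q.shape[-1])  # conventional 1/sqrt(d) scaling
    scores = (q @ k.T) * scale                         # (seq, seq) logits
    scores -= scores.max(axis=-1, keepdims=True)       # numerical stability
    weights = np.exp(scores)
    weights /= weights.sum(axis=-1, keepdims=True)     # softmax over keys
    return weights @ v                                 # (seq, head_dim)

rng = np.random.default_rng(0)
q = rng.standard_normal((4, 8))
k = rng.standard_normal((4, 8))
v = rng.standard_normal((4, 8))
out = scaled_dot_product_attention(q, k, v)
print(out.shape)  # (4, 8)
```

A flash-attention kernel produces the same `out` tensor; it differs only in how the softmax and matmuls are tiled for memory efficiency, which is why lowering to the plain SDPA op is semantically safe.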
Reviewed changes
Copilot reviewed 4 out of 4 changed files in this pull request and generated no comments.
| File | Description |
|---|---|
| lib/Dialect/Torch/Transforms/DecomposeComplexOps.cpp | Implements decomposition pattern template for both flash attention operations |
| include/torch-mlir/Dialect/Torch/IR/GeneratedTorchOps.td | Defines the two new flash attention operation signatures and parsing/printing logic |
| projects/pt1/python/torch_mlir/jit_ir_importer/build_tools/torch_ods_gen.py | Registers the new operations with their type signatures |
| test/Dialect/Torch/decompose-complex-ops.mlir | Adds test cases verifying decomposition behavior for both operations |
Following the discussion at iree-org/iree-turbine#1224.