
Conversation

@StrycekSimon (Collaborator) commented Jan 15, 2026

Summary

Adds two passes that insert/remove simulated BatchNorm fusion for QAT training, similar to how _fuse_conv_bn_qat adds simulated Conv+BatchNorm fusion in the prepare_qat_pt2e function from TorchAO.
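
For intuition, here is a minimal sketch of the simulated-fusion arithmetic during a QAT forward pass, assuming a BatchNorm1d directly after the linear layer and a callable weight fake-quantizer (names and structure are illustrative, not the actual pass implementation):

```python
import torch
import torch.nn.functional as F

def simulated_linear_bn_forward(x, linear, bn, weight_fake_quant):
    # Scale the weight by the BatchNorm factor so that the fake-quantized
    # weight matches the weight that will exist after real fusion
    # ("normalize").
    scale = bn.weight / torch.sqrt(bn.running_var + bn.eps)  # [out_features]
    scaled_weight = weight_fake_quant(linear.weight * scale.unsqueeze(1))

    # Run linear without bias, undo the scaling on the output
    # ("denormalize"), then re-apply the bias.
    y = F.linear(x, scaled_weight) / scale
    if linear.bias is not None:
        y = y + linear.bias

    # The real BatchNorm still runs so its running statistics keep
    # updating during QAT training.
    return bn(y)
```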

Test plan

Added integration tests that cover the newly added implementation.

cc @robert-kalmar

Copilot AI review requested due to automatic review settings January 15, 2026 15:12
@pytorch-bot bot commented Jan 15, 2026

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/16623

Note: Links to docs will display an error until the docs builds have been completed.

❌ 11 New Failures, 1 Pending

As of commit d70a29e with merge base 7a9fb3f:

NEW FAILURES - The following jobs have failed:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@StrycekSimon StrycekSimon requested review from MartinPavella and removed request for Copilot and robert-kalmar January 15, 2026 15:12
@meta-cla meta-cla bot added the CLA Signed label Jan 15, 2026
@StrycekSimon StrycekSimon added the module: nxp and release notes: nxp labels Jan 15, 2026
Copilot AI review requested due to automatic review settings January 29, 2026 12:40
@StrycekSimon StrycekSimon force-pushed the feature/EIEX-641-create-a-pass-to-fuse-linear-batchnorm-after-qat-quantization branch from 8d41a7a to b45a2ba on January 29, 2026 12:40
Copilot AI (Contributor) left a comment

Pull request overview

This pull request adds support for Linear + BatchNorm QAT (Quantization Aware Training) fusion in the NXP backend, following a similar approach to the existing Conv + BatchNorm fusion in TorchAO.

Changes:

  • Adds AddSimulatedLinearBatchNormFusionQATPass to insert simulated fusion during QAT training by normalizing linear weights and denormalizing outputs using BatchNorm statistics
  • Adds RemoveSimulatedLinearBatchNormFusionQATPass to remove the simulated fusion artifacts after QAT training is complete
  • Updates FuseBatchNormWithLinearPass to handle fake-quantized nodes during fusion (the folding arithmetic is sketched after this list)
  • Updates LinearPattern quantizer to skip quantization between linear and batch norm nodes during QAT
  • Adds comprehensive integration tests covering the fusion workflow
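
For reference, the final fusion step relies on the standard BatchNorm folding arithmetic. A minimal sketch, assuming a BatchNorm1d with affine parameters and tracked running statistics (the helper name is illustrative, not the pass's actual code):

```python
import torch

def fold_bn_into_linear(weight, bias, bn):
    # BatchNorm applied to the linear output,
    #   y = gamma * ((W @ x + b) - mean) / sqrt(var + eps) + beta,
    # folds into a single linear layer with adjusted weight and bias.
    inv_std = bn.weight / torch.sqrt(bn.running_var + bn.eps)
    fused_weight = weight * inv_std.unsqueeze(1)
    fused_bias = (bias - bn.running_mean) * inv_std + bn.bias
    return fused_weight, fused_bias
```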

Reviewed changes

Copilot reviewed 6 out of 6 changed files in this pull request and generated 10 comments.

Summary per file:

backends/nxp/tests/models.py: Adds LinearBNModule test model for testing Linear + BatchNorm patterns
backends/nxp/tests/ir/edge_passes/test_linear_bn_fusing.py: Adds integration tests for simulated fusion, complete fusion pipeline, and graph equivalence
backends/nxp/aten_passes/add_simulated_linear_bn_fusion_qat_pass.py: Implements pass to add simulated Linear + BatchNorm fusion for QAT by normalizing weights and denormalizing outputs
backends/nxp/aten_passes/remove_simulated_linear_bn_fusion_qat_pass.py: Implements pass to remove simulated fusion artifacts after QAT training
backends/nxp/aten_passes/fuse_batch_norm_with_linear_pass.py: Adds _unwrap_if_fq helper function to handle fake-quantized nodes during fusion
backends/nxp/quantizer/patterns.py: Updates LinearPattern to skip output quantization when followed by BatchNorm during QAT


@StrycekSimon StrycekSimon force-pushed the feature/EIEX-641-create-a-pass-to-fuse-linear-batchnorm-after-qat-quantization branch from b45a2ba to 764403b on January 29, 2026 16:35
Copilot AI review requested due to automatic review settings January 29, 2026 16:44
@StrycekSimon StrycekSimon force-pushed the feature/EIEX-641-create-a-pass-to-fuse-linear-batchnorm-after-qat-quantization branch from 764403b to 79557ff on January 29, 2026 16:44
Copilot AI (Contributor) left a comment

Pull request overview

Copilot reviewed 6 out of 6 changed files in this pull request and generated 8 comments.



@StrycekSimon StrycekSimon force-pushed the feature/EIEX-641-create-a-pass-to-fuse-linear-batchnorm-after-qat-quantization branch from 79557ff to ad5acaf on January 29, 2026 19:44
Copilot AI review requested due to automatic review settings February 2, 2026 07:42
@StrycekSimon StrycekSimon force-pushed the feature/EIEX-641-create-a-pass-to-fuse-linear-batchnorm-after-qat-quantization branch from ad5acaf to ff54eb2 on February 2, 2026 07:42
Copilot AI (Contributor) left a comment

Pull request overview

Copilot reviewed 6 out of 6 changed files in this pull request and generated 6 comments.



@StrycekSimon StrycekSimon force-pushed the feature/EIEX-641-create-a-pass-to-fuse-linear-batchnorm-after-qat-quantization branch from ff54eb2 to 2704388 on February 2, 2026 08:13
@robert-kalmar (Collaborator) commented

The only change I would really like to see is integration into executorch_pipeline.py and aot_neutron_convert.py, if it makes sense (at this moment). My other comments are nice to have.

@StrycekSimon (Collaborator, Author) commented

> The only change I would really like to see is integration into executorch_pipeline.py and aot_neutron_convert.py, if it makes sense (at this moment). My other comments are nice to have.

I will add it to the calibrate_and_quantize function to include it in the tests and verify that there are no conflicts (failing tests) with the rest of the implementation. Just note that the only model in aot_neutron_compile.py that uses QAT so far is CifarNet, where explicitly using the pass is pointless because it does not contain the Linear+BN pattern.

Copilot AI review requested due to automatic review settings February 10, 2026 16:10
@StrycekSimon StrycekSimon force-pushed the feature/EIEX-641-create-a-pass-to-fuse-linear-batchnorm-after-qat-quantization branch from 2704388 to 51bd521 on February 10, 2026 16:10
Copilot AI (Contributor) left a comment

Pull request overview

Copilot reviewed 9 out of 9 changed files in this pull request and generated 3 comments.



@StrycekSimon (Collaborator, Author) commented Feb 11, 2026

After the recent push, one of the PTQ test cases for mm op conversion fails. I will investigate.

Update: It's an off-by-one mismatch in one of the output values, much like in some other tests (e.g. test_clone_converter.py), probably caused by floating-point arithmetic. Increasing atol to 1, as in the similar tests, resolved the issue.
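
Purely as an illustration of the relaxed tolerance (not the actual test code), a comparison like this tolerates a single-unit rounding difference in quantized outputs:

```python
import torch

actual = torch.tensor([127, 64, 5], dtype=torch.int8)
expected = torch.tensor([126, 64, 5], dtype=torch.int8)

# With atol=1, an off-by-one value no longer fails the comparison.
torch.testing.assert_close(actual, expected, atol=1.0, rtol=0.0)
```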

@StrycekSimon StrycekSimon force-pushed the feature/EIEX-641-create-a-pass-to-fuse-linear-batchnorm-after-qat-quantization branch from 51bd521 to fb048d2 on February 11, 2026 09:48
Copilot AI review requested due to automatic review settings February 11, 2026 10:05
@StrycekSimon StrycekSimon force-pushed the feature/EIEX-641-create-a-pass-to-fuse-linear-batchnorm-after-qat-quantization branch from fb048d2 to 0bcdcb9 on February 11, 2026 10:05
Copilot AI (Contributor) left a comment

Pull request overview

Copilot reviewed 10 out of 10 changed files in this pull request and generated 7 comments.



@StrycekSimon StrycekSimon force-pushed the feature/EIEX-641-create-a-pass-to-fuse-linear-batchnorm-after-qat-quantization branch from 0bcdcb9 to d70a29e on February 11, 2026 13:29