[X86] Add convert_element_type to smooth quant pattern by cyxlily · Pull Request #3784 · pytorch/ao

cyxlily · 2026-01-30T08:39:23Z

No description provided.

Signed-off-by: Cui, Lily <lily.cui@intel.com>

pytorch-bot · 2026-01-30T08:39:27Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/3784

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Copilot

Pull request overview

This PR extends the smooth quantization pattern matching to support operations that include convert_element_type nodes. The changes enable the pattern matcher to recognize and handle quantization patterns with additional type conversion operations.

Changes:

Added convert_a parameter to get_pattern_no_bias function to generate patterns with extra convert_element_type nodes
Created new pattern variants (pattern_no_bias_1_c1, pattern_no_bias_1_c2, etc.) to match different combinations of conversion operations
Updated validation logic to accept additional match node counts (8, 9, 12) for the new patterns
Modified keyword argument handling to use x_scale_dtype and make dtype optional with fallback

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

torchao/quantization/pt2e/inductor_passes/x86.py

Signed-off-by: Cui, Lily <lily.cui@intel.com>

Xia-Weiwen

Thanks for the PR. Please also add a UT.

torchao/quantization/pt2e/inductor_passes/x86.py

Signed-off-by: Cui, Lily <lily.cui@intel.com>

torchao/kernel/intmm.py

Signed-off-by: Cui, Lily <lily.cui@intel.com>

Moved to the other pr. Signed-off-by: Cui, Lily <lily.cui@intel.com>

Copilot

Pull request overview

Copilot reviewed 2 out of 2 changed files in this pull request and generated 3 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Xia-Weiwen · 2026-03-30T07:30:21Z

torchao/quantization/pt2e/inductor_passes/x86.py

    def _validate_pattern(match: Match):
-        if len(match.nodes) not in [4, 5, 6, 7, 10]:
+        # Valid node counts correspond to different pattern variations:
+        # 4: pattern1_with_no_outer_or_act_reshape (int_mm + convert + mul + mul)
+        # 6: pattern_no_bias_1 (reshape + int_mm + convert + mul + mul + reshape)
+        # 7: pattern_with_bias_1 (pattern_no_bias_1 + add)
+        # 8: pattern_no_bias_1_with_output_convert (pattern_no_bias_1 with dot scaled + output convert)
+        # 9: pattern_with_bias_1_with_output_convert (pattern_with_bias_1 with dot scaled + output convert)
+        if len(match.nodes) not in [4, 6, 7, 8, 9]:
            return False


@cyxlily Please take a look.

Xia-Weiwen · 2026-03-30T07:30:45Z

torchao/quantization/pt2e/inductor_passes/x86.py

    # When torch.compile'ing with dynamic=True, the expand node and the two tailing reshape nodes exist
    # When torch.compile'ing with dynamic=False, they don't exist


@cyxlily Please take a look.

Xia-Weiwen · 2026-03-30T07:33:47Z

torchao/quantization/pt2e/inductor_passes/x86.py

+    def get_pattern_no_bias(reshape_a: bool = True, convert_a: bool = False):
+        int_mm_pattern = CallFunction(


@cyxlily Please consider renaming here.

Add convert_element_type to smooth quant pattern

513bddc

Signed-off-by: Cui, Lily <lily.cui@intel.com>

meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jan 30, 2026

Xia-Weiwen requested a review from Copilot January 30, 2026 08:45

Copilot AI reviewed Jan 30, 2026

View reviewed changes

torchao/quantization/pt2e/inductor_passes/x86.py Show resolved Hide resolved

Xia-Weiwen changed the title ~~Add convert_element_type to smooth quant pattern~~ [X86] Add convert_element_type to smooth quant pattern Jan 30, 2026

cyxlily added 3 commits February 2, 2026 11:00

Merge branch 'pytorch:main' into smooth_quant_pattern

7229c42

Cleanup patterns

2c45d12

Signed-off-by: Cui, Lily <lily.cui@intel.com>

Update nodes

22b7ed9

Signed-off-by: Cui, Lily <lily.cui@intel.com>

Xia-Weiwen reviewed Feb 4, 2026

View reviewed changes

cyxlily added 5 commits February 10, 2026 10:17

Merge remote-tracking branch 'upstream/main' into smooth_quant_pattern

866e92e

Remove cpu expand

92cfcee

Signed-off-by: Cui, Lily <lily.cui@intel.com>

Remove expand pattern

7b8b44b

Signed-off-by: Cui, Lily <lily.cui@intel.com>

Rename pattern

be223bf

Signed-off-by: Cui, Lily <lily.cui@intel.com>

Refine codes

2ec8981

Signed-off-by: Cui, Lily <lily.cui@intel.com>

jerryzh168 reviewed Feb 13, 2026

View reviewed changes

torchao/kernel/intmm.py Outdated Show resolved Hide resolved

cyxlily added 4 commits February 13, 2026 18:39

Merge remote-tracking branch 'upstream/main' into smooth_quant_pattern

3fbafad

Add unit test

760e66d

Signed-off-by: Cui, Lily <lily.cui@intel.com>

Merge branch 'pytorch:main' into smooth_quant_pattern

a53e0ea

Revert cpu intmm change

3217cb6

Moved to the other pr. Signed-off-by: Cui, Lily <lily.cui@intel.com>

Xia-Weiwen requested a review from Copilot March 18, 2026 08:40

Copilot started reviewing on behalf of Xia-Weiwen March 18, 2026 08:40 View session

Copilot AI reviewed Mar 18, 2026

View reviewed changes

cyxlily added 2 commits March 20, 2026 09:56

Merge branch 'pytorch:main' into smooth_quant_pattern

eb2df47

Merge remote-tracking branch 'upstream/main' into smooth_quant_pattern

00ee982

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[X86] Add convert_element_type to smooth quant pattern#3784

[X86] Add convert_element_type to smooth quant pattern#3784
cyxlily wants to merge 15 commits intopytorch:mainfrom
cyxlily:smooth_quant_pattern

cyxlily commented Jan 30, 2026

Uh oh!

pytorch-bot bot commented Jan 30, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Xia-Weiwen left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Xia-Weiwen Mar 30, 2026

Uh oh!

Xia-Weiwen Mar 30, 2026

Uh oh!

Xia-Weiwen Mar 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

		# When torch.compile'ing with dynamic=True, the expand node and the two tailing reshape nodes exist
		# When torch.compile'ing with dynamic=False, they don't exist

		def get_pattern_no_bias(reshape_a: bool = True, convert_a: bool = False):
		int_mm_pattern = CallFunction(

Conversation

cyxlily commented Jan 30, 2026

Uh oh!

pytorch-bot bot commented Jan 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/3784

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Xia-Weiwen left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Xia-Weiwen Mar 30, 2026

Choose a reason for hiding this comment

Uh oh!

Xia-Weiwen Mar 30, 2026

Choose a reason for hiding this comment

Uh oh!

Xia-Weiwen Mar 30, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

pytorch-bot bot commented Jan 30, 2026 •

edited

Loading