Commit aa61c38

Add comments
Signed-off-by: Cui, Lily <lily.cui@intel.com>
1 parent 6462a41 commit aa61c38

1 file changed: +3 -3 lines changed


torchao/kernel/intmm.py

Lines changed: 3 additions & 3 deletions
@@ -136,9 +136,9 @@ def _int_scaled_matmul_cpu(
 ) -> torch.Tensor:
     """
     CPU-optimized path for scaled integer matrix multiplication.
-    It goes to u8s8 or s8s8 path based on ISA support for
-    hardware. The selection is for performance only and both paths
-    should work regardless of ISA support.
+    CPU prefers decomposed version to leverage the fusion capability of Inductor.
+    It goes to u8s8 or s8s8 path based on ISA support for hardware. The selection
+    is for performance only and both paths should work regardless of ISA support.
 
     Args:
         a (torch.Tensor): The first matrix to multiply (int8).
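
The docstring says the u8s8 and s8s8 paths are interchangeable and that the choice between them is for performance only. The sketch below illustrates one common way such equivalence can hold: shift the signed left operand into the unsigned range, multiply, then subtract a compensation term. It is not the code in `torchao/kernel/intmm.py`. It uses plain Python lists so the arithmetic is visible, every function name is hypothetical, and the per-row scale layout is an assumption.

```python
# Illustrative sketch only: plain-Python integer arithmetic, not torchao's code.
# All function names are hypothetical; per-row scales are an assumption.

def int_matmul(a, b):
    """Integer matrix product of a (m x k) and b (k x n), as nested lists."""
    cols = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols] for row in a]


def s8s8_scaled_matmul(a, b, row_scales):
    """Signed-by-signed product, followed by a per-row floating-point scale."""
    acc = int_matmul(a, b)
    return [[v * s for v in row] for row, s in zip(acc, row_scales)]


def u8s8_scaled_matmul(a, b, row_scales):
    """Same result via an unsigned left operand plus a compensation term."""
    # Shift int8 values [-128, 127] into the uint8 range [0, 255].
    a_u8 = [[x + 128 for x in row] for row in a]
    acc = int_matmul(a_u8, b)
    # (a + 128) @ b == a @ b + 128 * column_sums(b), so subtract the extra term.
    comp = [128 * sum(col) for col in zip(*b)]
    return [[(v - c) * s for v, c in zip(row, comp)] for row, s in zip(acc, row_scales)]


a = [[-128, 5, 127], [3, -7, 0]]   # int8-range values
b = [[1, -2], [4, 0], [-3, 6]]     # int8-range values
scales = [0.5, 0.25]               # one scale per output row

assert s8s8_scaled_matmul(a, b, scales) == u8s8_scaled_matmul(a, b, scales)
```

Because the integer accumulation is exact in both variants, the two functions agree before scaling. Under these assumptions, that is consistent with the docstring's statement that either path should work regardless of ISA support.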