
Enable GPTQ on Intel GPU #4191

Open
xiaowangintel wants to merge 2 commits into pytorch:main from xiaowangintel:xw/prototype_gptq

Conversation

@xiaowangintel
Collaborator

Summary
This PR enables GPTQ support on Intel GPU.

Previously, GPTQ workflows in torchao were primarily validated on CUDA. This PR extends support to XPU.

@pytorch-bot

pytorch-bot bot commented Mar 27, 2026

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/4191

Note: Links to docs will display an error until the docs builds have been completed.

❌ 8 New Failures

As of commit 950b4fb with merge base 4611835:

NEW FAILURES - The following jobs have failed:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@meta-cla bot added the CLA Signed label (this label is managed by the Facebook bot; authors need to sign the CLA before a PR can be reviewed) on Mar 27, 2026
@xiaowangintel requested review from jcaip and liangan1 on Mar 27, 2026 08:16
@xiaowangintel self-assigned this on Mar 27, 2026
@xiaowangintel added the module: inference, quantize_ api inference flow, and ciflow/xpu (label used to trigger xpu CI jobs) labels on Mar 27, 2026
if device == "cuda":
torch.cuda.empty_cache()
elif device == "xpu":
torch.xpu.empty_cache()
Collaborator

Suggest using the torch.accelerator API here instead of per-backend branches.
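For reference, the device-agnostic cleanup the reviewer is suggesting can be sketched as a dispatch on the backend name. In recent PyTorch this is roughly `torch.get_device_module(device).empty_cache()` (that helper is an assumption about the available torch version); the sketch below models the lookup with a plain dict so the pattern is self-contained:

```python
# Sketch only: a dict stands in for PyTorch's backend-module lookup
# (torch.get_device_module), so the pattern runs without torch installed.

def make_empty_cache(backends):
    """Build a device-agnostic empty_cache(device) from a backend table.

    `backends` maps a backend name ("cuda", "xpu", ...) to its cache-release
    callable. Devices with no entry (e.g. "cpu") are a no-op.
    """
    def empty_cache(device: str) -> bool:
        backend = backends.get(device.split(":")[0])  # "xpu:0" -> "xpu"
        if backend is None:
            return False  # nothing to release for this device
        backend()
        return True
    return empty_cache

# With real torch, the table would be
# {"cuda": torch.cuda.empty_cache, "xpu": torch.xpu.empty_cache}.
```

This collapses the `if device == "cuda": ... elif device == "xpu": ...` chain into a single call site that new backends extend without touching the caller.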

# Save model to generated output directory
print(f"Saving model to {output_dir}...")
tokenizer.save_pretrained(output_dir)
print("model:", model)
Collaborator

Suggested change (remove the debug print):
print("model:", model)


def main():
args = parse_args()
device = _get_device(args.device)
Collaborator

Suggest using torch.acc.get_current_xxx to get the current device information instead of adding a new --device parameter, since the extra flag changes user behavior.


@skip_if_lt_x_gpu(2)
@unittest.skipIf(not torch.accelerator.is_available(), "Need GPU available")
@unittest.skipIf(torch.xpu.is_available(), "XPU enablement in progress")
Collaborator

Please move the unrelated unit-test skips to a separate PR.

@jerryzh168
Contributor

This feature is not complete yet. @xiaowangintel @liangan1, can you restrict the contribution to stable features only for now? They are in https://github.com/pytorch/ao/tree/main/torchao/quantization/quantize_

@liangan1
Collaborator

> This feature is not complete yet. @xiaowangintel @liangan1, can you restrict the contribution to stable features only for now? They are in https://github.com/pytorch/ao/tree/main/torchao/quantization/quantize_

Sure. Thanks for your suggestion. We will focus on these features now.

