
Support swiglustep and mul #199

Open

Dboyqiao wants to merge 3 commits into vllm-project:main from Dboyqiao:dev/zhefeng/swiglustep_and_mul

Conversation

@Dboyqiao

Essential Elements of an Effective PR Description Checklist

  • The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
  • The test plan, such as providing a test command.
  • The test results, such as pasting a before/after comparison or e2e results.
  • (Optional) Necessary documentation updates, such as updating supported_models.md and examples for a new model.


Purpose

Support swiglustep and mul

Test Plan

python -m pytest tests/test_swiglustep_and_mul.py -v

Test Result

Pass

(Optional) Documentation Update


Copilot AI review requested due to automatic review settings, March 18, 2026 03:19
Contributor

Copilot AI left a comment


Pull request overview

Adds support for a new fused activation (swiglustep_and_mul) across the XPU extension stack (C++/SYCL kernel → Torch binding → Python dispatch), plus accompanying unit test and benchmark.

Changes:

  • Add swiglustep activation option to the fused MoE Python interface.
  • Register and bind a new torch.ops._C.swiglustep_and_mul XPU operator and implement its SYCL kernel.
  • Add unit test coverage and a benchmark script for the new op.
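The authoritative semantics are the SYCL kernel in csrc/activation.cpp, but as a rough sketch of what a fused "swiglustep and mul" op plausibly does: a SwiGLU-style gated activation whose extra limit argument (7.0 at the dispatch site) caps the gate before SiLU is applied. The reference below is a hypothetical illustration, not the PR's actual implementation; the clamp behavior and the function name are assumptions.

```python
import torch
import torch.nn.functional as F

def swiglustep_and_mul_ref(x: torch.Tensor, limit: float = 7.0) -> torch.Tensor:
    """Hypothetical native sketch: split the last dim in half, clamp the
    gate at `limit` (the assumed "step"), apply SiLU, and multiply by the
    second half. The real semantics live in csrc/activation.cpp."""
    d = x.shape[-1] // 2
    gate, up = x[..., :d], x[..., d:]
    return F.silu(gate.clamp(max=limit)) * up

x = torch.randn(4, 2 * 8)          # input shaped [..., 2 * d]
out = swiglustep_and_mul_ref(x)    # output shaped [4, 8]
```

A reference like this is what the PR's test harness (tests/ops/swiglustep_and_mul_op.py) would compare the fused XPU kernel against.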

Reviewed changes

Copilot reviewed 8 out of 8 changed files in this pull request and generated 5 comments.

Summary per file:

  • vllm_xpu_kernels/fused_moe_interface.py: routes activation="swiglustep" to the new fused op.
  • csrc/activation.cpp: implements the swiglustep_and_mul device function, kernel, and launcher.
  • csrc/torch_bindings.cpp: registers the new op schema and XPU implementation.
  • csrc/ops.h: declares the new C++ op entrypoint.
  • tests/register_ops.py: adds a Python test wrapper for the new op.
  • tests/ops/swiglustep_and_mul_op.py: adds a CustomOp test harness and native reference implementation.
  • tests/test_swiglustep_and_mul.py: adds pytest coverage and opcheck for the new op.
  • benchmark/benchmark_swiglustep_and_mul.py: adds performance benchmarking for the op vs native/compile.


Comment on lines +12 to +15
XPU_DEVICES = [
f"xpu:{i}" for i in range(1 if torch.xpu.device_count() == 1 else 2)
]
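The conditional in the snippet above, range(1 if torch.xpu.device_count() == 1 else 2), can be written more directly with min(). The helper below is a hypothetical rewrite, not code from the PR; note it also yields an empty list when no XPU is present, rather than falling through to two device names.

```python
def xpu_device_list(device_count: int) -> list[str]:
    # Cap the test matrix at two devices; min() covers the single-device,
    # multi-device, and zero-device cases uniformly.
    return [f"xpu:{i}" for i in range(min(device_count, 2))]

# e.g. XPU_DEVICES = xpu_device_list(torch.xpu.device_count())
```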

Comment on lines +54 to +57
torch.set_default_device(device)
x = torch.randn(num_tokens, 2 * d, dtype=dtype)

layer = SwigluStepAndMul()

Comment on lines +76 to +77
d = x.shape[-1] // 2
output_shape = (x.shape[:-1] + (d, ))
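The shape arithmetic in the two lines above relies on tuple slicing and concatenation: every leading dimension is preserved and the last is halved. A plain-Python illustration (the concrete dims are hypothetical):

```python
shape = (2, 3, 16)                 # stands in for x.shape, with 2 * d == 16
d = shape[-1] // 2                 # d = 8
output_shape = shape[:-1] + (d,)   # keep leading dims, halve the last one
assert output_shape == (2, 3, 8)
```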

Comment on lines +298 to +299
elif activation == "swiglustep":
torch.ops._C.swiglustep_and_mul(act_output, gemm1_output, 7.0)

Comment on lines +11 to +13
from tests.ops.swiglustep_and_mul_op import SwigluStepAndMul


Signed-off-by: Qiao, Zhefeng <zhefeng.qiao@intel.com>
@Dboyqiao force-pushed the dev/zhefeng/swiglustep_and_mul branch from 21a7552 to bef820a on March 18, 2026 07:11
torch::Tensor& input, // [..., 2 * d]
double limit) {
LAUNCH_SWIGLUSTEP_AND_MUL(vllm::swiglustep_and_mul, limit);
}
\ No newline at end of file
Collaborator


add blank line

Author


fixed

Signed-off-by: Qiao, Zhefeng <zhefeng.qiao@intel.com>

3 participants