
reduce unit test duration for mini test profiler#217

Open
chaojun-zhang wants to merge 1 commit into vllm-project:main from chaojun-zhang:cri_ut_fix

Conversation

@chaojun-zhang
Contributor

Essential Elements of an Effective PR Description Checklist

  • The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)."
  • The test plan, such as providing the test command.
  • The test results, such as pasting a before/after comparison or e2e results.
  • (Optional) Any necessary documentation updates, such as updating supported_models.md and examples for a new model.

PLEASE FILL IN THE PR DESCRIPTION HERE ENSURING ALL CHECKLIST ITEMS ABOVE HAVE BEEN CONSIDERED.

Purpose

Test Plan

export XPU_KERNEL_PYTEST_PROFILER=MINI
pytest -s -v tests/flash_attn/test_flash_attn_varlen_func.py
pytest -s -v tests/gdn_attn/test_gdn_attn.py

Test Result

(Optional) Documentation Update

BEFORE SUBMITTING, PLEASE READ https://docs.vllm.ai/en/latest/contributing (anything written below this line will be removed by GitHub Actions)

Signed-off-by: chzhang <chaojun.zhang@intel.com>
@chaojun-zhang chaojun-zhang marked this pull request as ready for review March 24, 2026 05:11
Copilot AI review requested due to automatic review settings March 24, 2026 05:11

Copilot AI left a comment


Pull request overview

Reduces the runtime of “mini” profiler test configurations by shrinking parameter grids used in two attention-related test suites.

Changes:

  • Shrink mini-parameterization for GDN attention tests (token count and batch size).
  • Shrink mini-parameterization for FlashAttention varlen+PagedKV tests (seq lens/head sizes/etc.).
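The mechanism behind these changes can be sketched as an environment-variable-gated parameter grid. This is a hypothetical illustration, not the actual vllm test code: the real layout of `MINI_PYTEST_PARAMS`, the `FULL_PARAMS` name, and the `select_params` helper are all assumptions; only the `XPU_KERNEL_PYTEST_PROFILER=MINI` switch comes from the test plan above.

```python
import os

# Hypothetical sketch: a full parameter grid and a reduced "mini" grid,
# selected at collection time via the XPU_KERNEL_PYTEST_PROFILER env var.
FULL_PARAMS = {
    "seq_lens": [[(5, 18)], [(129, 463)]],
    "head_size": [64, 128],
    "num_heads": [(8, 2), (4, 4)],
}

MINI_PYTEST_PARAMS = {
    "seq_lens": [[(5, 18)]],
    "head_size": [64],
    "num_heads": [(2, 2)],
}

def select_params(name):
    """Return the reduced grid when the MINI profiler is requested,
    falling back to the full grid otherwise."""
    if os.environ.get("XPU_KERNEL_PYTEST_PROFILER") == "MINI":
        return MINI_PYTEST_PARAMS.get(name, FULL_PARAMS[name])
    return FULL_PARAMS[name]
```

Shrinking the mini grid this way cuts the number of parametrized test cases multiplicatively, since pytest takes the cross product of all parameter lists.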

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.

File Description
tests/gdn_attn/test_gdn_attn.py Updates MINI_PYTEST_PARAMS to run a smaller (faster) configuration.
tests/flash_attn/test_flash_attn_varlen_func.py Narrows MINI_PYTEST_PARAMS for the varlen paged-KV test to reduce runtime.


"seq_lens": [[(5, 18)]],
"head_size": [64],
"block_size": [64],
"num_heads": [(2)],

Copilot AI Mar 24, 2026


num_heads previously used a 2-tuple ((8, 2)), which strongly suggests the test code expects to unpack (num_q_heads, num_kv_heads) (or similar). Changing it to an int ((2) is just 2 in Python) is likely to break parametrization consumers that expect a tuple. Keep the parameter type consistent (e.g., provide a 2-tuple with reduced values) or update the downstream test logic to accept both scalar and tuple forms.

Suggested change
"num_heads": [(2)],
"num_heads": [(2, 2)],
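The pitfall flagged in this comment can be shown in a few lines of plain Python (the variable names below are illustrative only): parentheses alone do not create a tuple; a comma does.

```python
# (2) is just the int 2 wrapped in grouping parentheses,
# while (2, 2) is a genuine 2-tuple.
scalar = (2)
pair = (2, 2)

assert scalar == 2 and not isinstance(scalar, tuple)
assert isinstance(pair, tuple)

# Unpacking works only for the tuple; doing the same with `scalar`
# would raise a TypeError at collection time.
num_q_heads, num_kv_heads = pair
```

This is why a consumer expecting to unpack `(num_q_heads, num_kv_heads)` breaks when the parameter is silently demoted to an int.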

Comment on lines +33 to +34
"num_actual_tokens": [1],
"batch_size": [4],

Copilot AI Mar 24, 2026


The PR description template is still unfilled (Purpose/Test Result sections are blank), so it’s unclear what runtime reduction is expected and what the before/after results are. Please update the PR description with the intended goal (e.g., target runtime), and paste the measured before/after test timings for the provided test plan commands.
