Target release date: 03/30, to catch the vLLM 0.19.0 release.
[update] Target release date: 04/03.
- [EPLB] Add eplb enabling kernels (#182)
- [Attention] Block size 16/32 support: add block_size 16/32 support for chunk prefill and fix paged decode (#171)
- [Attention] Dynamic stride support: [CHUNK_PREFILL] add dynamic_stride support (#187)
- Vectorize act-and-mul kernels for speedup (#207)
- Remove xpu_fused_moe weights handling (#163)
- TBD