[WIP] Add DP-aware routing support to KVEvents and indexing pipeline by satyamg1620 · Pull Request #370 · llm-d/llm-d-kv-cache

satyamg1620 · 2026-02-28T14:13:17Z

Summary

Propagate DataParallelRank from EventBatch through the KV event processing pipeline into PodEntry, enabling the index and scorer to distinguish KV blocks cached by different data-parallel ranks on the same pod.
Add DataParallelRank int field to PodEntry with sentinel value -1 (NoDataParallelRank) for backward compatibility with non-DP deployments.
Update LongestPrefixScorer to produce DP-aware scoring keys ("pod-1@dp0") so different ranks on the same pod receive independent scores.
Add optional int32 data_parallel_rank field to the PodScore proto message and update the gRPC server to populate it.

This PR resolves issue #357

Signed-off-by: satyamg1620 <Satyam.Gupta.3@ibm.com>

satyamg1620 · 2026-02-28T17:06:20Z

@vMaroon Can you please review this initial draft PR. Let me know if any changes are required.

vMaroon · 2026-02-28T18:19:54Z

@satyamg1620 thank you for the contribution!

Generally I think the approach is correct - this is the only way to currently cover all deployments mentioned in https://docs.vllm.ai/en/stable/serving/data_parallel_deployment. Today in llm-d we only support the external LB mode, in which every rank is a separate deployment, and the pipeline works as follows:

Each rank publishes to a kv@<IP>:<PORT>@<MODEL> topic, and a PodIdentifier is then <IP>:<PORT>
The scheduler treats each rank as a separate, normal endpoint, identified by <IP>:<PORT>

Though for other DP modes, there is one gap on consumption from the scheduler side: the scheduler currently has nothing that connects actual DP-rank information with the port.

The bridge between the scheduler and the indexed data is missing here. It would be great if you can prepare an overview of how llm-d would support each of the modes in the vllm doc. Once a plan is conceived on this kind of coverage - we can proceed in a phased approach where this PR is one, and scheduler updates are another.

satyamg1620 · 2026-03-02T06:16:01Z

sure @vMaroon . I will prepare an overview for same.

Added changes to fetch dp properties from kvevents for DP-aware routing

5337c47

Signed-off-by: satyamg1620 <Satyam.Gupta.3@ibm.com>

github-actions bot requested review from hyeongyun0916, liu-cong, sagearc and yankay February 28, 2026 14:13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP] Add DP-aware routing support to KVEvents and indexing pipeline#370

[WIP] Add DP-aware routing support to KVEvents and indexing pipeline#370
satyamg1620 wants to merge 1 commit intollm-d:mainfrom
satyamg1620:dp-aware-routing

satyamg1620 commented Feb 28, 2026

Uh oh!

satyamg1620 commented Feb 28, 2026

Uh oh!

vMaroon commented Feb 28, 2026 •

edited

Loading

Uh oh!

satyamg1620 commented Mar 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

satyamg1620 commented Feb 28, 2026

Summary

Uh oh!

satyamg1620 commented Feb 28, 2026

Uh oh!

vMaroon commented Feb 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

satyamg1620 commented Mar 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

vMaroon commented Feb 28, 2026 •

edited

Loading