Skip to content
This repository was archived by the owner on Feb 3, 2026. It is now read-only.

Ensure max_completion_tokens=1 for prefill#67

Merged
nirrozenbaum merged 1 commit intollm-d:mainfrom
shmuelk:max-completion-tokens-fix
Oct 29, 2025
Merged

Ensure max_completion_tokens=1 for prefill#67
nirrozenbaum merged 1 commit intollm-d:mainfrom
shmuelk:max-completion-tokens-fix

Conversation

@shmuelk
Copy link
Contributor

@shmuelk shmuelk commented Oct 29, 2025

This PR back port PR llm-d-inference-scheduler 403 to this repo to enable us to release a quick fix.

Signed-off-by: Shmuel Kallner <kallner@il.ibm.com>
Copy link
Collaborator

@nirrozenbaum nirrozenbaum left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

/lgtm
/approve

@nirrozenbaum nirrozenbaum merged commit e7151bd into llm-d:main Oct 29, 2025
1 check passed
pierDipi pushed a commit to pierDipi/llm-d-routing-sidecar that referenced this pull request Oct 30, 2025
Signed-off-by: Shmuel Kallner <kallner@il.ibm.com>
pierDipi pushed a commit to pierDipi/llm-d-routing-sidecar that referenced this pull request Oct 30, 2025
…8d (llm-d#67)

Signed-off-by: konflux-internal-p02 <170854209+konflux-internal-p02[bot]@users.noreply.github.com>
Co-authored-by: konflux-internal-p02[bot] <170854209+konflux-internal-p02[bot]@users.noreply.github.com>
pierDipi pushed a commit to pierDipi/llm-d-routing-sidecar that referenced this pull request Oct 30, 2025
Signed-off-by: Shmuel Kallner <kallner@il.ibm.com>
pierDipi pushed a commit to pierDipi/llm-d-routing-sidecar that referenced this pull request Oct 30, 2025
Signed-off-by: Shmuel Kallner <kallner@il.ibm.com>
Jooho added a commit to opendatahub-io/llm-d-routing-sidecar that referenced this pull request Oct 30, 2025
…e3db496c26d-v0.3

[0.3] Ensure max_completion_tokens=1 for prefill (llm-d#67)
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants