Handle VLM weight name prefixes in QuantizedModel loader#1996

Draft
uday610 wants to merge 1 commit into microsoft:main from uday610:vlm_quant_model_load

Conversation

uday610 commented on Feb 28, 2026

This change makes the quantized model weight loader work with VLM checkpoints, where weight names include extra prefixes such as `model.language_model.*` for the language model and `model.visual.*` for the vision tower.

- Skip weights under `model.visual.*`

- Normalize `model.language_model.*` → `model.*` before the existing pattern matching and layer parsing

There is no functional change for pure LLM checkpoints.

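The two rules above can be sketched as a small name-mapping helper. This is an illustrative sketch only, not the PR's actual implementation; the function name `normalize_vlm_weight_name` and its skip-via-`None` convention are assumptions for the example.

```python
from typing import Optional

def normalize_vlm_weight_name(name: str) -> Optional[str]:
    """Map a VLM checkpoint weight name onto the pure-LLM naming scheme.

    Returns None for vision tower weights (to be skipped), otherwise the
    name with any language-model prefix normalized away. Hypothetical
    helper illustrating the two rules in the PR description.
    """
    # Rule 1: skip vision tower weights under model.visual.*
    if name.startswith("model.visual."):
        return None
    # Rule 2: normalize model.language_model.* -> model.* so the existing
    # pattern matching and layer parsing see the familiar LLM names.
    prefix = "model.language_model."
    if name.startswith(prefix):
        return "model." + name[len(prefix):]
    # Pure LLM checkpoints pass through unchanged.
    return name
```

A loader would apply this once per checkpoint key, dropping entries that map to `None` before its usual matching logic runs.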
