Problem
bridge/models/qwen_vl/modelling_qwen3_vl/rope.py#L318
assert not config.apply_rope_fusion
I have a question regarding the assert above. Are you sure that Qwen3.5 and the Qwen3-VL series cannot use apply_rope_fusion?
In practice, when I tested with Qwen3.5-35B under the THD setting, I found that after commenting out this assert, enabling apply_rope_fusion still achieves reward convergence fully aligned with the baseline, and even further reduces the train–inference mismatch.
So I would like to confirm whether this assert is actually necessary.
Minimal repro
Expected behavior
no
Affected area
area:model
Regression?
Yes
Environment
No response
Logs
Problem
bridge/models/qwen_vl/modelling_qwen3_vl/rope.py#L318
I have a question regarding the assert above. Are you sure that Qwen3.5 and the Qwen3-VL series cannot use apply_rope_fusion?
In practice, when I tested with Qwen3.5-35B under the THD setting, I found that after commenting out this assert, enabling apply_rope_fusion still achieves reward convergence fully aligned with the baseline, and even further reduces the train–inference mismatch.
So I would like to confirm whether this assert is actually necessary.
Minimal repro
Expected behavior
no
Affected area
area:model
Regression?
Yes
Environment
No response
Logs