My device is an M4 Max Studio with 128GB RAM. When running Qwen3.5-27B, I only get 1 t/s. What settings should I check or adjust? #157
My device is an M4 Max Studio with 128GB RAM. When running Qwen3.5-27B, I only get 1 t/s. What settings should I check or adjust?
Answered by
xiaolv52099
Mar 12, 2026
Replies: 1 comment
Try switching to the MLX version of the model. On my M1 Pro with 32GB, Qwen3.5-4B-4bit-mlx gets around 40 t/s; it flies, with no lag at all.
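To compare a GGUF build against an MLX build on the same machine, it helps to measure t/s the same way for both. Below is a minimal sketch of such a measurement. It is not tied to any particular runtime: `tokens_per_second` and its `generate` argument are hypothetical names introduced here, and `generate` stands for whatever call your runtime uses to produce a sequence of tokens.

```python
import time
from typing import Callable, Sequence


def tokens_per_second(
    generate: Callable[[], Sequence],
    clock: Callable[[], float] = time.perf_counter,
) -> float:
    """Time one call to generate() and return generated tokens per second.

    `generate` is a hypothetical zero-argument callable that returns the
    generated tokens; wrap your runtime's own generation call in it.
    """
    start = clock()
    tokens = generate()
    elapsed = clock() - start
    if elapsed <= 0:
        raise ValueError("elapsed time must be positive")
    return len(tokens) / elapsed


if __name__ == "__main__":
    # Stand-in generator for illustration only: sleeps briefly and
    # returns 20 dummy tokens instead of calling a real model.
    def fake_generate():
        time.sleep(0.5)
        return list(range(20))

    print(f"{tokens_per_second(fake_generate):.1f} t/s")
```

Use the same prompt and the same output length for both builds. This figure includes prompt processing time, so with long prompts it will come out lower than the pure generation speed.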
0 replies
Answer selected by
Deanmsn