Skip to content

Commit 87254c3

Browse files
committed
try with pp2
Signed-off-by: Brian Yu <bxyu@nvidia.com>
1 parent dbf76cb commit 87254c3

File tree

1 file changed

+3
-3
lines changed

1 file changed

+3
-3
lines changed

responses_api_models/local_vllm_model/test_scripts/2_nodes/1_instance_1x16.sh

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -13,7 +13,7 @@
1313
# See the License for the specific language governing permissions and
1414
# limitations under the License.
1515

16-
config_paths="responses_api_models/local_vllm_model/configs/openai/gpt-oss-120b-reasoning-high.yaml"
16+
config_paths="responses_api_models/local_vllm_model/configs/openai/gpt-oss-20b-reasoning-high.yaml"
1717
ng_run "+config_paths=[${config_paths}]" \
18-
++gpt-oss-120b-reasoning-high.responses_api_models.local_vllm_model.vllm_serve_kwargs.tensor_parallel_size=16 \
19-
++gpt-oss-120b-reasoning-high.responses_api_models.local_vllm_model.vllm_serve_env_vars.VLLM_RAY_DP_PACK_STRATEGY=span
18+
++gpt-oss-20b-reasoning-high.responses_api_models.local_vllm_model.vllm_serve_kwargs.pipeline_parallel_size=2 \
19+
++gpt-oss-20b-reasoning-high.responses_api_models.local_vllm_model.vllm_serve_env_vars.VLLM_RAY_DP_PACK_STRATEGY=span

0 commit comments

Comments
 (0)