Skip to content

Commit 3cd3bf2

Browse files
committed
docs: Update NGF inference doc
Updated the supported inference version. Also removed a port that isn't used.
1 parent e041721 commit 3cd3bf2

File tree

1 file changed

+2
-3
lines changed

1 file changed

+2
-3
lines changed

content/ngf/how-to/gateway-api-inference-extension.md

Lines changed: 2 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -70,7 +70,7 @@ NGINX will query the Endpoint Picker Extension to determine the appropriate pod
7070
{{< call-out "warning" >}} The Endpoint Picker Extension is a third-party application written and provided by the Gateway API Inference Extension project. Communication between NGINX and the Endpoint Picker uses TLS with certificate verification disabled by default, as the Endpoint Picker does not currently support mounting CA certificates. The Gateway API Inference Extension is in alpha status and should not be used in production. NGINX Gateway Fabric is not responsible for any threats or risks associated with using this third-party Endpoint Picker Extension application. {{< /call-out >}}
7171

7272
```shell
73-
export IGW_CHART_VERSION=v1.0.1
73+
export IGW_CHART_VERSION=v1.1.0
7474
helm install vllm-llama3-8b-instruct \
7575
--set inferencePool.modelServers.matchLabels.app=vllm-llama3-8b-instruct \
7676
--version $IGW_CHART_VERSION \
@@ -113,7 +113,7 @@ GW_IP=XXX.YYY.ZZZ.III
113113
GW_PORT=<port number>
114114
```
115115

116-
## Deploy a HTTPRoute
116+
## Deploy an HTTPRoute
117117

118118
```yaml
119119
kubectl apply -f - <<EOF
@@ -131,7 +131,6 @@ spec:
131131
- group: inference.networking.k8s.io
132132
kind: InferencePool
133133
name: vllm-llama3-8b-instruct
134-
port: 3000
135134
matches:
136135
- path:
137136
type: PathPrefix

0 commit comments

Comments
 (0)