Merged
38 changes: 19 additions & 19 deletions site-src/guides/getting-started-latest.md
@@ -56,7 +56,7 @@ kubectl apply -k https://github.com/kubernetes-sigs/gateway-api-inference-extens
1. Requirements
- Gateway API [CRDs](https://gateway-api.sigs.k8s.io/guides/#installing-gateway-api) installed.

-2. Install Istio
+1. Install Istio:

```
TAG=$(curl https://storage.googleapis.com/istio-build/dev/1.28-dev)
@@ -79,14 +79,14 @@ kubectl apply -k https://github.com/kubernetes-sigs/gateway-api-inference-extens

- Gateway API [CRDs](https://gateway-api.sigs.k8s.io/guides/#installing-gateway-api) installed.

-2. Set the Kgateway version and install the Kgateway CRDs.
+1. Set the Kgateway version and install the Kgateway CRDs:

```bash
KGTW_VERSION=v2.1.0
helm upgrade -i --create-namespace --namespace kgateway-system --version $KGTW_VERSION kgateway-crds oci://cr.kgateway.dev/kgateway-dev/charts/kgateway-crds
```

-3. Install Kgateway
+1. Install Kgateway:

```bash
helm upgrade -i --namespace kgateway-system --version $KGTW_VERSION kgateway oci://cr.kgateway.dev/kgateway-dev/charts/kgateway --set inferenceExtension.enabled=true
@@ -114,7 +114,7 @@ kubectl apply -k https://github.com/kubernetes-sigs/gateway-api-inference-extens
See [Deploy Inference Gateways](https://cloud.google.com/kubernetes-engine/docs/how-to/deploy-gke-inference-gateway)
for detailed instructions.

-2. Deploy the Inference Gateway:
+1. Deploy the Inference Gateway:

```bash
kubectl apply -f https://github.com/kubernetes-sigs/gateway-api-inference-extension/raw/main/config/manifests/gateway/gke/gateway.yaml
@@ -127,13 +127,13 @@ kubectl apply -k https://github.com/kubernetes-sigs/gateway-api-inference-extens
NAME CLASS ADDRESS PROGRAMMED AGE
inference-gateway inference-gateway <MY_ADDRESS> True 22s
```
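The `PROGRAMMED` column shown above may take a few moments to become `True`. A minimal sketch of polling for it follows; `retry_until` and `gateway_programmed` are hypothetical helper names that are not part of the guide, and the sketch assumes the Gateway exposes a `Programmed` entry in `.status.conditions`.

```bash
# Hypothetical helper (not part of the guide): retry a command until it
# succeeds, up to a maximum number of attempts, sleeping between tries.
# Usage: retry_until <max_attempts> <sleep_seconds> <command> [args...]
retry_until() {
  max_attempts=$1
  delay=$2
  shift 2
  attempt=1
  while [ "$attempt" -le "$max_attempts" ]; do
    if "$@"; then
      return 0
    fi
    attempt=$((attempt + 1))
    # Only sleep when another attempt will follow.
    if [ "$attempt" -le "$max_attempts" ]; then
      sleep "$delay"
    fi
  done
  return 1
}

# Succeeds only when the Gateway named in the guide reports Programmed=True.
# Requires kubectl access to the cluster.
gateway_programmed() {
  status=$(kubectl get gateway inference-gateway \
    -o jsonpath='{.status.conditions[?(@.type=="Programmed")].status}')
  [ "$status" = "True" ]
}
```

With cluster access, `retry_until 30 2 gateway_programmed` would poll up to 30 times at 2-second intervals. `kubectl wait --for=condition=Programmed gateway/inference-gateway --timeout=60s` should achieve the same result without a helper.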
-3. Deploy the HTTPRoute
+1. Deploy the HTTPRoute:

```bash
kubectl apply -f https://github.com/kubernetes-sigs/gateway-api-inference-extension/raw/main/config/manifests/gateway/gke/httproute.yaml
```

-4. Confirm that the HTTPRoute status conditions include `Accepted=True` and `ResolvedRefs=True`:
+1. Confirm that the HTTPRoute status conditions include `Accepted=True` and `ResolvedRefs=True`:

```bash
kubectl get httproute llm-route -o yaml
@@ -144,7 +144,7 @@ kubectl apply -k https://github.com/kubernetes-sigs/gateway-api-inference-extens
Please note that this feature is currently in an experimental phase and is not intended for production use.
The implementation and user experience are subject to changes as we continue to iterate on this project.

-1. Deploy the Inference Gateway
+1. Deploy the Inference Gateway:

```bash
kubectl apply -f https://github.com/kubernetes-sigs/gateway-api-inference-extension/raw/main/config/manifests/gateway/istio/gateway.yaml
@@ -157,13 +157,13 @@ kubectl apply -k https://github.com/kubernetes-sigs/gateway-api-inference-extens
inference-gateway inference-gateway <MY_ADDRESS> True 22s
```

-2. Deploy the HTTPRoute
+1. Deploy the HTTPRoute:

```bash
kubectl apply -f https://github.com/kubernetes-sigs/gateway-api-inference-extension/raw/main/config/manifests/gateway/istio/httproute.yaml
```

-3. Confirm that the HTTPRoute status conditions include `Accepted=True` and `ResolvedRefs=True`:
+1. Confirm that the HTTPRoute status conditions include `Accepted=True` and `ResolvedRefs=True`:

```bash
kubectl get httproute llm-route -o yaml
@@ -176,7 +176,7 @@ kubectl apply -k https://github.com/kubernetes-sigs/gateway-api-inference-extens
implementation. Kgateway supports Inference Gateway with the [agentgateway](https://agentgateway.dev/) data plane. Follow these steps
to run Kgateway as an Inference Gateway:

-1. Deploy the Inference Gateway
+1. Deploy the Inference Gateway:

```bash
kubectl apply -f https://github.com/kubernetes-sigs/gateway-api-inference-extension/raw/main/config/manifests/gateway/kgateway/gateway.yaml
@@ -187,13 +187,13 @@ kubectl apply -k https://github.com/kubernetes-sigs/gateway-api-inference-extens
kubectl get gateway inference-gateway
```

-2. Deploy the HTTPRoute
+1. Deploy the HTTPRoute:

```bash
kubectl apply -f https://github.com/kubernetes-sigs/gateway-api-inference-extension/raw/main/config/manifests/gateway/kgateway/httproute.yaml
```
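The confirmation step that follows asks for `Accepted=True` and `ResolvedRefs=True` in the HTTPRoute status. A minimal sketch of checking this without reading the full YAML is given here; `route_conditions` and `route_is_ready` are hypothetical helper names, and the sketch assumes the conditions are reported under `.status.parents[*].conditions`, as is usual for Gateway API routes.

```bash
# Print each condition of the llm-route HTTPRoute as Type=Status, one per line.
# Requires kubectl access to the cluster.
route_conditions() {
  kubectl get httproute llm-route \
    -o jsonpath='{range .status.parents[*].conditions[*]}{.type}={.status}{"\n"}{end}'
}

# Hypothetical helper: reads "Type=Status" lines on stdin and succeeds only if
# both Accepted=True and ResolvedRefs=True appear. If the route has several
# parents, a match from any one of them is enough.
route_is_ready() {
  conditions=$(cat)
  printf '%s\n' "$conditions" | grep -qx 'Accepted=True' &&
    printf '%s\n' "$conditions" | grep -qx 'ResolvedRefs=True'
}
```

With cluster access, `route_conditions | route_is_ready && echo "llm-route is ready"` combines the two. Keeping the parsing separate from the `kubectl` call lets the check be exercised on sample input.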

-3. Confirm that the HTTPRoute status conditions include `Accepted=True` and `ResolvedRefs=True`:
+1. Confirm that the HTTPRoute status conditions include `Accepted=True` and `ResolvedRefs=True`:

```bash
kubectl get httproute llm-route -o yaml
@@ -216,7 +216,7 @@ Deploy the sample InferenceObjective which allows you to specify priority of req
The following instructions assume you would like to cleanup ALL resources that were created in this quickstart guide.
Please be careful not to delete resources you'd like to keep.

-1. Uninstall the InferencePool, InferenceObjective and model server resources
+1. Uninstall the InferencePool, InferenceObjective and model server resources:

```bash
helm uninstall vllm-llama3-8b-instruct
@@ -227,7 +227,7 @@ Deploy the sample InferenceObjective which allows you to specify priority of req
kubectl delete secret hf-token --ignore-not-found
```

-1. Uninstall the Gateway API Inference Extension CRDs
+1. Uninstall the Gateway API Inference Extension CRDs:

```bash
kubectl delete -k https://github.com/kubernetes-sigs/gateway-api-inference-extension/config/crd --ignore-not-found
@@ -252,13 +252,13 @@ Deploy the sample InferenceObjective which allows you to specify priority of req

The following steps assume you would like to clean up ALL Istio resources that were created in this quickstart guide.

-1. Uninstall All Istio resources
+1. Uninstall All Istio resources:

```bash
istioctl uninstall -y --purge
```

-2. Remove the Istio namespace
+1. Remove the Istio namespace:

```bash
kubectl delete ns istio-system
@@ -273,19 +273,19 @@ Deploy the sample InferenceObjective which allows you to specify priority of req

The following steps assume you would like to cleanup ALL Kgateway resources that were created in this quickstart guide.

-1. Uninstall Kgateway
+1. Uninstall Kgateway:

```bash
helm uninstall kgateway -n kgateway-system
```

-2. Uninstall the Kgateway CRDs.
+1. Uninstall the Kgateway CRDs:

```bash
helm uninstall kgateway-crds -n kgateway-system
```

-3. Remove the Kgateway namespace.
+1. Remove the Kgateway namespace:

```bash
kubectl delete ns kgateway-system
38 changes: 19 additions & 19 deletions site-src/guides/index.md
@@ -52,7 +52,7 @@ kubectl apply -f https://github.com/kubernetes-sigs/gateway-api-inference-extens
1. Requirements
- Gateway API [CRDs](https://gateway-api.sigs.k8s.io/guides/#installing-gateway-api) installed.

-2. Install Istio
+1. Install Istio:

```
TAG=$(curl https://storage.googleapis.com/istio-build/dev/1.28-dev)
@@ -75,14 +75,14 @@ kubectl apply -f https://github.com/kubernetes-sigs/gateway-api-inference-extens

- Gateway API [CRDs](https://gateway-api.sigs.k8s.io/guides/#installing-gateway-api) installed.

-2. Set the Kgateway version and install the Kgateway CRDs.
+1. Set the Kgateway version and install the Kgateway CRDs:

```bash
KGTW_VERSION=v2.2.0-main
helm upgrade -i --create-namespace --namespace kgateway-system --version $KGTW_VERSION kgateway-crds oci://cr.kgateway.dev/kgateway-dev/charts/kgateway-crds
```

-3. Install Kgateway
+1. Install Kgateway:

```bash
helm upgrade -i --namespace kgateway-system --version $KGTW_VERSION kgateway oci://cr.kgateway.dev/kgateway-dev/charts/kgateway --set inferenceExtension.enabled=true
@@ -110,7 +110,7 @@ kubectl apply -f https://github.com/kubernetes-sigs/gateway-api-inference-extens
See [Deploy Inference Gateways](https://cloud.google.com/kubernetes-engine/docs/how-to/deploy-gke-inference-gateway)
for detailed instructions.

-2. Deploy the Inference Gateway:
+1. Deploy the Inference Gateway:

```bash
kubectl apply -f https://raw.githubusercontent.com/kubernetes-sigs/gateway-api-inference-extension/refs/tags/v1.1.0/config/manifests/gateway/gke/gateway.yaml
@@ -123,13 +123,13 @@ kubectl apply -f https://github.com/kubernetes-sigs/gateway-api-inference-extens
NAME CLASS ADDRESS PROGRAMMED AGE
inference-gateway inference-gateway <MY_ADDRESS> True 22s
```
-3. Deploy the HTTPRoute
+1. Deploy the HTTPRoute:

```bash
kubectl apply -f https://raw.githubusercontent.com/kubernetes-sigs/gateway-api-inference-extension/refs/tags/v1.1.0/config/manifests/gateway/gke/httproute.yaml
```

-4. Confirm that the HTTPRoute status conditions include `Accepted=True` and `ResolvedRefs=True`:
+1. Confirm that the HTTPRoute status conditions include `Accepted=True` and `ResolvedRefs=True`:

```bash
kubectl get httproute llm-route -o yaml
@@ -140,7 +140,7 @@ kubectl apply -f https://github.com/kubernetes-sigs/gateway-api-inference-extens
Please note that this feature is currently in an experimental phase and is not intended for production use.
The implementation and user experience are subject to changes as we continue to iterate on this project.

-1. Deploy the Inference Gateway
+1. Deploy the Inference Gateway:

```bash
kubectl apply -f https://github.com/kubernetes-sigs/gateway-api-inference-extension/raw/main/config/manifests/gateway/istio/gateway.yaml
@@ -153,13 +153,13 @@ kubectl apply -f https://github.com/kubernetes-sigs/gateway-api-inference-extens
inference-gateway inference-gateway <MY_ADDRESS> True 22s
```

-2. Deploy the HTTPRoute
+1. Deploy the HTTPRoute:

```bash
kubectl apply -f https://github.com/kubernetes-sigs/gateway-api-inference-extension/raw/main/config/manifests/gateway/istio/httproute.yaml
```

-3. Confirm that the HTTPRoute status conditions include `Accepted=True` and `ResolvedRefs=True`:
+1. Confirm that the HTTPRoute status conditions include `Accepted=True` and `ResolvedRefs=True`:

```bash
kubectl get httproute llm-route -o yaml
@@ -172,7 +172,7 @@ kubectl apply -f https://github.com/kubernetes-sigs/gateway-api-inference-extens
implementation. Kgateway supports Inference Gateway with the [agentgateway](https://agentgateway.dev/) data plane. Follow these steps
to run Kgateway as an Inference Gateway:

-1. Deploy the Inference Gateway
+1. Deploy the Inference Gateway:

```bash
kubectl apply -f https://raw.githubusercontent.com/kubernetes-sigs/gateway-api-inference-extension/refs/tags/v1.1.0/config/manifests/gateway/agentgateway/gateway.yaml
@@ -183,13 +183,13 @@ kubectl apply -f https://github.com/kubernetes-sigs/gateway-api-inference-extens
kubectl get gateway inference-gateway
```

-2. Deploy the HTTPRoute
+1. Deploy the HTTPRoute:

```bash
kubectl apply -f https://raw.githubusercontent.com/kubernetes-sigs/gateway-api-inference-extension/refs/tags/v1.1.0/config/manifests/gateway/agentgateway/httproute.yaml
```

-3. Confirm that the HTTPRoute status conditions include `Accepted=True` and `ResolvedRefs=True`:
+1. Confirm that the HTTPRoute status conditions include `Accepted=True` and `ResolvedRefs=True`:

```bash
kubectl get httproute llm-route -o yaml
@@ -212,7 +212,7 @@ Deploy the sample InferenceObjective which allows you to specify priority of req
The following instructions assume you would like to cleanup ALL resources that were created in this quickstart guide.
Please be careful not to delete resources you'd like to keep.

-1. Uninstall the InferencePool, InferenceObjective and model server resources
+1. Uninstall the InferencePool, InferenceObjective and model server resources:

```bash
helm uninstall vllm-llama3-8b-instruct
@@ -223,7 +223,7 @@ Deploy the sample InferenceObjective which allows you to specify priority of req
kubectl delete secret hf-token --ignore-not-found
```

-1. Uninstall the Gateway API Inference Extension CRDs
+1. Uninstall the Gateway API Inference Extension CRDs:

```bash
kubectl delete -f https://github.com/kubernetes-sigs/gateway-api-inference-extension/releases/download/v1.1.0/manifests.yaml --ignore-not-found
@@ -249,13 +249,13 @@ Deploy the sample InferenceObjective which allows you to specify priority of req

The following steps assume you would like to clean up ALL Istio resources that were created in this quickstart guide.

-1. Uninstall All Istio resources
+1. Uninstall All Istio resources:

```bash
istioctl uninstall -y --purge
```

-2. Remove the Istio namespace
+1. Remove the Istio namespace:

```bash
kubectl delete ns istio-system
@@ -270,19 +270,19 @@ Deploy the sample InferenceObjective which allows you to specify priority of req

The following steps assume you would like to cleanup ALL Kgateway resources that were created in this quickstart guide.

-1. Uninstall Kgateway
+1. Uninstall Kgateway:

```bash
helm uninstall kgateway -n kgateway-system
```

-2. Uninstall the Kgateway CRDs.
+1. Uninstall the Kgateway CRDs:

```bash
helm uninstall kgateway-crds -n kgateway-system
```

-3. Remove the Kgateway namespace.
+1. Remove the Kgateway namespace:

```bash
kubectl delete ns kgateway-system