From ab2d5c13b6992c41b9b3c6f0c90076b7c5b3813f Mon Sep 17 00:00:00 2001
From: fharo
Date: Tue, 9 Dec 2025 20:26:33 -0700
Subject: [PATCH 1/4] Adding a document for deploying fake-gpu-operator on Openshift 4.20.

---
 docs/deploying-on-openshift.md | 165 +++++++++++++++++++++++++++++++++
 1 file changed, 165 insertions(+)
 create mode 100644 docs/deploying-on-openshift.md

diff --git a/docs/deploying-on-openshift.md b/docs/deploying-on-openshift.md
new file mode 100644
index 0000000..8d8a1c3
--- /dev/null
+++ b/docs/deploying-on-openshift.md
@@ -0,0 +1,165 @@
+# Deploying fake-gpu-operator on OpenShift
+
+This document guides you through deploying fake-gpu-operator on an OpenShift cluster and running some basic validation. Unlike most Kubernetes-based clusters, OpenShift enforces tighter security controls on what pods and workloads can do. Because of that difference, this document is here to help others deploy the operator on OpenShift.
+
+It must be noted that this operator has limitations on what can be simulated. Do NOT expect to be able to plug it in and have it play nicely with GPU-requiring workloads such as vLLM or llm-d. As of this writing, the simulated GPUs support neither simulated inference nor simulated training.
+
+Note: Comparable OpenShift infrastructure and/or older OCP versions may work but have not been tested.
+Note: The oc CLI is used throughout this guide, but kubectl should work as well.
+Note: The fake-gpu-operator won't show up under Installed Operators in the web console, but don't interpret this as it not being installed.
+
+## Prerequisites
+
+Apart from what is already mentioned on the main README page, below are some further points to keep in mind.
+
+### Infrastructure Setup
+
+- AWS - An [EC2](https://aws.amazon.com/ec2/instance-types/m6a/) instance type of m6a.4xlarge was used for the OpenShift control plane nodes. The cluster was scaled up with 2 extra worker nodes for demonstration purposes. If you're a Red Hat associate, partner, or customer, you can provision a cluster through the Red Hat demo system.
+
+### Platform Setup
+- OpenShift - This guide was tested on OpenShift 4.20.
+- Cluster administrator privileges are required to grant the appropriate privileges to some of the fake-gpu-operator component service accounts.
+
+## Deployment Steps
+
+1. After logging into your OpenShift cluster, list the cluster nodes and decide which ones should expose simulated GPUs. This guide uses the worker nodes.
+    ```
+    $ oc get nodes
+    NAME                                        STATUS   ROLES                         AGE   VERSION
+    ip-10-0-13-14.us-east-2.compute.internal    Ready    worker                        21h   v1.33.5
+    ip-10-0-28-20.us-east-2.compute.internal    Ready    control-plane,master,worker   43h   v1.33.5
+    ip-10-0-30-218.us-east-2.compute.internal   Ready    worker                        21h   v1.33.5
+    ```
+1. Label the selected nodes.
+    ```
+    oc label node ip-10-0-13-14.us-east-2.compute.internal run.ai/simulated-gpu-node-pool=default
+    oc label node ip-10-0-30-218.us-east-2.compute.internal run.ai/simulated-gpu-node-pool=default
+    ```
+1. Deploy the Helm chart. You can find the version you want on the fake-gpu-operator repository releases page. Make sure to drop the "v" version prefix when running the helm command.
+    ```
+    helm upgrade -i gpu-operator oci://ghcr.io/run-ai/fake-gpu-operator/fake-gpu-operator --namespace gpu-operator --create-namespace --version 0.0.64
+    ```
+1. You should see Helm output like the following.
+    ```
+    Release "gpu-operator" does not exist. Installing it now.
+    Pulled: ghcr.io/run-ai/fake-gpu-operator/fake-gpu-operator:0.0.64
+    Digest: sha256:f3a96f26ebc3bd77a2c50c4f792c692064826b99906aead51720413e6936e08b
+    W1208 21:48:28.101177 3339253 warnings.go:70] would violate PodSecurity "restricted:latest": privileged (container "nvidia-device-plugin-ctr" must not set securityContext.privileged=true), allowPrivilegeEscalation != false (container "nvidia-device-plugin-ctr" must set securityContext.allowPrivilegeEscalation=false), unrestricted capabilities (container "nvidia-device-plugin-ctr" must set securityContext.capabilities.drop=["ALL"]), restricted volume types (volumes "device-plugin", "runai-bin-directory", "runai-shared-directory" use restricted volume type "hostPath"), runAsNonRoot != true (pod or container "nvidia-device-plugin-ctr" must set securityContext.runAsNonRoot=true), seccompProfile (pod or container "nvidia-device-plugin-ctr" must set securityContext.seccompProfile.type to "RuntimeDefault" or "Localhost")
+    W1208 21:48:28.136227 3339253 warnings.go:70] would violate PodSecurity "restricted:latest": privileged (container "hostpath-init" must not set securityContext.privileged=true), allowPrivilegeEscalation != false (containers "hostpath-init", "nvidia-dcgm-exporter" must set securityContext.allowPrivilegeEscalation=false), unrestricted capabilities (containers "hostpath-init", "nvidia-dcgm-exporter" must set securityContext.capabilities.drop=["ALL"]), restricted volume types (volume "runai-data" uses restricted volume type "hostPath"), runAsNonRoot != true (pod or containers "hostpath-init", "nvidia-dcgm-exporter" must set securityContext.runAsNonRoot=true), seccompProfile (pod or container "nvidia-dcgm-exporter" must set securityContext.seccompProfile.type to "RuntimeDefault" or "Localhost")
+    W1208 21:48:28.308365 3339253 warnings.go:70] would violate PodSecurity "restricted:latest": allowPrivilegeEscalation != false (container "nvidia-dcgm-exporter" must set securityContext.allowPrivilegeEscalation=false), unrestricted capabilities (container "nvidia-dcgm-exporter" must set securityContext.capabilities.drop=["ALL"]), restricted volume types (volume "runai-data" uses restricted volume type "hostPath"), runAsNonRoot != true (pod or container "nvidia-dcgm-exporter" must set securityContext.runAsNonRoot=true), seccompProfile (pod or container "nvidia-dcgm-exporter" must set securityContext.seccompProfile.type to "RuntimeDefault" or "Localhost")
+
+    NAME: gpu-operator
+    LAST DEPLOYED: Mon Dec 8 21:48:20 2025
+    NAMESPACE: gpu-operator
+    STATUS: deployed
+    REVISION: 1
+    TEST SUITE: None
+    ```
+5. However, due to OpenShift security controls, expect to see quite a number of events like the following.
+    ```
+    $ oc get events -n gpu-operator
+    LAST SEEN   TYPE      REASON         OBJECT                           MESSAGE
+    6s          Warning   FailedCreate   daemonset/device-plugin          Error creating: pods "device-plugin-" is forbidden: unable to validate against any security context constraint: [provider "anyuid": Forbidden: not usable by user or serviceaccount, spec.volumes[0]: Invalid value: "hostPath": hostPath volumes are not allowed to be used, spec.volumes[1]: Invalid value: "hostPath": hostPath volumes are not allowed to be used, spec.volumes[2]: Invalid value: "hostPath": hostPath volumes are not allowed to be used, provider restricted-v2: .containers[0].privileged: Invalid value: true: Privileged containers are not allowed, provider "restricted-v3": Forbidden: not usable by user or serviceaccount, provider "restricted": Forbidden: not usable by user or serviceaccount, provider "nested-container": Forbidden: not usable by user or serviceaccount, provider "nonroot-v2": Forbidden: not usable by user or serviceaccount, provider "nonroot": Forbidden: not usable by user or serviceaccount, provider "hostmount-anyuid": Forbidden: not usable by user or serviceaccount, provider "hostmount-anyuid-v2": Forbidden: not usable by user or serviceaccount, provider "machine-api-termination-handler": Forbidden: not usable by user or serviceaccount, provider "hostnetwork-v2": Forbidden: not usable by user or serviceaccount, provider "hostnetwork": Forbidden: not usable by user or serviceaccount, provider "hostaccess": Forbidden: not usable by user or serviceaccount, provider "insights-runtime-extractor-scc": Forbidden: not usable by user or serviceaccount, provider "node-exporter": Forbidden: not usable by user or serviceaccount, provider "privileged": Forbidden: not usable by user or serviceaccount]
+    6s          Warning   FailedCreate   daemonset/nvidia-dcgm-exporter   Error creating: pods "nvidia-dcgm-exporter-" is forbidden: unable to validate against any security context constraint: [provider "anyuid": Forbidden: not usable by user or serviceaccount, spec.volumes[0]: Invalid value: "hostPath": hostPath volumes are not allowed to be used, provider restricted-v2: .initContainers[0].privileged: Invalid value: true: Privileged containers are not allowed, provider "restricted-v3": Forbidden: not usable by user or serviceaccount, provider "restricted": Forbidden: not usable by user or serviceaccount, provider "nested-container": Forbidden: not usable by user or serviceaccount, provider "nonroot-v2": Forbidden: not usable by user or serviceaccount, provider "nonroot": Forbidden: not usable by user or serviceaccount, provider "hostmount-anyuid": Forbidden: not usable by user or serviceaccount, provider "hostmount-anyuid-v2": Forbidden: not usable by user or serviceaccount, provider "machine-api-termination-handler": Forbidden: not usable by user or serviceaccount, provider "hostnetwork-v2": Forbidden: not usable by user or serviceaccount, provider "hostnetwork": Forbidden: not usable by user or serviceaccount, provider "hostaccess": Forbidden: not usable by user or serviceaccount, provider "insights-runtime-extractor-scc": Forbidden: not usable by user or serviceaccount, provider "node-exporter": Forbidden: not usable by user or serviceaccount, provider "privileged": Forbidden: not usable by user or serviceaccount]
+    ...
+    ```
+6. As the admin user, grant the following service accounts sufficient privileges.
+    ```
+    oc adm policy add-scc-to-user privileged -z status-exporter -n gpu-operator
+    oc adm policy add-scc-to-user privileged -z nvidia-device-plugin -n gpu-operator
+    oc adm policy add-scc-to-user privileged -z mig-faker -n gpu-operator
+    ```
+7. Restart the affected deployments if pod creation and/or scheduling got stuck. If you don't see pods for certain controllers, scale them up to 1 as well.
+    ```
+    $ oc rollout restart deployment/nvidia-dcgm-exporter -n gpu-operator
+    deployment.apps/nvidia-dcgm-exporter restarted
+    ...
+    $ oc scale deployment.apps/nvidia-dcgm-exporter --replicas 1 -n gpu-operator
+    deployment.apps/nvidia-dcgm-exporter scaled
+    ```
+8. Validate that the deployments and daemonsets are up.
+    ```
+    $ oc get deploy,ds -n gpu-operator
+    NAME                                     READY   UP-TO-DATE   AVAILABLE   AGE
+    deployment.apps/gpu-operator             1/1     1            1           14m
+    deployment.apps/kwok-gpu-device-plugin   1/1     1            1           14m
+    deployment.apps/nvidia-dcgm-exporter     1/1     1            1           14m
+    deployment.apps/status-updater           1/1     1            1           14m
+    deployment.apps/topology-server          1/1     1            1           14m
+
+    NAME                                   DESIRED   CURRENT   READY   UP-TO-DATE   AVAILABLE   NODE SELECTOR                                    AGE
+    daemonset.apps/device-plugin           2         2         2       2            2           nvidia.com/gpu.deploy.device-plugin=true         14m
+    daemonset.apps/mig-faker               0         0         0       0            0           node-role.kubernetes.io/runai-dynamic-mig=true   14m
+    daemonset.apps/nvidia-dcgm-exporter    2         2         2       2            2           nvidia.com/gpu.deploy.dcgm-exporter=true         14m
+    ```
+9. Save the following content as a YAML file named `gpu-test-pod.yaml`.
+    ```
+    apiVersion: v1
+    kind: Pod
+    metadata:
+      name: gpu-test-pod
+      namespace: gpu-operator
+    spec:
+      containers:
+      - name: gpu-container
+        image: image-registry.openshift-image-registry.svc:5000/openshift/tools:latest
+        resources:
+          limits:
+            nvidia.com/gpu: 1
+        env:
+        - name: NODE_NAME
+          valueFrom:
+            fieldRef:
+              fieldPath: spec.nodeName
+        command:
+        - /bin/sh
+        - -c
+        - |
+          while true; do
+            echo "NODE_NAME=$NODE_NAME"
+            sleep 10
+          done
+    ```
+10. Create the pod that "needs" a GPU to simulate scheduling onto a "GPU" node.
+    ```
+    $ oc apply -f gpu-test-pod.yaml
+    pod/gpu-test-pod created
+    ```
+11. Confirm that the mock nvidia-smi command got injected into the pod's runtime and that you get the default simulated `Tesla-K80` GPU info.
+    ```
+    $ oc exec pod/gpu-test-pod -n gpu-operator -- nvidia-smi
+    Wed Dec 10 03:15:26 2025
+    +------------------------------------------------------------------------------+
+    | NVIDIA-SMI 470.129.06     Driver Version: 470.129.06     CUDA Version: 11.4  |
+    +--------------------------------+----------------------+----------------------+
+    | GPU  Name      Persistence-M   | Bus-Id        Disp.A | Volatile Uncorr. ECC |
+    | Fan  Temp  Perf  Pwr:Usage/Cap |         Memory-Usage | GPU-Util  Compute M. |
+    |                                |                      |               MIG M. |
+    +--------------------------------+----------------------+----------------------+
+    |   0  Tesla-K80           Off   | 00000001:00:00.0 Off |                  Off |
+    | N/A   33C   P8    11W / 70W    | 11441MiB / 11441MiB  |    100%      Default |
+    |                                |                      |                  N/A |
+    +--------------------------------+----------------------+----------------------+
+
+    +------------------------------------------------------------------------------+
+    | Processes:                                                                   |
+    |  GPU   GI   CI      PID   Type   Process name                    GPU Memory |
+    |        ID   ID                                                        Usage |
+    +------------------------------------------------------------------------------+
+    |    0   N/A  N/A      17      G   /bin/sh-cwhile true; do                    |
+    |  ..                                                                 11441MiB |
+    +------------------------------------------------------------------------------+
+    ```
+
+
+## Tips
+- Label at least 2 nodes; for some reason the nvidia-dcgm-exporter pods otherwise complain with the following strange messages:
+  ```
+  2025/12/09 05:16:40 Topology update not received within interval, publishing...
+  2025/12/09 05:16:40 Error getting configmap: topology-ip-10-0-13-14.us-east-2.compute.internal
+  ```
+- If for any reason you need to remove the Helm release, execute the following (replace the release name and namespace with your own).
+  ```
+  $ helm uninstall gpu-operator --namespace gpu-operator
+  release "gpu-operator" uninstalled
+  ```
\ No newline at end of file

From 61500eb814832f9fd6524d430c1111348d34185c Mon Sep 17 00:00:00 2001
From: fharo
Date: Wed, 10 Dec 2025 13:37:29 -0700
Subject: [PATCH 2/4] Removing unnecessary steps.

---
 docs/deploying-on-openshift.md | 40 ++++++----------------------------------
 1 file changed, 7 insertions(+), 33 deletions(-)

diff --git a/docs/deploying-on-openshift.md b/docs/deploying-on-openshift.md
index 8d8a1c3..d15fea3 100644
--- a/docs/deploying-on-openshift.md
+++ b/docs/deploying-on-openshift.md
@@ -35,49 +35,23 @@ Apart from what is already mentioned on the main README page, below are some fur
     oc label node ip-10-0-13-14.us-east-2.compute.internal run.ai/simulated-gpu-node-pool=default
     oc label node ip-10-0-30-218.us-east-2.compute.internal run.ai/simulated-gpu-node-pool=default
     ```
-1. Deploy the Helm chart. You can find the version you want on the fake-gpu-operator repository releases page. Make sure to drop the "v" version prefix when running the helm command.
+1. Deploy the Helm chart. You can find the version you want on the fake-gpu-operator repository releases page. Make sure to drop the "v" version prefix when running the helm command. Override the environment.openshift value from the values file so the appropriate security context constraints can be configured.
     ```
-    helm upgrade -i gpu-operator oci://ghcr.io/run-ai/fake-gpu-operator/fake-gpu-operator --namespace gpu-operator --create-namespace --version 0.0.64
+    helm upgrade -i gpu-operator oci://ghcr.io/run-ai/fake-gpu-operator/fake-gpu-operator --namespace gpu-operator --create-namespace --version 0.0.64 --set environment.openshift=true
     ```
 1. You should see Helm output like the following.
     ```
     Release "gpu-operator" does not exist. Installing it now.
     Pulled: ghcr.io/run-ai/fake-gpu-operator/fake-gpu-operator:0.0.64
     Digest: sha256:f3a96f26ebc3bd77a2c50c4f792c692064826b99906aead51720413e6936e08b
-    W1208 21:48:28.101177 3339253 warnings.go:70] would violate PodSecurity "restricted:latest": privileged (container "nvidia-device-plugin-ctr" must not set securityContext.privileged=true), allowPrivilegeEscalation != false (container "nvidia-device-plugin-ctr" must set securityContext.allowPrivilegeEscalation=false), unrestricted capabilities (container "nvidia-device-plugin-ctr" must set securityContext.capabilities.drop=["ALL"]), restricted volume types (volumes "device-plugin", "runai-bin-directory", "runai-shared-directory" use restricted volume type "hostPath"), runAsNonRoot != true (pod or container "nvidia-device-plugin-ctr" must set securityContext.runAsNonRoot=true), seccompProfile (pod or container "nvidia-device-plugin-ctr" must set securityContext.seccompProfile.type to "RuntimeDefault" or "Localhost")
-    W1208 21:48:28.136227 3339253 warnings.go:70] would violate PodSecurity "restricted:latest": privileged (container "hostpath-init" must not set securityContext.privileged=true), allowPrivilegeEscalation != false (containers "hostpath-init", "nvidia-dcgm-exporter" must set securityContext.allowPrivilegeEscalation=false), unrestricted capabilities (containers "hostpath-init", "nvidia-dcgm-exporter" must set securityContext.capabilities.drop=["ALL"]), restricted volume types (volume "runai-data" uses restricted volume type "hostPath"), runAsNonRoot != true (pod or containers "hostpath-init", "nvidia-dcgm-exporter" must set securityContext.runAsNonRoot=true), seccompProfile (pod or container "nvidia-dcgm-exporter" must set securityContext.seccompProfile.type to "RuntimeDefault" or "Localhost")
-    W1208 21:48:28.308365 3339253 warnings.go:70] would violate PodSecurity "restricted:latest": allowPrivilegeEscalation != false (container "nvidia-dcgm-exporter" must set securityContext.allowPrivilegeEscalation=false), unrestricted capabilities (container "nvidia-dcgm-exporter" must set securityContext.capabilities.drop=["ALL"]), restricted volume types (volume "runai-data" uses restricted volume type "hostPath"), runAsNonRoot != true (pod or container "nvidia-dcgm-exporter" must set securityContext.runAsNonRoot=true), seccompProfile (pod or container "nvidia-dcgm-exporter" must set securityContext.seccompProfile.type to "RuntimeDefault" or "Localhost")
-
     NAME: gpu-operator
-    LAST DEPLOYED: Mon Dec 8 21:48:20 2025
+    LAST DEPLOYED: Wed Dec 10 13:30:57 2025
     NAMESPACE: gpu-operator
     STATUS: deployed
     REVISION: 1
     TEST SUITE: None
     ```
-5. However, due to OpenShift security controls, expect to see quite a number of events like the following.
-    ```
-    $ oc get events -n gpu-operator
-    LAST SEEN   TYPE      REASON         OBJECT                           MESSAGE
-    6s          Warning   FailedCreate   daemonset/device-plugin          Error creating: pods "device-plugin-" is forbidden: unable to validate against any security context constraint: [provider "anyuid": Forbidden: not usable by user or serviceaccount, spec.volumes[0]: Invalid value: "hostPath": hostPath volumes are not allowed to be used, spec.volumes[1]: Invalid value: "hostPath": hostPath volumes are not allowed to be used, spec.volumes[2]: Invalid value: "hostPath": hostPath volumes are not allowed to be used, provider restricted-v2: .containers[0].privileged: Invalid value: true: Privileged containers are not allowed, provider "restricted-v3": Forbidden: not usable by user or serviceaccount, provider "restricted": Forbidden: not usable by user or serviceaccount, provider "nested-container": Forbidden: not usable by user or serviceaccount, provider "nonroot-v2": Forbidden: not usable by user or serviceaccount, provider "nonroot": Forbidden: not usable by user or serviceaccount, provider "hostmount-anyuid": Forbidden: not usable by user or serviceaccount, provider "hostmount-anyuid-v2": Forbidden: not usable by user or serviceaccount, provider "machine-api-termination-handler": Forbidden: not usable by user or serviceaccount, provider "hostnetwork-v2": Forbidden: not usable by user or serviceaccount, provider "hostnetwork": Forbidden: not usable by user or serviceaccount, provider "hostaccess": Forbidden: not usable by user or serviceaccount, provider "insights-runtime-extractor-scc": Forbidden: not usable by user or serviceaccount, provider "node-exporter": Forbidden: not usable by user or serviceaccount, provider "privileged": Forbidden: not usable by user or serviceaccount]
-    6s          Warning   FailedCreate   daemonset/nvidia-dcgm-exporter   Error creating: pods "nvidia-dcgm-exporter-" is forbidden: unable to validate against any security context constraint: [provider "anyuid": Forbidden: not usable by user or serviceaccount, spec.volumes[0]: Invalid value: "hostPath": hostPath volumes are not allowed to be used, provider restricted-v2: .initContainers[0].privileged: Invalid value: true: Privileged containers are not allowed, provider "restricted-v3": Forbidden: not usable by user or serviceaccount, provider "restricted": Forbidden: not usable by user or serviceaccount, provider "nested-container": Forbidden: not usable by user or serviceaccount, provider "nonroot-v2": Forbidden: not usable by user or serviceaccount, provider "nonroot": Forbidden: not usable by user or serviceaccount, provider "hostmount-anyuid": Forbidden: not usable by user or serviceaccount, provider "hostmount-anyuid-v2": Forbidden: not usable by user or serviceaccount, provider "machine-api-termination-handler": Forbidden: not usable by user or serviceaccount, provider "hostnetwork-v2": Forbidden: not usable by user or serviceaccount, provider "hostnetwork": Forbidden: not usable by user or serviceaccount, provider "hostaccess": Forbidden: not usable by user or serviceaccount, provider "insights-runtime-extractor-scc": Forbidden: not usable by user or serviceaccount, provider "node-exporter": Forbidden: not usable by user or serviceaccount, provider "privileged": Forbidden: not usable by user or serviceaccount]
-    ...
-    ```
-6. As the admin user, grant the following service accounts sufficient privileges.
-    ```
-    oc adm policy add-scc-to-user privileged -z status-exporter -n gpu-operator
-    oc adm policy add-scc-to-user privileged -z nvidia-device-plugin -n gpu-operator
-    oc adm policy add-scc-to-user privileged -z mig-faker -n gpu-operator
-    ```
-7. Restart the affected deployments if pod creation and/or scheduling got stuck. If you don't see pods for certain controllers, scale them up to 1 as well.
-    ```
-    $ oc rollout restart deployment/nvidia-dcgm-exporter -n gpu-operator
-    deployment.apps/nvidia-dcgm-exporter restarted
-    ...
-    $ oc scale deployment.apps/nvidia-dcgm-exporter --replicas 1 -n gpu-operator
-    deployment.apps/nvidia-dcgm-exporter scaled
-    ```
-8. Validate that the deployments and daemonsets are up.
+1. Validate that the deployments and daemonsets are up.
     ```
     $ oc get deploy,ds -n gpu-operator
     NAME                                     READY   UP-TO-DATE   AVAILABLE   AGE
     deployment.apps/gpu-operator             1/1     1            1           14m
     deployment.apps/kwok-gpu-device-plugin   1/1     1            1           14m
     deployment.apps/nvidia-dcgm-exporter     1/1     1            1           14m
     deployment.apps/status-updater           1/1     1            1           14m
     deployment.apps/topology-server          1/1     1            1           14m
 
     NAME                                   DESIRED   CURRENT   READY   UP-TO-DATE   AVAILABLE   NODE SELECTOR                                    AGE
     daemonset.apps/device-plugin           2         2         2       2            2           nvidia.com/gpu.deploy.device-plugin=true         14m
     daemonset.apps/mig-faker               0         0         0       0            0           node-role.kubernetes.io/runai-dynamic-mig=true   14m
     daemonset.apps/nvidia-dcgm-exporter    2         2         2       2            2           nvidia.com/gpu.deploy.dcgm-exporter=true         14m
     ```
-9. Save the following content as a YAML file named `gpu-test-pod.yaml`.
+1. Save the following content as a YAML file named `gpu-test-pod.yaml`.
     ```
     apiVersion: v1
     kind: Pod
     metadata:
       name: gpu-test-pod
       namespace: gpu-operator
     spec:
       containers:
       - name: gpu-container
         image: image-registry.openshift-image-registry.svc:5000/openshift/tools:latest
         resources:
           limits:
             nvidia.com/gpu: 1
         env:
         - name: NODE_NAME
           valueFrom:
             fieldRef:
               fieldPath: spec.nodeName
         command:
         - /bin/sh
         - -c
         - |
           while true; do
             echo "NODE_NAME=$NODE_NAME"
             sleep 10
           done
     ```
-10. Create the pod that "needs" a GPU to simulate scheduling onto a "GPU" node.
+1. Create the pod that "needs" a GPU to simulate scheduling onto a "GPU" node.
     ```
     $ oc apply -f gpu-test-pod.yaml
     pod/gpu-test-pod created
     ```
-11. Confirm that the mock nvidia-smi command got injected into the pod's runtime and that you get the default simulated `Tesla-K80` GPU info.
+1. Confirm that the mock nvidia-smi command got injected into the pod's runtime and that you get the default simulated `Tesla-K80` GPU info.
     ```
     $ oc exec pod/gpu-test-pod -n gpu-operator -- nvidia-smi
     Wed Dec 10 03:15:26 2025

From 40ba54711c864293df49fb24812a67c5b8db731c Mon Sep 17 00:00:00 2001
From: fharo
Date: Wed, 10 Dec 2025 13:43:03 -0700
Subject: [PATCH 3/4] Clarifying the docs and putting helpful doc links to alternative tooling.

---
 docs/deploying-on-openshift.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/docs/deploying-on-openshift.md b/docs/deploying-on-openshift.md
index d15fea3..a388710 100644
--- a/docs/deploying-on-openshift.md
+++ b/docs/deploying-on-openshift.md
@@ -2,7 +2,7 @@
 
 This document guides you through deploying fake-gpu-operator on an OpenShift cluster and running some basic validation. Unlike most Kubernetes-based clusters, OpenShift enforces tighter security controls on what pods and workloads can do. Because of that difference, this document is here to help others deploy the operator on OpenShift.
 
-It must be noted that this operator has limitations on what can be simulated. Do NOT expect to be able to plug it in and have it play nicely with GPU-requiring workloads such as vLLM or llm-d. As of this writing, the simulated GPUs support neither simulated inference nor simulated training.
+It must be noted that this operator has limitations on what can be simulated. Do NOT expect to be able to plug it in and have it play nicely with GPU-requiring workloads such as vLLM or llm-d. As of this writing, the simulated GPUs do not support simulated inference. If you need that functionality, consider looking at [llm-d/llm-d simulated-accelerators](https://github.com/llm-d/llm-d/tree/main/guides/simulated-accelerators) and/or [llm-d/llm-d-inference-sim](https://github.com/llm-d/llm-d-inference-sim).
 
 Note: Comparable OpenShift infrastructure and/or older OCP versions may work but have not been tested.
 Note: The oc CLI is used throughout this guide, but kubectl should work as well.

From 875a88d459c36bd8c71731af600294cc0b827674 Mon Sep 17 00:00:00 2001
From: fharo
Date: Wed, 10 Dec 2025 13:46:01 -0700
Subject: [PATCH 4/4] Removing untrue comment.

---
 docs/deploying-on-openshift.md | 1 -
 1 file changed, 1 deletion(-)

diff --git a/docs/deploying-on-openshift.md b/docs/deploying-on-openshift.md
index a388710..cb8e9cd 100644
--- a/docs/deploying-on-openshift.md
+++ b/docs/deploying-on-openshift.md
@@ -6,7 +6,6 @@ It must be noted that this operator has limitations on what can be simulated. Do
 
 Note: Comparable OpenShift infrastructure and/or older OCP versions may work but have not been tested.
 Note: The oc CLI is used throughout this guide, but kubectl should work as well.
-Note: The fake-gpu-operator won't show up under Installed Operators in the web console, but don't interpret this as it not being installed.
 
 ## Prerequisites
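For anyone who wants to try the documentation above locally, the series is plain `git format-patch` output and can be applied with `git am`. A minimal sketch, assuming the four patches were saved as `.patch` files in the current directory; the filenames and branch name are placeholders, not part of the series:

```
# Run from the root of a fake-gpu-operator clone, on a new branch.
# Pass the patch files in series order (0001..0004 if you kept the default names).
git checkout -b openshift-deploy-docs
git am --3way *.patch
```

If `git am` stops on a conflict, `git am --abort` returns the branch to its previous state.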