e2e_inference: Add rhods accelerator profile for Intel flex 140 GPU

Signed-off-by: vbedida79 <[email protected]>
intel · Dec 1, 2023 · c7dd264 · c7dd264
1 parent f8bc139
commit c7dd264
Show file tree

Hide file tree

Showing 4 changed files with 26 additions and 20 deletions.
diff --git a/docs/images/accelerator-profile-dropdown.png b/docs/images/accelerator-profile-dropdown.png
diff --git a/docs/images/openvino-accelerator-field.png b/docs/images/openvino-accelerator-field.png
diff --git a/e2e/inference/README.md b/e2e/inference/README.md
@@ -29,32 +29,25 @@ Follow this [link](https://github.com/openvinotoolkit/operator/blob/v1.1.0/docs/
 ## Work with interactive mode
 To enable the interactive mode, the OpenVINO notebook CR needs to be created and integrated with RHODS.  
 1.	Click on the ```create Notebook``` option in this [link](https://github.com/red-hat-data-services/odh-deployer) from the web console and follow these [steps](https://github.com/openvinotoolkit/operator/blob/main/docs/notebook_in_rhods.md) to create the notebook CR.
-2.	Enable Intel Data Center GPU 
+2.	Enable Intel Data Center GPU - **Technical Preview feature**
 
-**Note:** The  Intel Data Center GPU option is not visible in the RHODS UI at this time of release. For more details, please refer to this [issue](https://github.com/opendatahub-io/odh-dashboard/issues/956). Until this issue is resolved, please follow the steps below to enable the Intel Data Center GPU.
+Create AcceleratoProfile in the ```redhat-ods-applications``` namespace 
 
-a.	Search for the OpenVINO Notebook Server from web console ```Search -> Notebook -> Jupyter-nb-<ocp-user>``` in the namespace ```rhods-notebooks```.
+```$ oc apply -f https://raw.githubusercontent.com/intel/intel-technology-enabling-for-openshift/main/e2e/inference/accelerator_profile.yaml``` 
 
-b.	Navigate to notebook yaml and modify the yaml file according to the example shown below.
+3. Navigate to ```openvino-notebooks``` ImageStream and add the above created ```AcceleratorProfile``` key to the annotation field:
 
- ```   
-    containers:
-       name: jupyter-nb-<ocp-user-name>
-       resources:
-         limits:
-           cpu: '14'
-           gpu.intel.com/i915: '1'
-           memory: 56Gi
-         requests: 
-           cpu: '7'
-           gpu.intel.com/i915: '1'
-           memory: 56Gi
-  ```  
+```opendatahub.io/recommended-accelerators: '["gpu.intel.com/flex-140"]'```
+Below image shows this update.
 
-c.	This operation respawns the notebook server to use the Intel Data Center GPU.
+![Alt text](/docs/images/openvino-accelerator-field.png)
+
+4. If step 2 is successful, ```Intel® Data Center GPU Flex Series 140 accelerator``` is shown in the accelerator dropdown menu in ```rhods-dashboard```. Users can run OpenVINO notebook image with Intel® Data Center GPU Flex Series 140 card. Below image shows the dropdown menu. 
+
+![Alt text](/docs/images/accelerator-profile-dropdown.png) 
+
+5. For more details on RHODS and Jupyter notebook, follow the [link](https://github.com/openvinotoolkit/operator/blob/main/docs/notebook_in_rhods.md). For the details on specific notebooks please check this [link](https://github.com/openvinotoolkit/openvino_notebooks).
 
-3.	Run sample Jupyter Notebooks.
-Follow the [link](https://github.com/openvinotoolkit/operator/blob/main/docs/notebook_in_rhods.md) to execute the sample Jupyter Notebook There are 60+ sample notebooks 	available with this notebook image. For the details on the notebooks with Intel Data Center 	GPU, please check this [link](https://github.com/openvinotoolkit/openvino_notebooks).
 ## Work with deployment mode
 1.	From the web console, click on the ModelServer option in this [link](https://github.com/openvinotoolkit/operator/blob/v1.1.0/docs/operator_installation.md) and follow the [steps](https://github.com/openvinotoolkit/operator/blob/v1.1.0/docs/modelserver.md) to start the OVMS instance.  
 2.	To enable the Intel Data Center GPU, make sure to modify the OVMS instance options according to the screenshot below.

diff --git a/e2e/inference/accelerator_profile.yaml b/e2e/inference/accelerator_profile.yaml
@@ -0,0 +1,13 @@
+apiVersion: dashboard.opendatahub.io/v1
+kind: AcceleratorProfile
+metadata:
+  name: intel-gpu-flex-series-140
+spec:
+  displayName: Intel® Data Center GPU Flex Series 140
+  description: Intel Data Center GPU for inference 
+  enabled: true
+  identifier: gpu.intel.com/i915
+  tolerations:
+  - effect: NoSchedule
+    key: gpu.intel.com/flex-140
+    operator: Exists