Skip to content

Commit

Permalink
e2e_inference: Add rhods accelerator profile for Intel flex 140 GPU
Browse files Browse the repository at this point in the history
Signed-off-by: vbedida79 <[email protected]>
  • Loading branch information
vbedida79 committed Dec 1, 2023
1 parent f8bc139 commit c7dd264
Show file tree
Hide file tree
Showing 4 changed files with 26 additions and 20 deletions.
Binary file added docs/images/accelerator-profile-dropdown.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs/images/openvino-accelerator-field.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
33 changes: 13 additions & 20 deletions e2e/inference/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -29,32 +29,25 @@ Follow this [link](https://github.com/openvinotoolkit/operator/blob/v1.1.0/docs/
## Work with interactive mode
To enable the interactive mode, the OpenVINO notebook CR needs to be created and integrated with RHODS.
1. Click on the ```create Notebook``` option in this [link](https://github.com/red-hat-data-services/odh-deployer) from the web console and follow these [steps](https://github.com/openvinotoolkit/operator/blob/main/docs/notebook_in_rhods.md) to create the notebook CR.
2. Enable Intel Data Center GPU
2. Enable Intel Data Center GPU - **Technical Preview feature**

**Note:** The Intel Data Center GPU option is not visible in the RHODS UI at this time of release. For more details, please refer to this [issue](https://github.com/opendatahub-io/odh-dashboard/issues/956). Until this issue is resolved, please follow the steps below to enable the Intel Data Center GPU.
Create AcceleratoProfile in the ```redhat-ods-applications``` namespace

a. Search for the OpenVINO Notebook Server from web console ```Search -> Notebook -> Jupyter-nb-<ocp-user>``` in the namespace ```rhods-notebooks```.
```$ oc apply -f https://raw.githubusercontent.com/intel/intel-technology-enabling-for-openshift/main/e2e/inference/accelerator_profile.yaml```

b. Navigate to notebook yaml and modify the yaml file according to the example shown below.
3. Navigate to ```openvino-notebooks``` ImageStream and add the above created ```AcceleratorProfile``` key to the annotation field:

```
containers:
name: jupyter-nb-<ocp-user-name>
resources:
limits:
cpu: '14'
gpu.intel.com/i915: '1'
memory: 56Gi
requests:
cpu: '7'
gpu.intel.com/i915: '1'
memory: 56Gi
```
```opendatahub.io/recommended-accelerators: '["gpu.intel.com/flex-140"]'```
Below image shows this update.

c. This operation respawns the notebook server to use the Intel Data Center GPU.
![Alt text](/docs/images/openvino-accelerator-field.png)

4. If step 2 is successful, ```Intel® Data Center GPU Flex Series 140 accelerator``` is shown in the accelerator dropdown menu in ```rhods-dashboard```. Users can run OpenVINO notebook image with Intel® Data Center GPU Flex Series 140 card. Below image shows the dropdown menu.

![Alt text](/docs/images/accelerator-profile-dropdown.png)

5. For more details on RHODS and Jupyter notebook, follow the [link](https://github.com/openvinotoolkit/operator/blob/main/docs/notebook_in_rhods.md). For the details on specific notebooks please check this [link](https://github.com/openvinotoolkit/openvino_notebooks).

3. Run sample Jupyter Notebooks.
Follow the [link](https://github.com/openvinotoolkit/operator/blob/main/docs/notebook_in_rhods.md) to execute the sample Jupyter Notebook There are 60+ sample notebooks available with this notebook image. For the details on the notebooks with Intel Data Center GPU, please check this [link](https://github.com/openvinotoolkit/openvino_notebooks).
## Work with deployment mode
1. From the web console, click on the ModelServer option in this [link](https://github.com/openvinotoolkit/operator/blob/v1.1.0/docs/operator_installation.md) and follow the [steps](https://github.com/openvinotoolkit/operator/blob/v1.1.0/docs/modelserver.md) to start the OVMS instance.
2. To enable the Intel Data Center GPU, make sure to modify the OVMS instance options according to the screenshot below.
Expand Down
13 changes: 13 additions & 0 deletions e2e/inference/accelerator_profile.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,13 @@
apiVersion: dashboard.opendatahub.io/v1
kind: AcceleratorProfile
metadata:
name: intel-gpu-flex-series-140
spec:
displayName: Intel® Data Center GPU Flex Series 140
description: Intel Data Center GPU for inference
enabled: true
identifier: gpu.intel.com/i915
tolerations:
- effect: NoSchedule
key: gpu.intel.com/flex-140
operator: Exists

0 comments on commit c7dd264

Please sign in to comment.