Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

NFD Feature Rules for GPU type aren't deployed #193

Closed
brgavino opened this issue Jan 24, 2024 · 3 comments
Closed

NFD Feature Rules for GPU type aren't deployed #193

brgavino opened this issue Jan 24, 2024 · 3 comments

Comments

@brgavino
Copy link

Per https://github.com/intel/intel-device-plugins-for-kubernetes/blob/main/cmd/gpu_plugin/labels.md ,

Nodes can label the cards that are available using these rules: https://github.com/intel/intel-device-plugins-for-kubernetes/blob/main/deployments/nfd/overlays/node-feature-rules/platform-labeling-rules.yaml

This should probably be installed by default since this is via the OpenShift Operator deployment; otherwise admins have to create the rules themselves.

@brgavino
Copy link
Author

Here's a version that works, without the "extendedResources" tag

platform-labeling-rules.yaml.txt

@uMartinXu
Copy link
Contributor

@brgavino Thanks for your suggestion. :-)

To align with RH OpenShift AI Dashboard Accelerator Profile, the taints/tolerance mechanism will be used by us to support the Heterogenous(different type of) Intel dGPU product.

We are working on a related solution for the user.

And as cluster administrators, they can also pick the other way like the labelling you mentioned and use them with the nodeselect.

@uMartinXu
Copy link
Contributor

duplcated with #216

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants