Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implement API for scanning GPU allocation map #3327

Open
jopemachine opened this issue Dec 30, 2024 — with Lablup-Issue-Syncer · 0 comments · May be fixed by #2273
Open

Implement API for scanning GPU allocation map #3327

jopemachine opened this issue Dec 30, 2024 — with Lablup-Issue-Syncer · 0 comments · May be fixed by #2273
Assignees
Labels
comp:agent Related to Agent component

Comments

@jopemachine
Copy link
Member

Using the fGPU feature can lead to GPU device fragmentation.

While such fragmentation is inevitable, administrators must at least be able to see how GPU devices are actually allocated among agents. Currently, some of our customers are frustrated because they encounter resource shortage errors during session creation, even though sufficient fGPU resources are actually available.

Therefore, it is necessary to implement an API that allows administrators to monitor GPU fragmentation status.

@jopemachine jopemachine linked a pull request Jan 2, 2025 that will close this issue
3 tasks
@jopemachine jopemachine added the comp:agent Related to Agent component label Jan 2, 2025
@jopemachine jopemachine self-assigned this Jan 2, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
comp:agent Related to Agent component
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant