[Feature]: Sharing without restrictions #66

sfxworks · 2024-05-23T12:39:17Z

Suggestion Description

I've got a W6800 that is awesome, but I'm stuck dedicating it to just one of: my PhotoPrism server for encoding acceleration, LocalAI for AI resources, or ffmpeg jobs for one-time encoding of raw footage. A GPU as big as this can be shared. Of course, if I don't manage it properly on my end, apps can crash, but 32 GB is a LOT of headroom for one container.

I want the ability to assign more than one workload to this GPU. Bonus points if there is a way to do memory management, but that's not required at all.

Operating System

Arch Linux

GPU

W6800

ROCm Component

No response

judahrand · 2024-11-03T15:02:17Z

This is also really useful when using the GPU for light tasks like transcoding. The Intel GPU Device Plugin has had this feature for a really long time via its sharedDevNum option.
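For anyone unfamiliar with the idea, here is a rough sketch of what an option like sharedDevNum amounts to conceptually: each physical GPU is advertised to the scheduler as several schedulable slots, so more than one pod can be assigned to it. This is not the Intel or AMD plugin's actual code; the `VirtualDevice` type, the `advertise_shared` function, and the `card0-0` style ID format are all hypothetical names made up for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class VirtualDevice:
    device_id: str     # ID advertised to the scheduler (hypothetical format)
    physical_gpu: str  # physical GPU that backs this slot


def advertise_shared(physical_gpus, shared_dev_num):
    """Expand each physical GPU into `shared_dev_num` schedulable slots.

    Illustration only: a real device plugin would register these slots
    with the kubelet; here we just build the list.
    """
    if shared_dev_num < 1:
        raise ValueError("shared_dev_num must be at least 1")
    return [
        VirtualDevice(device_id=f"{gpu}-{slot}", physical_gpu=gpu)
        for gpu in physical_gpus
        for slot in range(shared_dev_num)
    ]


# One physical card exposed as three slots, e.g. one each for
# PhotoPrism, LocalAI, and an ffmpeg job.
slots = advertise_shared(["card0"], shared_dev_num=3)
for s in slots:
    print(s.device_id, "->", s.physical_gpu)
```

Note that in this sketch every slot maps to the same physical GPU with no isolation, so all workloads share the same memory. That matches the "sharing without restrictions" request: keeping the total within what the card can hold is left to the user.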

agelwarg · 2024-11-27T18:02:28Z

Is there any other way to do this? I'm using sharedDevNum for an Intel GPU, as @judahrand mentioned above, but I'm looking for a way to do something similar with this AMD device plugin.

sfxworks · 2024-11-30T19:02:49Z

Nvidia documents GPU sharing in their device plugin (https://github.com/NVIDIA/k8s-device-plugin?tab=readme-ov-file#shared-access-to-gpus), but I really want to stay as close to team red as possible. Currently I'm having to run things directly on the node, without containers.

Goorzhel mentioned this issue Aug 13, 2024

k3s: GPU Passthrough for Nvidia / AMD / Intel NixOS/nixpkgs#288037

Open
