@lissyx Thank you for printing out the crux of the issue. We are in the process of rearchitecting the nvidia container stack in such a way that issues such as this should not exist in the future (because we will rely on runc (or whatever the configured container runtime is) to do all cgroup setup instead of doing it ourselves).. That said, this rearchitecting effort will take at least …
14.10.2020 · Recently, I test containerd+nvidia-container-runtime in kernal 5.4 and cgroupv2, but I find nvidia-container-cli can not run successfully because of …
NVIDIA Container Toolkit Introduction The NVIDIA Container Toolkit allows users ... driver and Docker 19.03 for your Linux distribution Note that you do not ...
20.12.2021 · So i use method2 from THIS POST, which is bypass cgroups option. When using nvidia-container-runtime or nvidia-container-toolkit with cgroup option, it automatically allocate machine resource for the container. So when bypass this option, you gotta allocate resource by your own. Here's an example A single docker run
12.12.2021 · Did something that works for now: I clean built every package (libnvidia, container runtime, container toolkit, docker). Then changed the kernel paramaters.Since hijacking the proc/cmdline didn't exactly work, I looked for the alternate ways specified in the kernel parameters wiki.. Since I use refind, I added the systemd param using the refind menu.
Dec 06, 2021 · Did something that works for now: I clean built every package (libnvidia, container runtime, container toolkit, docker). Then changed the kernel paramaters.Since hijacking the proc/cmdline didn't exactly work, I looked for the alternate ways specified in the kernel parameters wiki.