[pve-devel] applied: [PATCH manager stable-7] pve7to8: add check for nvidia-vgpu-mgr

Thomas Lamprecht t.lamprecht at proxmox.com
Mon Jun 12 17:15:07 CEST 2023


On 12/06/2023 at 12:00, Dominik Csapak wrote:
> Currently the NVIDIA vGPU host driver (15.2) does not support kernels newer
> than 6.0 and thus will not work with Bookworm-based releases for now.
> 
> Fail when the service is running, and warn if it only exists, but is
> disabled/stopped (in case a user installed it sometime but did not need
> it and disabled it).
> 
> In any case, link to the known issues section in the upgrade guide
> (which we can update to contain up-to-date information).
> 
> Signed-off-by: Dominik Csapak <d.csapak at proxmox.com>
> ---
> I opted to not parse more specific information about the driver (like
> version, etc.) since it increases the complexity of the check but
> without any real upside currently. If there is some future version that
> supports it, we can update that to only warn/error for not supported
> versions.
> 
> I'll add the section to the upgrade guide shortly
> 
>  PVE/CLI/pve7to8.pm | 22 ++++++++++++++++++++++
>  1 file changed, 22 insertions(+)
> 
>

applied, thanks!

But I made some follow-ups:

- fix typo and factor common message into single variable

- pass the suppress_stderr param to get_systemd_unit_state to avoid an ugly message
  on unaffected systems, such as:
  "Failed to get unit file state for nvidia-vgpu-mgr.service: No such file or directory"

- downgraded the failure to a warning again, reversing my initial recommendation to you,
  mostly for future-proofing: if NVIDIA fixes this, we would otherwise need to tell users
  to ignore a failure, which is not good. My bad for not thinking of this earlier.
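
For readers without the tree at hand, the check logic after these follow-ups can be
sketched roughly as below. This is a Python illustration only, not the Perl code in
PVE/CLI/pve7to8.pm: the names systemctl_query and classify, the message wording, and
the "pass" level are assumptions made for the example. It assumes that
`systemctl is-enabled` prints nothing (or "not-found") on stdout when the unit does
not exist, and that the error text goes to stderr, which is discarded here.

```python
import subprocess

UNIT = "nvidia-vgpu-mgr.service"

# Common message kept in one variable (wording is hypothetical).
MSG = ("NVIDIA vGPU service found, possibly not compatible with newer kernels; "
       "see the known issues section of the upgrade guide.")


def systemctl_query(verb, unit, suppress_stderr=True):
    """Run `systemctl <verb> <unit>` and return its stdout, stripped.

    With suppress_stderr set, errors such as "Failed to get unit file state"
    are discarded instead of being shown to the user.
    """
    stderr = subprocess.DEVNULL if suppress_stderr else None
    try:
        proc = subprocess.run(
            ["systemctl", verb, unit],
            stdout=subprocess.PIPE,
            stderr=stderr,
            text=True,
        )
    except OSError:
        # systemctl itself is unavailable; treat the unit as absent.
        return ""
    return proc.stdout.strip()


def classify(enabled_state, active_state):
    """Map the unit's states to a (level, message) pair.

    Both the running and the merely installed case yield a warning,
    never a failure.
    """
    if enabled_state in ("", "not-found"):
        return ("pass", "No NVIDIA vGPU service found.")
    if active_state == "active":
        return ("warn", "Running " + MSG)
    return ("warn", MSG)


if __name__ == "__main__":
    level, text = classify(
        systemctl_query("is-enabled", UNIT),
        systemctl_query("is-active", UNIT),
    )
    print(f"{level.upper()}: {text}")
```

Keeping classify free of side effects means the warn-instead-of-fail decision can be
exercised without a systemd host.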




