Project

General

Profile

Actions

Bug #22612

closed

CUDA install doesn't really work because headers aren't available

Added by Brett Smith about 2 months ago. Updated about 1 month ago.

Status:
Resolved
Priority:
Normal
Assigned To:
Category:
Deployment
Target version:
Story points:
-
Release relationship:
Auto

Description

The Ansible playbook to install CUDA "succeeds" but doesn't really work because this happens:

Setting up nvidia-kernel-open-dkms (560.35.05-1) ...
Loading new nvidia-current-560.35.05 DKMS files...
Building for 5.10.0-33-cloud-amd64
Module build for kernel 5.10.0-33-cloud-amd64 was skipped since the
kernel headers for this kernel does not seem to be installed.

We need to install the headers for the right kernel version. The ROCm playbook already has a recipe for this. But, I'm realizing that recipe can be buggy if the dist-upgrade early in the playbook upgrades the kernel, so, this is going to become a whole thing.


Subtasks 1 (0 open1 closed)

Task #22619: Review 22612-driver-bugfixesResolvedBrett Smith02/27/2025Actions

Related issues 1 (0 open1 closed)

Related to Arvados - Support #22562: Test running CUDA tordo with updated pinsResolvedBrett SmithActions
Actions

Also available in: Atom PDF