Project

General

Profile

Actions

Feature #22563

closed

compute node ansible playbook to install ROCm

Added by Peter Amstutz about 1 month ago. Updated 19 days ago.

Status:
Resolved
Priority:
Normal
Assigned To:
Category:
Deployment
Target version:
Story points:
-
Release relationship:
Auto

Description

https://rocm.docs.amd.com/projects/install-on-linux/en/latest/install/install-methods/package-manager-index.html

This is pretty straightforward: get the package signing key, set up two 3rd party debian repos (one for the driver, one for the ROCm tools), then install "amdgpu-dkms" and "rocm" packages.

Apparently each ROCm version gets its own package, so "rocm" is actually just a metapackage pointing to the latest, which is called "rocmX.Y.Z" (e.g. "rocm6.3.2").

(The stated reason is to support installing multiple versions of ROCm for testing).


Files

amdrocm.log (535 KB) amdrocm.log Brett Smith, 02/17/2025 06:37 PM

Subtasks 2 (0 open2 closed)

Task #22572: Review 22563-ansible-rocmResolvedPeter Amstutz02/17/2025Actions
Task #22601: Review 22563-rocm-disk-sizeResolvedPeter Amstutz02/24/2025Actions

Related issues 2 (0 open2 closed)

Related to Arvados - Support #22562: Test running CUDA tordo with updated pinsResolvedBrett SmithActions
Related to Arvados - Feature #21926: AMD ROCm GPU supportResolvedPeter AmstutzActions
Actions

Also available in: Atom PDF