Support #17271

document that compute nodes need to use cgroupsv1 / cgroupsfs for Docker

Added by Peter Amstutz 9 months ago. Updated 6 months ago.

Status:
Resolved
Priority:
Normal
Assigned To:
Category:
Documentation
Target version:
Start date:
01/26/2021
Due date:
% Done:

100%

Estimated time:
(Total: 0.00 h)
Story points:
-
Release relationship:
Auto

Description

If you have the following versions:

Linux kernel > 4.x
systemd > 247 (in the Debian family)
Docker > 20.10

it is very likely that you have cgroups v2 enabled by default. You can check by running:

if [ -e /sys/fs/cgroup/cgroup.controllers ] 
then 
  echo "cgroupsV2 enabled" 
fi
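
The check above prints nothing when cgroups v2 is not active. A slightly fuller sketch that reports both cases is shown here; the function name `cgroup_mode` and its optional directory argument are illustrative only (they are not part of Arvados, Docker, or systemd), and "v1" in its output also covers the hybrid layout, where the unified hierarchy is not mounted at the top level.

```shell
#!/bin/sh
# cgroup_mode [ROOT]: report which cgroup layout is mounted at ROOT.
# ROOT defaults to /sys/fs/cgroup; the argument is a hypothetical convenience
# so the function can be tried against a scratch directory.
cgroup_mode() {
  root="${1:-/sys/fs/cgroup}"
  if [ ! -d "$root" ]; then
    # Nothing is mounted where we looked, so we cannot tell.
    echo "unknown"
  elif [ -e "$root/cgroup.controllers" ]; then
    # cgroup.controllers exists at the root of a cgroup v2 hierarchy.
    echo "v2"
  else
    # Legacy or hybrid layout: no unified hierarchy at the top level.
    echo "v1"
  fi
}
```

Running `cgroup_mode` with no argument on a compute node should print `v1` once the change described below has been applied and the node has been rebooted.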

Unfortunately, Arvados 2.1 doesn't support cgroups v2, so compute nodes need to have it disabled.
This can be done by adding `systemd.unified_cgroup_hierarchy=0` to the kernel parameters.

Debian / Ubuntu:

In /etc/default/grub, make sure that you have:
GRUB_CMDLINE_LINUX="systemd.unified_cgroup_hierarchy=0"

Then regenerate the GRUB configuration:
update-grub
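
If GRUB_CMDLINE_LINUX already carries other options, overwriting the whole line would drop them. The following is a minimal sketch of appending the parameter instead, assuming the value is written in double quotes on a single line; the function name `add_cgroup_v1_param` is hypothetical and the function only edits the file it is given.

```shell
#!/bin/sh
# add_cgroup_v1_param FILE: append systemd.unified_cgroup_hierarchy=0 to the
# GRUB_CMDLINE_LINUX="..." line in FILE unless it is already there.
# Assumes a double-quoted, single-line value (an assumption of this sketch).
add_cgroup_v1_param() {
  file="$1"
  param="systemd.unified_cgroup_hierarchy=0"

  # Already present: nothing to do, so repeated runs are harmless.
  if grep '^GRUB_CMDLINE_LINUX=' "$file" | grep -qF "$param"; then
    return 0
  fi

  if grep -q '^GRUB_CMDLINE_LINUX="' "$file"; then
    tmp=$(mktemp)
    # First expression: insert the parameter before the closing quote.
    # Second expression: drop the leading space left when the value was empty.
    sed -e "s/^\(GRUB_CMDLINE_LINUX=\"\)\(.*\)\"/\1\2 $param\"/" \
        -e "s/^\(GRUB_CMDLINE_LINUX=\"\) /\1/" "$file" > "$tmp" \
      && cat "$tmp" > "$file"
    rm -f "$tmp"
  else
    # No such line yet: add one.
    echo "GRUB_CMDLINE_LINUX=\"$param\"" >> "$file"
  fi
}
```

A typical use would be `add_cgroup_v1_param /etc/default/grub` as root, followed by `update-grub` and a reboot.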

Fedora 31:

dnf install -y grubby
grubby --update-kernel=ALL --args="systemd.unified_cgroup_hierarchy=0"
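
Kernel parameters only take effect after a reboot. On either distribution, one way to confirm that the node actually booted with the parameter is to look at the kernel command line. This is a sketch; the function name `booted_with_cgroup_v1` and its optional file argument are illustrative and not part of any tool mentioned here.

```shell
#!/bin/sh
# booted_with_cgroup_v1 [CMDLINE_FILE]: succeed (exit status 0) if the kernel
# command line contains systemd.unified_cgroup_hierarchy=0 as a whole word.
# CMDLINE_FILE defaults to /proc/cmdline; the argument is a hypothetical
# convenience for trying the function on a sample file.
booted_with_cgroup_v1() {
  cmdline_file="${1:-/proc/cmdline}"
  # Put each parameter on its own line, then require an exact match so that
  # a value such as ...=01 or a longer parameter name does not count.
  tr -s '[:space:]' '\n' < "$cmdline_file" \
    | grep -qxF 'systemd.unified_cgroup_hierarchy=0'
}
```

For example, `booted_with_cgroup_v1 && echo "cgroups v1 requested at boot"` prints the message only when the running kernel was started with the parameter.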


Subtasks

Task #17280: Review 17271-doc-cgroupv2 (Resolved, Nico César)

Associated revisions

Revision fc353107
Added by Nico Cesar 9 months ago

Merge branch '17271-doc-cgroupv2'

closes #17271

Arvados-DCO-1.1-Signed-off-by: Nico Cesar <>

History

#1 Updated by Peter Amstutz 9 months ago

  • Category set to Documentation

#2 Updated by Peter Amstutz 9 months ago

  • Assigned To set to Nico César

#3 Updated by Nico César 9 months ago

  • Description updated (diff)

#5 Updated by Nico César 9 months ago

  • Status changed from New to In Progress

#6 Updated by Lucas Di Pentima 9 months ago

I think it would be useful to explain why cgroups v2 accounting should be turned off, and whether having it enabled makes the system fail completely or only stops some specific feature from working properly.
I'm thinking that an admin of a preexisting Slurm cluster who is looking to install Arvados on it may wonder whether this is a hard requirement, or whether they could live with the limitations of keeping cgroups v2 turned on in case it's required by other cluster services.

Other than that, it LGTM.

#7 Updated by Nico César 9 months ago

addressing #note-6 4091da044 17271-doc-cgroupv2

#8 Updated by Anonymous 9 months ago

  • % Done changed from 0 to 100
  • Status changed from In Progress to Resolved

#9 Updated by Peter Amstutz 6 months ago

  • Release set to 38
