Project

General

Profile

Actions

Bug #20616

closed

"cgroup stats files never appeared" on scale cluster

Added by Peter Amstutz 11 months ago. Updated 10 months ago.

Status:
Duplicate
Priority:
Normal
Assigned To:
Category:
Crunch
Story points:
-

Description

We built a new compute AMI with a newer Debian cloud release, and all of a sudden crunchstat stopped working.

eg

https://scale-4zz18-z33wfnfdga5wroy.collections.scale.arvadosapi.com/log%20for%20container%20scale-dz642-c9o553ezb91wbfl/crunchstat.txt

Subtasks 1 (0 open1 closed)

Task #20630: Review (no new code for this -- see main story's comment)ResolvedBrett Smith07/05/2023Actions

Related issues

Related to Arvados - Bug #17244: Make sure cgroupsV2 works with ArvadosResolvedTom Clegg07/18/2023Actions
Actions #1

Updated by Peter Amstutz 11 months ago

  • Description updated (diff)
Actions #2

Updated by Peter Amstutz 11 months ago

  • Description updated (diff)
Actions #3

Updated by Peter Amstutz 11 months ago

  • Related to Bug #17244: Make sure cgroupsV2 works with Arvados added
Actions #4

Updated by Peter Amstutz 11 months ago

  • Subject changed from "cgroup stats files never appeared" due to cgroups v2? to "cgroup stats files never appeared" on scale cluster
Actions #5

Updated by Peter Amstutz 11 months ago

  • Assigned To set to Lucas Di Pentima
Actions #6

Updated by Lucas Di Pentima 11 months ago

  • Status changed from New to In Progress
Actions #7

Updated by Lucas Di Pentima 11 months ago

Status update: compute nodes with the same base AMI on tordo work just fine. No sure why this happens.

Actions #8

Updated by Peter Amstutz 10 months ago

  • Target version changed from Development 2023-06-21 sprint to Development 2023-07-05 sprint
Actions #9

Updated by Peter Amstutz 10 months ago

  • Tracker changed from Feature to Bug
Actions #11

Updated by Peter Amstutz 10 months ago

  • Target version changed from Development 2023-07-05 sprint to Development 2023-07-19 sprint
Actions #12

Updated by Lucas Di Pentima 10 months ago

Re-created the compute node AMI for scale using the fixes from #20707 and did a test run. We now have crunchstat data: scale-4zz18-dq47remmgoco8oh.

There's no new code to review in this ticket, just using the compute image build script from the other one.

Actions #13

Updated by Lucas Di Pentima 10 months ago

  • Status changed from In Progress to Duplicate
Actions

Also available in: Atom PDF