Project

General

Profile

Actions

Bug #21302

open

Container created with corrupted mounts

Added by Brett Smith 5 months ago. Updated about 2 months ago.

Status:
New
Priority:
Normal
Assigned To:
-
Category:
API
Target version:
Story points:
-

Description

A user's container request failed to dispatch because the container created for it had corrupted mounts, so Crunch could not dispatch it. There were ~2,250 mounts of individual files. This container was near the end of a workflow, so all its inputs came from a single collection, the output of a previous step.

The mounts were corrupted in different ways:

  • Some had their path changed, so the mount for a given file would be backed by a different file.
  • Some became the empty array [].
  • Some became the literal object {"for": "to_hash"}.

We haven't yet identified a pattern for which mounts got corrupted which way.

We did check the database record and confirmed the mounts were corrupted in the container's mounts column. This suggests that the problem happened during container creation, and isn't an issue with, e.g., serializing the record to JSON.

See child ticket for supporting documentation.


Subtasks


Related issues

Related to Arvados - Bug #21314: a-d-c should cancel a container if it can't be loadedNewTom CleggActions
Actions #1

Updated by Tom Clegg 4 months ago

  • Related to Bug #21314: a-d-c should cancel a container if it can't be loaded added
Actions #2

Updated by Peter Amstutz 3 months ago

  • Tracker changed from Idea to Bug
Actions #3

Updated by Peter Amstutz about 2 months ago

  • Target version changed from To be scheduled to Future
Actions

Also available in: Atom PDF