Project

General

Profile

Actions

Bug #8805

closed

[Crunch] os.walk doing recursive copy early in a Crunch script causes a silent exit 1

Added by Sarah Guthrie about 8 years ago. Updated over 4 years ago.

Status:
Closed
Priority:
Normal
Assigned To:
-
Category:
-
Target version:
-
Story points:
-

Description

Two jobs, identical in inputs, both on qr1hi, in quick succession (on the same node) failed at different points in the process of running a crunch script.

They both go through a directory, using os.walk, copying everything to a temporary directory.

I used excessive logging to figure out that both jobs were failing at different points, despite having the same inputs, crunch_script version, docker image, and compute node.

This behavior was not observed when using subprocess.check_call(['cp', '-r'])


Files

8805walk.py (1.27 KB) 8805walk.py Brett Smith, 03/28/2016 06:28 PM

Related issues

Related to Arvados - Feature #8801: [Crunch] log free disk space before task startsDuplicateActions
Actions

Also available in: Atom PDF