Bug #10729

open

[Crunch2] Propagate error messages if sbatch command succeeds but crunch-run can't run (or can't log to the Arvados API)

Added by Tom Clegg over 7 years ago. Updated about 2 months ago.

Status: New
Priority: Normal
Assigned To: -
Category: Crunch
Target version:
Story points: -
Release:
Release relationship: Auto

Description

Currently the only way to see what happened in these cases is by looking at slurm-XXXXX.out on the compute nodes, where XXXXX is the slurm job number. This is inconvenient or impossible for sysadmins, and impossible for regular users.

Example scenario: crunch-run is not installed on the compute node.
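A minimal sketch of the two pieces of information involved, offered as an illustration rather than as the Arvados implementation: where the Slurm output file is expected to be, and how a wrapper could detect the example scenario before trying to start crunch-run. The function names `slurm_out_path`, `tail_of_slurm_out`, and `preflight_error` are hypothetical. The sketch assumes Slurm's default output file name (`slurm-<jobid>.out` in the submission directory) and deliberately leaves out how the message would be sent on to the user.

```python
import os
import shutil


def slurm_out_path(work_dir, job_id):
    """Build the path of the job's output file.

    Assumes Slurm's default naming, slurm-<jobid>.out, and that the
    file is written under work_dir (an assumption of this sketch).
    """
    return os.path.join(work_dir, "slurm-%d.out" % job_id)


def tail_of_slurm_out(work_dir, job_id, max_bytes=4096):
    """Return up to the last max_bytes of the output file as text.

    Returns None if the file does not exist, for example because the
    job never started or the file lives on a different node.
    """
    path = slurm_out_path(work_dir, job_id)
    try:
        with open(path, "rb") as f:
            f.seek(0, os.SEEK_END)
            size = f.tell()
            f.seek(max(0, size - max_bytes))
            return f.read().decode("utf-8", errors="replace")
    except FileNotFoundError:
        return None


def preflight_error(command="crunch-run"):
    """Return a readable error if command is not on PATH, else None.

    This covers the example scenario: crunch-run is not installed
    on the compute node.
    """
    if shutil.which(command) is None:
        return "%s: command not found on this node" % command
    return None
```

A dispatcher or wrapper script could call `preflight_error()` on the compute node and, when it returns a message, attach that message (or the text from `tail_of_slurm_out`) to whatever record the user can see, so that nobody has to log in to the node.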


Related issues

Related to Arvados - Bug #10700: [Crunch2] crunch-dispatch-slurm pileup (Resolved, Tom Clegg, 01/27/2017)
Related to Arvados - Bug #11148: [Crunch2] Propagate dispatch error messages (e.g., sbatch fails) to user via logs/websocket (New)