Bug #11148

open

[Crunch2] Propagate dispatch error messages (e.g., sbatch fails) to user via logs/websocket

Added by Tom Clegg about 7 years ago. Updated about 2 months ago.

Status:
New
Priority:
Normal
Assigned To:
-
Category:
Crunch
Target version:
Story points:
-
Release:
Release relationship:
Auto

Description

Background

If crunch-dispatch-slurm encounters an error while trying to submit a container to the slurm queue, it logs a message to stderr, where the user cannot see it. If the error persists, the user has no way of knowing whether anything is happening.

Fix

In addition to printing a message on stderr, crunch-dispatch-slurm should use the Arvados API to emit a log entry that is visible to the user, with these attributes:
  • object_uuid: container uuid
  • event_type: "dispatch"

Similar to source:services/crunch-run/logging.go#L183

A log should also be emitted when the container is successfully submitted to the slurm queue.


Related issues

Related to Arvados - Bug #10704: [Crunch2] sbatch submit failures not reported to use, loop forever (Resolved, Tom Clegg, 01/31/2017)
Related to Arvados - Bug #10729: [Crunch2] Propagate error messages if sbatch command succeeds but crunch-run can't run (or can't log to the Arvados API) (New)

#1

Updated by Tom Clegg about 7 years ago

  • Description updated (diff)

#2

Updated by Tom Clegg about 7 years ago

  • Description updated (diff)

#3

Updated by Tom Morris over 6 years ago

  • Target version set to Arvados Future Sprints

#4

Updated by Ward Vandewege almost 3 years ago

  • Target version deleted (Arvados Future Sprints)

#5

Updated by Peter Amstutz about 1 year ago

  • Release set to 60

#6

Updated by Peter Amstutz about 2 months ago

  • Target version set to Future