Feature #5510

[DRAFT] [Crunch] User can configure task retries

Added by Peter Amstutz about 9 years ago. Updated almost 3 years ago.

Status: Closed
Priority: Normal
Assigned To: -
Category: Crunch
Target version: -
Story points: -

Description

For some of our long-running jobs, we are seeing new kinds of transient failures that are not classified as "temporary" failures, even though the task would probably succeed if retried. We should add a job field specifying a "minimum number of retries", which will be honored even for "permanent" failures.

Alternatively, an even simpler solution would be to add a flag that causes all failures to be treated as "temporary" for the purposes of retry. Question: should this behavior be the default?
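
To make the two proposals concrete, here is a minimal Python sketch of how a retry decision might combine them. It is illustrative only and not Crunch's actual code: the names `RetryPolicy`, `min_retries`, `treat_all_as_temporary`, `max_temporary_retries`, and `should_retry` are all hypothetical, and the default limit of 3 is an assumed placeholder, not a documented Crunch value.

```python
from dataclasses import dataclass


@dataclass
class RetryPolicy:
    # Hypothetical existing limit on retries for "temporary" failures.
    max_temporary_retries: int = 3
    # Proposal 1: retries honored even for "permanent" failures.
    min_retries: int = 0
    # Proposal 2: treat every failure as "temporary" for retry purposes.
    treat_all_as_temporary: bool = False


def should_retry(policy: RetryPolicy, retries_so_far: int,
                 failure_is_temporary: bool) -> bool:
    """Decide whether a failed task should be retried.

    retries_so_far is the number of retries already made for this task
    (0 right after the first failure).
    """
    # The minimum retry count applies regardless of failure type.
    if retries_so_far < policy.min_retries:
        return True
    # Otherwise, only failures counted as temporary are retried,
    # up to the usual limit.
    if failure_is_temporary or policy.treat_all_as_temporary:
        return retries_so_far < policy.max_temporary_retries
    return False
```

Under these assumptions, `RetryPolicy(min_retries=2)` would retry a "permanent" failure twice before giving up, while `RetryPolicy(treat_all_as_temporary=True)` would retry it up to the same limit as a temporary failure. Making the flag the default would amount to changing the default value of `treat_all_as_temporary` to `True`.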

Actions #1

Updated by Peter Amstutz about 9 years ago

  • Description updated (diff)
Actions #2

Updated by Peter Amstutz about 9 years ago

  • Subject changed from [Crunch] Set minimum number of task retries to [Crunch] User can configure task retries
  • Description updated (diff)
Actions #3

Updated by Brett Smith about 9 years ago

  • Target version set to Arvados Future Sprints
  • Subject changed from [Crunch] User can configure task retries to [DRAFT] [Crunch] User can configure task retries
  • Category set to Crunch
Actions #4

Updated by Ward Vandewege almost 3 years ago

  • Target version deleted (Arvados Future Sprints)
Actions #5

Updated by Peter Amstutz almost 3 years ago

  • Status changed from New to Closed