Feature #5510

[DRAFT] [Crunch] User can configure task retries

Added by Peter Amstutz almost 4 years ago. Updated almost 4 years ago.

Status:
New
Priority:
Normal
Assigned To:
-
Category:
Crunch
Target version:
Start date:
Due date:
% Done:

0%

Estimated time:
Story points:
-

Description

For some of our long running jobs, we're finding that we get new types of transient failures that are not identified as "temporary" failure even if they would probably work if retried. We should add a job field indicating a "minimum number of retries", which will be honored even for "permanent" failures.

Alternately, an even simpler solution would be to add a flag which causes all failures to be treated as "temporary" for the purposes of retry. Question: should this behavior be default?

History

#1 Updated by Peter Amstutz almost 4 years ago

  • Description updated (diff)

#2 Updated by Peter Amstutz almost 4 years ago

  • Subject changed from [Crunch] Set minimum number of task retries to [Crunch] User can configure task retries
  • Description updated (diff)

#3 Updated by Brett Smith almost 4 years ago

  • Subject changed from [Crunch] User can configure task retries to [DRAFT] [Crunch] User can configure task retries
  • Category set to Crunch
  • Target version set to Arvados Future Sprints

Also available in: Atom PDF