Feature #5510

[DRAFT] [Crunch] User can configure task retries

Added by Peter Amstutz over 4 years ago. Updated over 4 years ago.

Status: New
Priority: Normal
Assigned To: -
Category: Crunch
Target version:
Start date:
Due date:
% Done: 0%
Estimated time:
Story points: -

Description

For some of our long-running jobs, we are seeing new kinds of transient failures that are not classified as "temporary" failures, even though the tasks would probably succeed if retried. We should add a job field specifying a "minimum number of retries" that is honored even for "permanent" failures.

Alternatively, an even simpler solution would be to add a flag that causes all failures to be treated as "temporary" for the purposes of retry. Question: should this behavior be the default?
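
To make the two options easier to compare, here is a minimal Python sketch of how the retry decision might look. It is not Crunch code: `RetryPolicy`, `max_retries`, `min_retries`, `treat_all_as_temporary`, and `should_retry` are all hypothetical names invented for illustration, and the sketch assumes the dispatcher already knows whether a failure was classified as temporary.

```python
from dataclasses import dataclass


@dataclass
class RetryPolicy:
    # Hypothetical fields; none of these are existing Crunch job attributes.
    max_retries: int = 3                  # retry budget for "temporary" failures
    min_retries: int = 0                  # retries honored even for "permanent" failures
    treat_all_as_temporary: bool = False  # the simpler flag-based alternative


def should_retry(policy: RetryPolicy, retries_done: int, failure_is_temporary: bool) -> bool:
    """Decide whether a failed task gets another attempt.

    retries_done is the number of retries already performed for this task
    (0 right after the first failure).
    """
    if failure_is_temporary or policy.treat_all_as_temporary:
        # Temporary failures get the full budget; never less than min_retries.
        limit = max(policy.max_retries, policy.min_retries)
    else:
        # Permanent failures are retried only up to the guaranteed minimum.
        limit = policy.min_retries
    return retries_done < limit


if __name__ == "__main__":
    policy = RetryPolicy(max_retries=3, min_retries=1)
    # A "permanent" failure is still retried once, then given up on.
    print(should_retry(policy, 0, failure_is_temporary=False))
    print(should_retry(policy, 1, failure_is_temporary=False))
```

With the defaults above (`min_retries=0`, flag off), behavior is unchanged from treating permanent failures as final, which is one possible answer to the question of what the default should be.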

History

#1 Updated by Peter Amstutz over 4 years ago

  • Description updated

#2 Updated by Peter Amstutz over 4 years ago

  • Subject changed from [Crunch] Set minimum number of task retries to [Crunch] User can configure task retries
  • Description updated

#3 Updated by Brett Smith over 4 years ago

  • Target version set to Arvados Future Sprints
  • Subject changed from [Crunch] User can configure task retries to [DRAFT] [Crunch] User can configure task retries
  • Category set to Crunch
