Feature #5064
closed
[Crunch] Automatically restart jobs after internal/intermittent errors
Assigned To:
Target version:
Story points:
In a customer meeting, the customer asked: if a pipeline crashes halfway through because of a node failure or some other internal failure unrelated to their code, will they have to wait until the next morning to rerun the pipeline, or will Arvados try to restart it from that point?
Automatic restarts would save informaticians a lot of time and disappointment. They often start a job before going to sleep, and if it fails overnight, they do not want to lose a whole day restarting it. They would want Arvados to take care of the restart itself.
Updated by Brett Smith about 10 years ago
- Subject changed from Automatic restart on jobs if the error was internal to [Crunch] Automatically restart jobs after internal/intermittent errors
- Category set to Crunch
- Target version set to Arvados Future Sprints
Updated by Peter Amstutz about 5 years ago
- Target version deleted (Arvados Future Sprints)
- Status changed from New to Resolved