Project

General

Profile

Actions

Feature #8018

closed

[Crunch2] Identify container failure and retry

Added by Peter Amstutz over 8 years ago. Updated over 7 years ago.

Status:
Resolved
Priority:
Normal
Assigned To:
Category:
-
Target version:
Story points:
1.0
Release:
Release relationship:
Auto

Description

When a container goes into cancelled state, create a new duplicate container when these conditions are met by at least one container request:

  • request is in "committed" state (not uncommitted or finalized)
  • request has priority > 0
  • request container_count < container_count_max

Add a new column, container_count. Each time a new container is assigned to a committed container request, increment container_count.


Subtasks 3 (0 open3 closed)

Task #10062: Review 8018-container-retryResolvedRadhika Chippada10/05/2016Actions
Task #10064: Retry on container cancelledResolvedPeter Amstutz09/28/2016Actions
Task #10063: Determine how to count retry attemptsResolvedPeter Amstutz09/23/2016Actions

Related issues

Related to Arvados - Feature #14706: [Crunch2] Retain references + permissions to earlier containers when retrying a container requestResolvedActions
Actions

Also available in: Atom PDF