Project

General

Profile

Actions

Idea #4293

closed

[Node Manager] Write off cloud nodes that spend too long in booted state

Added by Brett Smith about 10 years ago. Updated about 10 years ago.

Status:
Resolved
Priority:
Normal
Assigned To:
Category:
Node Manager
Target version:
Start date:
10/27/2014
Due date:
Story points:
1.5

Description

If the cloud has an internal error starting a node, Node Manager won't shut it down until the normal shutdown window opens. There should be a separate timer for this case: if a cloud node doesn't appear in the node listing within N minutes (configurable), assume it failed to start, and shut it down.


Subtasks 2 (0 open2 closed)

Task #4322: [Node Manager] Should not pair cloud and Arvados nodes immediately after bootingResolvedBrett Smith10/27/2014Actions
Task #4732: Review 4293-node-manager-timed-bootstrap-wipResolvedPeter Amstutz12/05/2014Actions

Related issues 1 (0 open1 closed)

Copied to Arvados - Bug #4751: [Node Manager] Can erroneously pair cloud nodes with stale Arvados node recordsResolvedBrett Smith03/02/2015Actions
Actions

Also available in: Atom PDF