Bug #4314

closed

[Crunch] Figure out why this job was marked Failed unexpectedly

Added by Bryan Cosca over 9 years ago. Updated over 9 years ago.

Status: Resolved
Priority: Normal
Assigned To:
Category: -
Target version:
Story points: 0.5

Description

Examples: qr1hi-8i9sb-cha7spydhjauvvq, qr1hi-8i9sb-w5hjmuq7vicng11, qr1hi-8i9sb-vt7mb676a4htd6k

These jobs appear to have one error in common: "error: Unable to allocate resources: Requested nodes are busy". Ward said that two crunch dispatchers were running at the same time, and that shutting one of them down seemed to fix it.
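The duplicate-dispatcher situation described above is the kind of problem an exclusive lock file is commonly used to prevent. The following is a minimal sketch of that general technique, not the crunch-dispatch implementation: the function name `try_acquire_single_instance_lock` and the lock file path are hypothetical, and it assumes a POSIX system where `fcntl.flock` is available.

```python
import fcntl
import os
import tempfile


def try_acquire_single_instance_lock(lock_path):
    """Try to take an exclusive, non-blocking lock on lock_path.

    Returns the open file object (which holds the lock for as long as it
    stays open), or None if some other holder already has the lock.
    Hypothetical helper for illustration only.
    """
    lock_file = open(lock_path, "a")
    try:
        # LOCK_NB makes flock fail immediately instead of waiting.
        fcntl.flock(lock_file, fcntl.LOCK_EX | fcntl.LOCK_NB)
    except BlockingIOError:
        lock_file.close()
        return None
    return lock_file


if __name__ == "__main__":
    # Hypothetical lock path; a real service would pick a fixed location.
    path = os.path.join(tempfile.mkdtemp(), "example-dispatcher.lock")
    first = try_acquire_single_instance_lock(path)
    second = try_acquire_single_instance_lock(path)
    print("first dispatcher got lock:", first is not None)
    print("second dispatcher got lock:", second is not None)
    if first is not None:
        first.close()  # closing the file releases the lock
```

With a guard like this, a second dispatcher started by mistake would exit at startup instead of competing with the first one for the same nodes.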

When the jobs end, they usually hit a 403 permission error and cannot write their output to Keep.


Subtasks 3 (0 open, 3 closed)

Task #4373: Diagnose and fix (Resolved, Peter Amstutz, 10/24/2014)
Task #4457: Review 4314-crunch-token-expire (Resolved, 10/24/2014)
Task #4718: Review 4314-trigger-job-update (Resolved, Radhika Chippada, 10/24/2014)

Related issues

Related to Arvados - Bug #4310: [Crunch] crunch-dispatch --jobs locking is broken (Resolved, Peter Amstutz, 11/06/2014)
Related to Arvados - Bug #4334: [Crunch] crunch-dispatch should not allocate Jobs to nodes in the idle* SLURM state (Resolved, Peter Amstutz, 10/28/2014)