Project

General

Profile

Actions

Bug #4967

closed

[Crunch] Doesn't cope well with FUSE mounts left hanging around after killing tasks with SIGKILL

Added by Bryan Cosca almost 10 years ago. Updated almost 10 years ago.

Status:
Resolved
Priority:
Normal
Assigned To:
Category:
Crunch
Target version:
Story points:
0.5

Description

Taken from qr1hi-8i9sb-ve5v94njtcw66yw. I re-ran the job and it seemed to work fine, I just wanted to bring it to attention.

1/12/2015 2:16:09 PM compute16 1 task-print 0 fuse: failed to open mountpoint for reading: Transport endpoint is not connected
1/12/2015 2:16:09 PM compute16 1 task-dispatch 0 srun: error: compute16: task 0: Exited with exit code 1
1/12/2015 2:16:09 PM compute16 1 task-print 0 Traceback (most recent call last):
1/12/2015 2:16:09 PM compute16 1 task-print 0 File "/usr/local/bin/arv-mount", line 149, in <module>
1/12/2015 2:16:09 PM compute16 1 task-print 0 llfuse.init(operations, args.mountpoint, opts)
1/12/2015 2:16:09 PM compute16 1 task-print 0 File "fuse_api.pxi", line 153, in llfuse.init (src/llfuse.c:17409)
1/12/2015 2:16:09 PM compute16 1 task-print 0 RuntimeError: fuse_mount failed
1/12/2015 2:16:09 PM compute16 1 task-dispatch 0 child 22088 on compute16.1 exit 1 success=
1/12/2015 2:16:09 PM compute16 1 task-dispatch 0 failure (#1, permanent) after 1 seconds


Subtasks 1 (0 open1 closed)

Task #5039: Review 4967-crunch-mount-cleanup-wipResolvedWard Vandewege01/21/2015Actions

Related issues

Related to Arvados - Feature #5036: [arv-mount] Change default mount type from "fuse" to "fuse.arvados"Closed01/20/2015Actions
Has duplicate Arvados - Bug #4970: [Crunch] Cannot create directory `/tmp/crunch-job/task/compute14.1.keep': File existsResolved01/12/2015Actions
Has duplicate Arvados - Bug #5046: Jobs failing to start. Logs show "rm:cannot remove `/tmp/crunch-job/task/compute19.1.keep': Is a directory"Closed01/21/2015Actions
Actions

Also available in: Atom PDF