Feature #6518

closed

[Crunch] [Crunch2] Dispatch containers via slurm

Added by Tom Clegg almost 9 years ago. Updated about 8 years ago.

Status: Resolved
Priority: Normal
Assigned To: Radhika Chippada
Category: Crunch
Target version:
Story points: 1.0
Release:
Release relationship: Auto

Description

When containers appear in the queue, use SLURM to execute them on worker nodes.

For now, the queue is arvados.v1.containers.queue (much like the Crunch1 job queue).
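
A minimal sketch of one pass over the queue, assuming the queue can be fetched as a list of records that each carry a "uuid" field. The names fetch_queue and submit are hypothetical callables standing in for the API call and the SLURM submission; neither is an Arvados SDK function.

```python
def dispatch_once(fetch_queue, submit, already_submitted):
    """Submit every queued container that has not been submitted yet.

    fetch_queue: hypothetical callable returning a list of dicts with a "uuid" key.
    submit: hypothetical callable taking a container UUID.
    already_submitted: set of UUIDs, updated in place.
    Returns the UUIDs submitted during this pass.
    """
    submitted_now = []
    for container in fetch_queue():
        uuid = container["uuid"]
        if uuid in already_submitted:
            continue  # already handed to SLURM on an earlier pass
        submit(uuid)
        already_submitted.add(uuid)
        submitted_now.append(uuid)
    return submitted_now
```

Keeping the fetch and submit steps as parameters lets the loop be exercised without an API server or a SLURM cluster.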

From Crunch2 dispatch:

slurm batch mode
  • Use "sinfo" to determine whether it is possible to run the container
  • Submit a batch job to the queue: "echo crunch-run --job {uuid} | sbatch -N1"
  • When container priority changes, use scontrol and scancel to propagate changes to slurm
  • Use strigger to run a cleanup script when a container exits
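
The steps above could be sketched roughly as follows. This is an illustration under stated assumptions, not the crunch-dispatch-slurm implementation: the helper names, the use of the container UUID as the SLURM job name, the rule that a priority of zero or less means "cancel", and the priority-to-nice mapping are all hypothetical. The script adds a "#!/bin/sh" line because sbatch expects a batch script to start with an interpreter line.

```python
import shlex
import subprocess


def idle_node_count(sinfo_output):
    """Sum node counts for lines whose state is exactly "idle".

    Expects the output of: sinfo --noheader --format="%D %T"
    (one "<count> <state>" pair per line).
    """
    total = 0
    for line in sinfo_output.splitlines():
        fields = line.split()
        if len(fields) == 2 and fields[1] == "idle" and fields[0].isdigit():
            total += int(fields[0])
    return total


def batch_script(container_uuid):
    # sbatch expects the script to begin with an interpreter line.
    return "#!/bin/sh\nexec crunch-run --job {}\n".format(shlex.quote(container_uuid))


def sbatch_argv(container_uuid):
    # Assumption: name the SLURM job after the container so it can be found later.
    return ["sbatch", "-N1", "--job-name=" + container_uuid]


def submit(container_uuid, run=subprocess.run):
    # The script is passed on stdin, as in the "echo ... | sbatch" form.
    return run(sbatch_argv(container_uuid),
               input=batch_script(container_uuid), text=True, check=True)


def priority_change_argv(container_uuid, slurm_job_id, priority, max_priority=1000):
    # Assumption: priority <= 0 means the container should be cancelled.
    if priority <= 0:
        return ["scancel", "--name=" + container_uuid]
    # Assumption: a higher container priority maps to a lower nice value.
    nice = max(0, max_priority - priority)
    return ["scontrol", "update", "JobId=" + str(slurm_job_id), "Nice=" + str(nice)]
```

The functions only build argument lists, except submit, which accepts a replacement for subprocess.run so it can be tried without SLURM installed.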

The cleanup script only has to handle cases such as the node dying before crunch-run has a chance to update the container record to state="Complete".
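
One way the cleanup decision might look, as a sketch only. The "Cancelled" state, the choice to mark an unfinished container as "Cancelled", and both function names are assumptions made for illustration; the ticket itself only mentions state="Complete".

```python
# "Complete" comes from the description; "Cancelled" is an assumed final state.
FINAL_STATES = {"Complete", "Cancelled"}


def strigger_argv(slurm_job_id, cleanup_program):
    # Register cleanup_program (a full path) to run when the job finishes.
    return ["strigger", "--set", "--jobid=" + str(slurm_job_id),
            "--fini", "--program=" + cleanup_program]


def cleanup_update(container):
    """Return the record update the cleanup script should apply, or None.

    container: dict with a "state" key, as read from the container record.
    """
    if container.get("state") in FINAL_STATES:
        return None  # crunch-run already recorded the outcome
    # Assumed policy: the job ended without a final state, so mark it cancelled.
    return {"state": "Cancelled"}
```

For example, cleanup_update({"state": "Running"}) yields an update, while a record already in state "Complete" is left alone.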


Subtasks 4 (0 open, 4 closed)

Task #8474: Review 6518-crunch2-dispatch-slurm (Resolved, Peter Amstutz, 07/08/2015)
Task #8522: Implement crunch-dispatch-slurm (Resolved, Peter Amstutz, 07/08/2015)
Task #8608: Review tests branch: 6518-crunch2-dispatch-slurm-tests (Resolved, Peter Amstutz, 07/08/2015)
Task #8607: Add tests (Resolved, Radhika Chippada, 07/08/2015)

Related issues

Related to Arvados - Idea #6282: [Crunch] Write stories for implementation of Crunch v2 (Resolved, Peter Amstutz, 06/23/2015)
Related to Arvados - Feature #7816: [Crunch2] Execute minimal container spec with logging (Resolved, Peter Amstutz, 11/17/2015)
Related to Arvados - Feature #8128: [Crunch2] API support for crunch-dispatch (Resolved, Tom Clegg, 04/28/2016)
Blocked by Arvados - Idea #6429: [API] [Crunch2] Implement "containers" and "container requests" tables, models and controllers (Resolved, Peter Amstutz, 12/03/2015)