Feature #15345

[arvados-dispatch-cloud] kill container (management API)

Added by Tom Clegg 15 days ago. Updated about 7 hours ago.

Status: Resolved
Priority: Normal
Assigned To:
Category: -
Target version:
Start date: 06/19/2019
Due date:
% Done: 100%
Estimated time: (Total: 0.00 h)
Story points: 1.0

Description

As described in "Dispatching containers to cloud VMs":

POST /arvados/v1/dispatch/containers/kill?container_uuid=X terminates a container immediately.
  • a single attempt is made to send SIGTERM to the container
  • container state/priority fields are not affected
  • assuming SIGTERM works, the container record will end up with state "Cancelled"
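
A minimal sketch of how a client might build this request, using only the Python standard library. The endpoint path and the container_uuid parameter come from the description above. The dispatcher address, the placeholder UUID, and the use of a bearer management token in the Authorization header are assumptions for illustration, not details confirmed by this ticket. The helper name build_kill_request is hypothetical.

```python
import urllib.parse
import urllib.request


def build_kill_request(dispatcher_url: str, container_uuid: str,
                       management_token: str) -> urllib.request.Request:
    """Build (but do not send) the POST request asking the dispatcher to kill a container.

    Assumption: the management API accepts "Authorization: Bearer <token>".
    """
    query = urllib.parse.urlencode({"container_uuid": container_uuid})
    url = f"{dispatcher_url.rstrip('/')}/arvados/v1/dispatch/containers/kill?{query}"
    return urllib.request.Request(
        url,
        method="POST",
        headers={"Authorization": f"Bearer {management_token}"},
    )


if __name__ == "__main__":
    # Hypothetical dispatcher address, container UUID, and token.
    req = build_kill_request("http://localhost:9006",
                             "zzzzz-dz642-xxxxxxxxxxxxxxx",
                             "example-token")
    print(req.get_method(), req.full_url)
    # To actually send it: urllib.request.urlopen(req, timeout=10)
```

Because only a single SIGTERM attempt is made, a caller that needs confirmation would presumably check the container record afterwards for state "Cancelled" rather than rely on the response alone.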

Currently (without this API), an admin can get similar results by using the controller API to set the container's priority to 0. This API is more direct: it works even when the dispatcher is not syncing with the controller properly or quickly enough, and it bypasses the TimeoutSignal config that rate-limits SIGTERM attempts per container.
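
For comparison, the existing workaround (setting priority to 0 through the controller) might look roughly like the sketch below. The route /arvados/v1/containers/{uuid}, the PUT method, and the {"container": {...}} body shape are assumptions about the controller's REST API, not taken from this ticket. The helper name and the example host are hypothetical.

```python
import json
import urllib.request


def build_priority_zero_request(controller_url: str, container_uuid: str,
                                api_token: str) -> urllib.request.Request:
    """Build (but do not send) a request that sets a container's priority to 0.

    Assumption: the controller accepts PUT with a JSON body of the form
    {"container": {"priority": 0}} and a bearer token.
    """
    body = json.dumps({"container": {"priority": 0}}).encode("utf-8")
    url = f"{controller_url.rstrip('/')}/arvados/v1/containers/{container_uuid}"
    return urllib.request.Request(
        url,
        data=body,
        method="PUT",
        headers={
            "Authorization": f"Bearer {api_token}",
            "Content-Type": "application/json",
        },
    )


if __name__ == "__main__":
    # Hypothetical controller address, container UUID, and admin token.
    req = build_priority_zero_request("https://controller.example",
                                      "zzzzz-dz642-xxxxxxxxxxxxxxx",
                                      "example-admin-token")
    print(req.get_method(), req.full_url, req.data.decode("utf-8"))
```

Unlike the kill endpoint, this path changes the container's priority field and depends on the dispatcher noticing the change, which is why it can be slower or fail when syncing is impaired.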


Subtasks

Task #15384: Review 15345-kill-container-api (Resolved, Eric Biagiotti)


Related issues

Blocks Arvados - Story #13908: [Epic] Replace SLURM for cloud job scheduling/dispatching (New)

Associated revisions

Revision 1519b9e7
Added by Tom Clegg about 22 hours ago

Merge branch '15345-kill-container-api'

closes #15345

Arvados-DCO-1.1-Signed-off-by: Tom Clegg <>

History

#1 Updated by Tom Clegg 15 days ago

  • Blocks Story #13908: [Epic] Replace SLURM for cloud job scheduling/dispatching added

#2 Updated by Tom Morris 13 days ago

  • Story points set to 1.0
  • Target version changed from To Be Groomed to Arvados Future Sprints

#3 Updated by Tom Clegg 7 days ago

  • Assigned To set to Tom Clegg
  • Status changed from New to In Progress

#4 Updated by Tom Clegg 7 days ago

  • Description updated

#6 Updated by Tom Morris 6 days ago

  • Target version changed from Arvados Future Sprints to 2019-07-03 Sprint

#7 Updated by Eric Biagiotti 4 days ago

Tom Clegg wrote:

15345-kill-container-api @ 49717fb59156c2b276ccc2fde0b9f2de71e812a6 -- https://ci.curoverse.com/view/Developer/job/developer-run-tests/1326/

This LGTM, if the tests are rerun and they pass. Thanks!

#9 Updated by Tom Clegg about 7 hours ago

  • Status changed from In Progress to Resolved
