Project

General

Profile

Actions

Feature #21938

open

Additional arvados-dispatch-cloud metrics

Added by Peter Amstutz 5 days ago. Updated 1 day ago.

Status:
New
Priority:
Normal
Assigned To:
-
Category:
Crunch
Story points:
-

Description

  • total vCPUs (since AWS quotas are based on vCPUs)
  • count of "instance creation errors" that is parameterized on cluster, instance type and subnet. Either cumulative or rolling count of errors with a period of last 5-10 minutes
    • only want to log fatal errors where it couldn't start an instance despite trying every subnet / every candidate instance type
Actions

Also available in: Atom PDF