Feature #20383
open
Monitoring that gives list of compute containers that don't seem to be making progress
Story points: -
Description
Want to get a real-time list of containers whose CPU usage and I/O usage are very low, indicating that they aren't doing any work.
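A rough sketch of what the check could look like, to make the request concrete. Everything here is an assumption for illustration: the `Sample` shape, the `is_idle` and `idle_containers` helpers, and the threshold values are hypothetical and do not correspond to any existing API. It assumes two cumulative counter readings per container are available and flags a container when both its CPU rate and its I/O rate fall below a threshold over the interval.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass
class Sample:
    """Cumulative counters for one container at one point in time (hypothetical shape)."""
    timestamp: float    # seconds since some fixed epoch
    cpu_seconds: float  # cumulative CPU time used (user + sys)
    io_bytes: int       # cumulative bytes read + written


def is_idle(older: Sample, newer: Sample,
            cpu_threshold: float = 0.01,
            io_threshold: float = 1024.0) -> bool:
    """Return True if both CPU and I/O rates between the two samples are very low.

    cpu_threshold is in CPU-seconds per wall-clock second;
    io_threshold is in bytes per second. Both defaults are placeholders.
    """
    elapsed = newer.timestamp - older.timestamp
    if elapsed <= 0:
        # Not enough data to judge; do not report the container as idle.
        return False
    cpu_rate = (newer.cpu_seconds - older.cpu_seconds) / elapsed
    io_rate = (newer.io_bytes - older.io_bytes) / elapsed
    return cpu_rate < cpu_threshold and io_rate < io_threshold


def idle_containers(samples: Dict[str, Tuple[Sample, Sample]],
                    **thresholds: float) -> List[str]:
    """Return the sorted identifiers of containers that look idle."""
    return sorted(
        ident for ident, (older, newer) in samples.items()
        if is_idle(older, newer, **thresholds)
    )
```

Requiring both rates to be low, rather than either one, avoids flagging containers that are CPU-bound with little I/O or I/O-bound with little CPU. The thresholds would need tuning against real workloads.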
Updated by Peter Amstutz over 1 year ago
- Subject changed from Monitoring that gives list of "idle" compute nodes to Monitoring that gives list of compute containers that don't seem to be making progress
Updated by Brett Smith over 1 year ago
What about CUDA jobs? If they're pegging the GPU but nothing else, is that reported? Can they be excluded from this list?