Idea #2622
closedDatamanager outputs garbage collection list
Description
The python datamanager will output a CSV file with the following format:
block uuid, latest mtime, disk size, cumulative size, disk free
These are blocks which exist on keep that no one has persisted, sorted by increasing mtime. The columns are:
block uuid: The id of the block we want to delete
latest mtime: The latest mtime of the block across all keep servers.
disk size: The total disk space used by this block (block size multiplied by current replication level)
cumulative disk size: The sum of this block's disk size and all the blocks listed above it
disk free: The proportion of our disk space that would be free if we deleted this block and all the above. So this is (current free disk space - cumulative disk size) / total disk capacity