Project

General

Profile

Actions

Idea #2622

closed

Datamanager outputs garbage collection list

Added by Misha Zatsman over 10 years ago. Updated over 10 years ago.

Status:
Resolved
Priority:
Normal
Assigned To:
Misha Zatsman
Category:
-
Start date:
04/16/2014
Due date:
Story points:
1.0

Description

The python datamanager will output a CSV file with the following format:

block uuid, latest mtime, disk size, cumulative size, disk free

These are blocks which exist on keep that no one has persisted, sorted by increasing mtime. The columns are:
block uuid: The id of the block we want to delete
latest mtime: The latest mtime of the block across all keep servers.
disk size: The total disk space used by this block (block size multiplied by current replication level)
cumulative disk size: The sum of this block's disk size and all the blocks listed above it
disk free: The proportion of our disk space that would be free if we deleted this block and all the above. So this is (current free disk space - cumulative disk size) / total disk capacity


Subtasks 2 (0 open2 closed)

Task #2690: Output garbage collection listResolvedMisha Zatsman04/16/2014Actions
Task #2689: Move datamanager to experimental directory and submit ResolvedMisha Zatsman04/16/2014Actions
Actions

Also available in: Atom PDF