Project

General

Profile

Actions

Idea #2622

closed

Datamanager outputs garbage collection list

Added by Misha Zatsman over 10 years ago. Updated over 10 years ago.

Status:
Resolved
Priority:
Normal
Assigned To:
Misha Zatsman
Category:
-
Start date:
04/16/2014
Due date:
Story points:
1.0

Description

The python datamanager will output a CSV file with the following format:

block uuid, latest mtime, disk size, cumulative size, disk free

These are blocks which exist on keep that no one has persisted, sorted by increasing mtime. The columns are:
block uuid: The id of the block we want to delete
latest mtime: The latest mtime of the block across all keep servers.
disk size: The total disk space used by this block (block size multiplied by current replication level)
cumulative disk size: The sum of this block's disk size and all the blocks listed above it
disk free: The proportion of our disk space that would be free if we deleted this block and all the above. So this is (current free disk space - cumulative disk size) / total disk capacity


Subtasks 2 (0 open2 closed)

Task #2690: Output garbage collection listResolvedMisha Zatsman04/16/2014Actions
Task #2689: Move datamanager to experimental directory and submit ResolvedMisha Zatsman04/16/2014Actions
Actions #1

Updated by Misha Zatsman over 10 years ago

  • Tracker changed from Bug to Idea
Actions #2

Updated by Misha Zatsman over 10 years ago

  • Project changed from 35 to Arvados
Actions #3

Updated by Misha Zatsman over 10 years ago

  • Target version set to 2014-05-07 Storing and Organizing Data
Actions #4

Updated by Misha Zatsman over 10 years ago

  • Story points set to 1.0
Actions #5

Updated by Misha Zatsman over 10 years ago

  • Description updated (diff)
Actions #6

Updated by Misha Zatsman over 10 years ago

  • Status changed from New to Resolved
Actions

Also available in: Atom PDF