Project

General

Profile

Actions

Feature #21351

open

Support Amazon S3 inventory as alternative to list API

Added by Peter Amstutz 4 months ago. Updated 4 months ago.

Status:
New
Priority:
Normal
Assigned To:
-
Category:
Keep
Target version:
Story points:
-

Description

User has 75 million objects in one S3 bucket. It takes about 5 hours to list all those objects in order to run keep-balance. We should think about how to make this go faster.

AWS provides an alternative to the list API to getting the list of objects in a bucket, called S3 Inventory:

https://docs.aws.amazon.com/AmazonS3/latest/userguide/storage-inventory.html

With S3 Inventory, we could download a single flat file with all the objects listed. The file is updated daily.

-- Although, on second thought, if that means we can only run keep-balance once a day, it ends up being a wash.

Actions #1

Updated by Peter Amstutz 4 months ago

  • Target version changed from Development 2024-02-14 sprint to Future
  • Description updated (diff)
Actions #2

Updated by Peter Amstutz 4 months ago

  • Description updated (diff)
Actions

Also available in: Atom PDF