Project

General

Profile

Actions

Feature #16519

open

[keepstore] optimize md5sum calculations

Added by Ward Vandewege almost 4 years ago. Updated 2 months ago.

Status:
New
Priority:
Normal
Assigned To:
-
Category:
-
Target version:
Story points:
-
Release:
Release relationship:
Auto

Description

There is now a Go package to speed up md5sum calculations when the hardware supports it (AVX/AVX2 extensions, which are common):

https://github.com/minio/md5-simd

which is described here:

https://blog.min.io/accelerating-aggregate-md5-hashing-up-to-800-with-avx512-2/

Keepstore should leverage this library to speed up its hashing, if the hardware it runs on supports the necessary extensions.

Ideally, this goes into our codebase in a such a way that all our Go code that calculates md5sums leverages it automatically.


Related issues

Related to Arvados - Feature #16518: [keep] Allow clients to set a header to disable md5sum calculations in keepstoreNewActions
Related to Arvados - Feature #16513: Get reference Keep performance numbers for Keep-on-S3ResolvedWard Vandewege06/15/2020Actions
Related to Arvados Epics - Idea #18342: Keep performance optimizationNew08/01/202305/30/2024Actions
Actions

Also available in: Atom PDF