Project

General

Profile

Actions

Bug #21617

closed

Timeout error reading content from collection on a remote cluster

Added by Tom Clegg about 1 month ago. Updated about 1 month ago.

Status:
Resolved
Priority:
Normal
Assigned To:
Category:
Keep
Story points:
-
Release relationship:
Auto

Description

In a 3-way federation with login cluster z1111:
  • a collection stored on z1111 can be read from z2222 (e.g., workbench.z2222/collections/z1111-4zz18-...)
  • a collection stored on z2222 cannot be read from z1111 (timeout)
  • a collection stored on z2222 cannot be read from z3333 (timeout)

It looks like the intermediate cluster's keepstore process cannot retrieve the list of keep services from the cluster where the data is stored ("failed to validate remote token") -- this auto-retries in the background for a while, then eventually blockReadRemote gives up.

Manual testing, with jutro/tordo/pirca playing the roles of z1111/z2222/z3333, indicates the same problem existed before and after #2960 was merged and deployed to tordo.


Subtasks 1 (0 open1 closed)

Task #21619: Review 21617-fed-contentResolvedTom Clegg03/29/2024Actions

Related issues

Related to Arvados - Bug #20750: collection sharing tokens shouldn't leak account info of the person sharing (user/currrent)ResolvedBrett Smith08/24/2023Actions
Actions

Also available in: Atom PDF