Project

General

Profile

Actions

Bug #4269

closed

[API] Disallow collection UUIDs in script_parameters, only allow portable data hash

Added by Peter Amstutz about 10 years ago. Updated about 10 years ago.

Status:
Resolved
Priority:
Normal
Assigned To:
Tim Pierce
Category:
-
Target version:
Story points:
1.0

Description

One of the unintended side effects of #3036 is that the Collection chooser dialog has been filling in collection record uuids instead of portable data hashes. As a result, most jobs since then contain a uuid instead of a hash in the script_parameters field. This works because arv-mount permits uuids in addition to hashes in --by-id mode (perhaps it shouldn't have). This breaks reproducibility because collection records can be modified after the fact. The science team has already experienced jobs being inappropriately reused due to this problem. Recommend:

  1. Add a migration that searches past jobs and replaces any instances of collection uuids with the appropriate portable data hash for the collection.
  2. Add a job model validation that prevents uuids from showing up in script_parameters
  3. Update legacy code that accepts or tolerates collection UUIDs in script_parameters:
    • arv-copy
    • (other?)

This is blocked by #4015


Subtasks 2 (0 open2 closed)

Task #4725: Review 4269-no-collection-uuid-in-script-paramsResolvedRadhika Chippada12/04/2014Actions
Task #4397: add validationResolvedTim Pierce12/04/2014Actions

Related issues 2 (0 open2 closed)

Related to Arvados - Bug #4756: [API] Add migration to change collection uuids to portable_data_hash in old job script_parametersRejected12/09/2014Actions
Blocked by Arvados - Bug #4015: [Workbench] Collection chooser should be filling in portable_data_hash as well as collection uuid when used to pick pipeline inputs.ResolvedTim Pierce10/21/2014Actions
Actions

Also available in: Atom PDF