Project

General

Profile

Actions

Bug #18724

open

Collection file_names should contain full paths for substring matching

Added by Peter Amstutz 6 months ago. Updated 5 months ago.

Status:
New
Priority:
Normal
Assigned To:
-
Category:
API
Target version:
-
Start date:
Due date:
% Done:

0%

Estimated time:
Story points:
-

Description

Customer wants to search for a file by name, and probably also by path.

Currently, they can use the trigram index on the full Collection record.

This turned up an unexpected behavior.

Searching for "dmel_r6.16/Bowtie2Index/genome.fa" doesn't work

Searching for "dmel_r6.16/Bowtie2Index" works

This is because the file_names column is populated with each file name and each stream name ("directory") from the manifest. The file name and stream name are not joined to get a full path. So the file_names table looks like:

genome.fa
./dmel_r6.16/Bowtie2Index

It would better fit user expectations if it was formatted like this:

./dmel_r6.16/Bowtie2Index/genome.fa

Propose migrating the column contents to use full paths.


Related issues

Related to Arvados - Bug #18767: Document behavior of text search on filenames and paths in CollectionsResolvedPeter Amstutz03/16/2022

Actions
Actions #1

Updated by Peter Amstutz 6 months ago

  • Description updated (diff)
Actions #2

Updated by Peter Amstutz 6 months ago

  • Target version changed from 2022-03-02 sprint to 2022-03-16 sprint
Actions #3

Updated by Peter Amstutz 6 months ago

  • Related to Bug #18767: Document behavior of text search on filenames and paths in Collections added
Actions #4

Updated by Peter Amstutz 5 months ago

  • Target version changed from 2022-03-16 sprint to 2022-03-30 Sprint
Actions #5

Updated by Peter Amstutz 5 months ago

  • Target version deleted (2022-03-30 Sprint)
Actions

Also available in: Atom PDF