Project

General

Profile

Actions

Idea #8563

closed

[Docs] Pipeline author guide gives a basic demonstration of including a third-party tool

Added by Brett Smith about 8 years ago. Updated almost 3 years ago.

Status:
Closed
Priority:
Normal
Assigned To:
Sarah Guthrie
Category:
Documentation
Target version:
-
Start date:
02/26/2016
Due date:
Story points:
0.5

Description

Write a new wiki page describing:

  • Basic introduction to writing a Dockerfile (with links to more resources/references), using a small but real analysis tool
  • How to build your Docker image
  • How to upload your Docker image to Arvados
  • How to call your tool from a Crunch script, including best practices (using subprocess.Popen, capturing stdout, uploading results, setting success based on Popen's returncode)
    • How to upload output from the tool using arvados.crunch.TaskOutputDir()
    • Explain when TaskOutputDir does not work:
      • The tool writes things that fuse does not support (symbollic links and named pipes)
      • The I/O access patterns are not performant with fuse (ex: 20 file handles on one file - tophat)
    • For when it doesn't work, explain how to use a tempdir and how to save one file from that directory or the entire directory tree to Keep

Subtasks 1 (0 open1 closed)

Task #8974: ReviewResolvedTom Morris02/26/2016Actions
Actions

Also available in: Atom PDF