Bcbio-nextgen tutorial » History » Version 8
Bryan Cosca, 03/16/2015 07:03 PM
h1. Running bcbio-nextgen using Arvados

WIP

h2. Uploading data through the web and using it on Arvados

# In your home project, click the blue *+ Add data* button in the top right.
# Click *Upload files from my computer*.
# Click *Choose Files* and select the two paired-end FASTQ files you would like to run bcbio-nextgen on.
# Once you're ready, click *> Start*.
# Feel free to rename your collection so you can find it later: click the pencil icon in the top left corner next to *New collection*.
# Once the upload is complete, navigate back to the dashboard, click *Run a pipeline...*, and choose the bcbio-nextgen pipeline.
# To change the inputs, click the *[Choose]* button next to the *R1 parameter* and *R2 parameter*.
# Select your left FASTQ file as R1 and your right FASTQ file as R2.
# For each FASTQ file, click the dropdown menu, click your newly created project, and choose your desired input collection.
# Click *Run* to run bcbio on your data!

h2. Uploading data through your shell and using it on Arvados

Full documentation can be found "here":http://doc.arvados.org/user/tutorials/tutorial-keep.html

# Install the "Arvados Python SDK":http://doc.arvados.org/sdk/python/sdk-python.html on the system from which you will upload the data (such as your workstation, or a server containing data from your sequencer). This also installs the Arvados file upload tool, @arv-put@.
# Configure your environment with the Arvados instance host name and authentication token as described "here":http://doc.arvados.org/user/reference/api-tokens.html
# Navigate back to your Workbench dashboard and create a new project: open the *Projects* dropdown menu and click *Home*.
# Click [+ Add a subproject]. Feel free to edit the project name or description by clicking the pencil to the right of the text.
# To add data, return to your shell, create a folder, and put the two paired-end FASTQ files you want to upload inside it. From inside that folder, run @arv-put * --project-uuid qr1hi-xxxxx-yyyyyyyyyyyyyyy@, replacing the placeholder with the UUID of your new project (it begins with @qr1hi@ and appears in the project's URL). Uploading both files with a single command ensures that they end up in one collection.
# The output value @xxxxxxxxxxxxxxxxxxxx+yyyy@ is the Arvados collection locator that uniquely identifies the uploaded collection.
# Once the upload is complete, navigate back to the dashboard, click *Run a pipeline...*, and choose the bcbio-nextgen pipeline.
# To change the inputs, click the *[Choose]* button next to the *R1 parameter* and *R2 parameter*.
# Select your left FASTQ file as R1 and your right FASTQ file as R2.
# For each FASTQ file, click the dropdown menu, click your newly created project, and choose your desired input collection.
# Click *Run* to run bcbio on your data!
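
The shell upload steps above can also be scripted. The following is only a minimal sketch, not an official Arvados tool: it assumes @arv-put@ is on your PATH, that the host name and token are provided through the @ARVADOS_API_HOST@ and @ARVADOS_API_TOKEN@ environment variables, and that the upload folder contains exactly the two FASTQ files. The function names @missing_arvados_env@, @build_arv_put_command@ and @upload@ are made up for this example. By default it only prints the command it would run.

```python
import os
import shlex
import subprocess
from pathlib import Path

# Environment variables read by the Arvados command-line tools and SDKs.
REQUIRED_ENV = ("ARVADOS_API_HOST", "ARVADOS_API_TOKEN")


def missing_arvados_env(env=None):
    """Return the required Arvados variables that are unset or empty."""
    env = os.environ if env is None else env
    return [name for name in REQUIRED_ENV if not env.get(name)]


def build_arv_put_command(fastq_dir, project_uuid):
    """Build the arv-put argument list for the two files in fastq_dir."""
    # Roughly mirror the shell glob `*`: regular, non-hidden files, sorted.
    files = sorted(
        p.name for p in Path(fastq_dir).iterdir()
        if p.is_file() and not p.name.startswith(".")
    )
    if len(files) != 2:
        raise ValueError(f"expected 2 paired-end FASTQ files, found {len(files)}")
    return ["arv-put", *files, "--project-uuid", project_uuid]


def upload(fastq_dir, project_uuid, dry_run=True):
    """Print the arv-put command and, unless dry_run is set, execute it."""
    missing = missing_arvados_env()
    if missing and not dry_run:
        raise RuntimeError("unset Arvados settings: " + ", ".join(missing))
    command = build_arv_put_command(fastq_dir, project_uuid)
    print(shlex.join(command))
    if dry_run:
        return None
    # Run from inside the folder so the collection holds bare file names.
    result = subprocess.run(
        command, cwd=fastq_dir, check=True, capture_output=True, text=True
    )
    # Per the steps above, arv-put's output is the collection locator.
    return result.stdout.strip()
```

For example, @upload("fastq_upload", "qr1hi-xxxxx-yyyyyyyyyyyyyyy")@ prints the command for a hypothetical folder named @fastq_upload@ without contacting Arvados; pass @dry_run=False@ once the printed command looks right.
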

h3. FAQ

WIP