More about running fastq-to-gVCF

When we do releases, we run a test pipeline that is intended to be representative of a bioinformatics workload.

1. Deploy the version of arvados-cwl-runner that you want to test. Make sure that the corresponding arvados/jobs image is available: either it has been built and uploaded to Docker Hub, or it has been built using the arvados/build/build-dev-docker-jobs-image.sh script and uploaded using arv-keepdocker.

2. Clone https://git.arvados.org/arvados-tutorial.git/

3. Create an Arvados project for the test run

4. cd arvados-tutorial/WGS-processing

5. Run the following command, replacing <my project> with the UUID of the project created in step 3: arvados-cwl-runner --no-wait --disable-reuse --project-uuid <my project> cwl/wgs-processing-wf.cwl yml/wgs-processing-wf-chr19.yml

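6. Monitor the workflow run until it completes successfully. Because of --no-wait, the command returns as soon as the workflow is submitted, so it does not report the final status itself. The run usually takes about an hour.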
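
Steps 2, 4 and 5 above can be collected into one shell function. This is a minimal sketch, not an official release tool. The name run_wgs_test is made up for illustration. It assumes that git and arvados-cwl-runner are on the PATH, that ARVADOS_API_HOST and ARVADOS_API_TOKEN are already set for the target cluster, and that the project from step 3 already exists. The clone directory is passed to git explicitly, so it is always named arvados-tutorial.

```shell
# Sketch only: run_wgs_test is a hypothetical helper, not part of Arvados.
# The body runs in a subshell so the caller's working directory is unchanged.
run_wgs_test() (
    project_uuid="${1:-}"
    if [ -z "$project_uuid" ]; then
        echo "usage: run_wgs_test <project uuid>" >&2
        exit 2
    fi

    # Step 2: clone the tutorial repository unless it is already present.
    if [ ! -d arvados-tutorial ]; then
        git clone https://git.arvados.org/arvados-tutorial.git/ arvados-tutorial || exit 1
    fi

    # Step 4: the workflow and input paths are relative to this directory.
    cd arvados-tutorial/WGS-processing || exit 1

    # Step 5: submit the workflow without waiting for it to finish.
    arvados-cwl-runner --no-wait --disable-reuse \
        --project-uuid "$project_uuid" \
        cwl/wgs-processing-wf.cwl yml/wgs-processing-wf-chr19.yml
)
```

Calling run_wgs_test with the UUID of the test project submits the workflow. Calling it with no argument prints a usage message and returns status 2 without touching the network.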

If you are running this on pirca, all the data should already be present. If you are running it on another cluster, you may first need to copy data from pirca to that cluster. The input document yml/wgs-processing-wf-chr19.yml lists the portable data hashes of the input collections.
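
To see which collections would need copying, the portable data hashes can be pulled out of the input document with grep. This is a sketch under stated assumptions. The function names list_pdhs and copy_inputs are made up for illustration. The pattern assumes a portable data hash is written as 32 lowercase hex digits, a plus sign, and a size. The copy step assumes that arv-copy is installed, that it accepts a portable data hash as the object to copy, and that credentials for both pirca and the destination cluster are configured, so check the arv-copy documentation for your version before relying on it.

```shell
# list_pdhs FILE: print each distinct portable data hash found in FILE.
list_pdhs() {
    grep -oE '[0-9a-f]{32}\+[0-9]+' "$1" | sort -u
}

# copy_inputs FILE DST_CLUSTER DST_PROJECT_UUID (hypothetical helper):
# copy every collection referenced in FILE from pirca to the destination.
copy_inputs() {
    input_yml="$1"
    dst_cluster="$2"
    dst_project="$3"
    list_pdhs "$input_yml" | while read -r pdh; do
        # Assumption: arv-copy takes a portable data hash as its object argument.
        arv-copy --src pirca --dst "$dst_cluster" \
            --project-uuid "$dst_project" "$pdh" || exit 1
    done
}
```

Running list_pdhs yml/wgs-processing-wf-chr19.yml on its own is harmless and only prints the hashes. copy_inputs is the part that talks to both clusters.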

Updated by Peter Amstutz 13 days ago · 2 revisions