Public Pipelines and Datasets » History » Version 4
Nancy Ouyang, 06/09/2015 05:21 PM
1 | 3 | Radhika Chippada | {{toc}} |
---|---|---|---|
2 | |||
3 | 2 | Nancy Ouyang | WARNING: WORK-IN-PROGRESS |
4 | |||
5 | 1 | Nancy Ouyang | h1. Public Pipelines |
6 | 2 | Nancy Ouyang | |
7 | 3 | Radhika Chippada | h2. Bcbio-nextgen |
8 | 1 | Nancy Ouyang | |
9 | 3 | Radhika Chippada | By: Brad Chapman |
10 | |||
11 | 1 | Nancy Ouyang | Bcbio-nextgen is a python toolkit providing best-practice pipelines for fully automated high throughput sequencing analysis. |
12 | |||
13 | 3 | Radhika Chippada | p(. Source Documentation: https://bcbio-nextgen.readthedocs.org/en/latest/ |
14 | 1 | Nancy Ouyang | |
15 | 3 | Radhika Chippada | p(. Arvados Documentation: https://arvados.org/projects/arvados/wiki/Bcbio-nextgen_tutorial |
16 | 2 | Nancy Ouyang | |
17 | 3 | Radhika Chippada | p(. Link to Arvados Project: https://workbench.qr1hi.arvadosapi.com/projects/qr1hi-j7d0g-a45rn6qhjszct1d |
18 | |||
19 | h2. GATK 2 Unified Genotyper |
||
20 | |||
21 | 1 | Nancy Ouyang | The Genome Analysis Toolkit or GATK is a software package developed at the Broad Institute to analyze high-throughput sequencing data. Raw reads are processed and the GATK UnifiedGenotyper is will call SNPs and indels. |
22 | 2 | Nancy Ouyang | |
23 | 3 | Radhika Chippada | p(. Source Documentation: http://gatkforums.broadinstitute.org/discussion/17/gatk-2-0-announcement |
24 | 1 | Nancy Ouyang | |
25 | 3 | Radhika Chippada | p(. Arvados Documentation: https://arvados.org/projects/arvados/wiki/GATK2_tutorial |
26 | 1 | Nancy Ouyang | |
27 | 3 | Radhika Chippada | p(. Link to Arvados Project: https://workbench.qr1hi.arvadosapi.com/projects/qr1hi-j7d0g-iodpr5gc65cp4t0 |
28 | |||
29 | h2. GATK 3 Haplotype Caller |
||
30 | |||
31 | 1 | Nancy Ouyang | The Genome Analysis Toolkit or GATK is a software package developed at the Broad Institute to analyze high-throughput sequencing data. Raw reads are processed then, the GATK HaplotypeCaller will call SNPs and indels simultaneously via local re-assembly of haplotypes in an active region. |
32 | |||
33 | 3 | Radhika Chippada | p(. Source Documentation: https://www.broadinstitute.org/gatk/blog?tag=gatk3 |
34 | 2 | Nancy Ouyang | |
35 | 3 | Radhika Chippada | p(. Arvados Documentation: https://arvados.org/projects/arvados/wiki/GATK3_tutorial |
36 | 1 | Nancy Ouyang | |
37 | 3 | Radhika Chippada | p(. Link to Arvados Project: https://workbench.qr1hi.arvadosapi.com/projects/qr1hi-j7d0g-l1emsbyyx6wyjdh |
38 | |||
39 | h2. lobSTR 3 |
||
40 | |||
41 | 1 | Nancy Ouyang | lobSTR is a tool for profiling Short Tandem Repeats (STRs) from high throughput sequencing data. |
42 | 2 | Nancy Ouyang | |
43 | 3 | Radhika Chippada | p(. Source Documentation: http://melissagymrek.com/lobstr-code/ |
44 | 1 | Nancy Ouyang | |
45 | 3 | Radhika Chippada | p(. Arvados Documentation: https://arvados.org/projects/arvados/wiki/LobSTR_tutorial |
46 | 1 | Nancy Ouyang | |
47 | 3 | Radhika Chippada | p(. Link to Arvados Project: https://workbench.qr1hi.arvadosapi.com/projects/qr1hi-j7d0g-up6qgpqz5ie2vfq |
48 | |||
49 | h2. Platypus |
||
50 | |||
51 | 1 | Nancy Ouyang | Platypus is a tool designed for efficient and accurate variant-detection in high-throughput sequencing data. Platypus can detect SNPs, MNPs, short indels, replacements and (using the assembly option) deletions up to several kb. |
52 | |||
53 | 3 | Radhika Chippada | p(. Source Documentation: http://www.well.ox.ac.uk/platypus |
54 | 1 | Nancy Ouyang | |
55 | 3 | Radhika Chippada | p(. Arvados Documentation: https://arvados.org/projects/arvados/wiki/Platypus_tutorial |
56 | 2 | Nancy Ouyang | |
57 | 3 | Radhika Chippada | p(. Link to Arvados Project: https://workbench.qr1hi.arvadosapi.com/projects/qr1hi-j7d0g-xcntt7rtz251isk |
58 | |||
59 | h2. Pathomap -- Ancestry Mapper |
||
60 | |||
61 | By: Chris Mason |
||
62 | |||
63 | 2 | Nancy Ouyang | PathoMap is a research project by Weill Cornell Medical College to study the microbiome and metagenome of the built environment of NYC. The Ancestry Mapper pipeline is one pipeline out of the Pathomap suite, which takes human high-throughput sequencing data and creates an ancestry map. |
64 | 1 | Nancy Ouyang | |
65 | 3 | Radhika Chippada | p(. Source Documentation: http://www.pathomap.org/about/ |
66 | 1 | Nancy Ouyang | |
67 | 3 | Radhika Chippada | p(. Arvados Documentation: https://arvados.org/projects/arvados/wiki/pathomap_tutorial/ |
68 | 1 | Nancy Ouyang | |
69 | 3 | Radhika Chippada | p(. Link to Arvados Project: https://workbench.qr1hi.arvadosapi.com/projects/qr1hi-j7d0g-662ij1pcw6bj8uj |
70 | |||
71 | h2. Tuxedo Suite (BTC) |
||
72 | |||
73 | 1 | Nancy Ouyang | The Tuxedo Suite (Bowtie, Tophat, Cufflinks) takes High-throughput RNA sequencing data and assembles transcripts and estimates their relative abundances, without using a reference annotation. |
74 | |||
75 | 3 | Radhika Chippada | p(. Source Documentation: |
76 | 1 | Nancy Ouyang | |
77 | 3 | Radhika Chippada | p(((. http://bowtie-bio.sourceforge.net/bowtie2/index.shtml |
78 | 2 | Nancy Ouyang | |
79 | 3 | Radhika Chippada | p(((. http://ccb.jhu.edu/software/tophat/index.shtml |
80 | |||
81 | p(((. http://cole-trapnell-lab.github.io/cufflinks/ |
||
82 | |||
83 | 4 | Nancy Ouyang | p(. Arvados Documentation: https://arvados.org/projects/arvados/wiki/Tuxedo_tutorial |
84 | 3 | Radhika Chippada | |
85 | p(. Link to Arvados Project: https://workbench.qr1hi.arvadosapi.com/projects/qr1hi-j7d0g-ziejtm5kaik7ndl |
||
86 | |||
87 | h2. VCF-compare |
||
88 | |||
89 | 2 | Nancy Ouyang | Compares positions in two VCF files and outputs concordance and discordance rates. |
90 | |||
91 | 3 | Radhika Chippada | p(. Source Documentation: http://vcftools.sourceforge.net/perl_module.html#vcf-compare |
92 | 2 | Nancy Ouyang | |
93 | 3 | Radhika Chippada | p(. Arvados Documentation: https://arvados.org/projects/arvados/wiki/Vcf-compare_tutorial |
94 | 2 | Nancy Ouyang | |
95 | 3 | Radhika Chippada | p(. Link to Arvados Project: https://workbench.qr1hi.arvadosapi.com/projects/qr1hi-j7d0g-qhf708xb2s7cow2 |
96 | 2 | Nancy Ouyang | |
97 | 3 | Radhika Chippada | |
98 | 2 | Nancy Ouyang | h2. All the tutorials |
99 | |||
100 | Each of the following tutorials introduces the following Arvados features: |
||
101 | |||
102 | * How to run the pipeline using Arvados |
||
103 | * How to access your pipeline results |
||
104 | * How to browse and select your input data for lobSTR and submit re-run the pipeline |
||
105 | |||
106 | 3 | Radhika Chippada | p(. https://arvados.org/projects/arvados/wiki/Platypus_tutorial |
107 | 2 | Nancy Ouyang | |
108 | 3 | Radhika Chippada | p(. https://arvados.org/projects/arvados/wiki/gatk2_tutorial |
109 | 2 | Nancy Ouyang | |
110 | 3 | Radhika Chippada | p(. https://arvados.org/projects/arvados/wiki/gatk3_tutorial |
111 | |||
112 | p(. https://arvados.org/projects/arvados/wiki/bcbio-nextgen_tutorial |
||
113 | |||
114 | p(. https://arvados.org/projects/arvados/wiki/Tuxedo_tutorial |
||
115 | |||
116 | p(. https://arvados.org/projects/arvados/wiki/Pathomap_tutorial |
||
117 | |||
118 | |||
119 | h1. Public Datasets (TBD) |