id: annotation
title: Genome annotation
tabs:
- id: tools
title: Tools
heading_html: >
Common tools are listed here, or search for more in the full tool panel to the left.
content:
- title_html: <code>MAKER</code> - genome annotation pipeline
description_html: >
<p>
MAKER can annotate both prokaryotes and eukaryotes. It works by aligning as much evidence as possible along the genome sequence, and then reconciling all these signals to determine probable gene structures.
<br><br> The evidence can be transcript or protein sequences from the same (or a closely related) organism. These sequences can come from public databases (such as NR or GenBank) or from your own experimental data (for example, a transcriptome assembly from an RNA-seq experiment). MAKER can also take repeated elements into account.
</p>
inputs:
- datatypes:
- fasta
label: Genome assembly
- datatypes:
- fasta
label: Protein evidence (optional)
button_link: "{{ galaxy_base_url }}/tool_runner?tool_id=toolshed.g2.bx.psu.edu%2Frepos%2Fiuc%2Fmaker%2Fmaker"
- title_html: <code>Funannotate predict</code> - predicted gene annotations
description_html: >
<p>
<code>Funannotate predict</code> performs a comprehensive whole-genome gene prediction. It uses AUGUSTUS, GeneMark, SNAP, GlimmerHMM, BUSCO, EVidenceModeler, tbl2asn, tRNAscan-SE, Exonerate and minimap2. This approach differs from MAKER as it does not need to train <em>ab initio</em> predictors.
</p>
inputs:
- datatypes:
- fasta
label: Genome assembly (soft-masked)
- datatypes:
- bam
label: Mapped RNA evidence (optional)
- datatypes:
- fasta
label: Protein evidence (optional)
button_link: "{{ galaxy_base_url }}/tool_runner?tool_id=toolshed.g2.bx.psu.edu%2Frepos%2Fiuc%2Ffunannotate_predict%2Ffunannotate_predict"
- title_html: <code>RepeatMasker</code> - screen DNA sequences for interspersed repeats and low complexity regions
description_html: >
<p>
RepeatMasker is a program that screens DNA for repeated elements such as tandem repeats, transposons, SINEs and LINEs. Galaxy AU has installed the full and curated Dfam screening databases, or a custom database can be provided in <code>fasta</code> format. Additional reference data can be downloaded from <a href="https://www.girinst.org/repbase/" target="_blank"> Repbase</a>.
</p>
inputs:
- datatypes:
- fasta
label: Genome assembly
button_link: "{{ galaxy_base_url }}/tool_runner?tool_id=toolshed.g2.bx.psu.edu%2Frepos%2Fbgruening%2Frepeat_masker%2Frepeatmasker_wrapper"
- title_html: <code>InterProScan</code> - Scans InterPro database and assigns functional annotations
description_html: >
<p>
InterProScan is a batch tool for querying the InterPro database. It provides annotations based on multiple searches of profile and other functional databases.
</p>
inputs:
- datatypes:
- fasta
label: Genome assembly
button_link: "{{ galaxy_base_url }}/tool_runner?tool_id=toolshed.g2.bx.psu.edu%2Frepos%2Fbgruening%2Finterproscan%2Finterproscan"
- title_html: <code>Funannotate compare</code> - compare several annotations
description_html: >
<p>
<code>Funannotate compare</code> compares several annotations and outputs a GFF3 file with the best gene models. It can be used to compare the results of different gene predictors, or to compare the results of a gene predictor with a reference annotation.
</p>
inputs:
- datatypes:
- fasta
label: Genome assemblies to compare
button_link: "{{ galaxy_base_url }}/tool_runner?tool_id=toolshed.g2.bx.psu.edu%2Frepos%2Fiuc%2Ffunannotate_compare%2Ffunannotate_compare"
- title_html: <code>JBrowse</code> - Genome browser to visualize annotations
description_html: ''
inputs:
- datatypes:
- fasta
label: Genome assembly
- datatypes:
- gff
- gff3
- bed
label: Annotations
- datatypes:
- bam
label: Mapped RNAseq data (optional)
button_link: "{{ galaxy_base_url }}/tool_runner?tool_id=toolshed.g2.bx.psu.edu%2Frepos%2Fiuc%2Fjbrowse%2Fjbrowse"
- title_html: <code>Prokka</code> - Genome annotation, prokaryotes only
description_html: ''
inputs:
- datatypes:
- fasta
label: Genome assembly
button_link: "{{ galaxy_base_url }}/tool_runner?tool_id=toolshed.g2.bx.psu.edu%2Frepos%2Fcrs4%2Fprokka%2Fprokka"
- id: workflows
title: Workflows
heading_html: >
A workflow is a series of Galaxy tools that have been linked together to perform a specific analysis. You can use and customize the example workflows below.
<a href="https://galaxyproject.org/learn/advanced-workflow/" target="_blank">Learn more.</a>
content:
- title_html: Annotation with Maker
description_html: >
<p>
Annotates a genome using multiple rounds of Maker, including gene prediction using SNAP and Augustus. <br> <br> Tools: <code>maker</code> <code>snap</code> <code>augustus</code> <code>busco</code> <code>jbrowse</code>
</p>
inputs:
- datatypes:
- fasta
label: Genome assembly
- datatypes:
- fastq
label: RNAseq Illumina reads
- datatypes:
- fasta
label: Proteins
button_link: "{{ galaxy_base_url }}/u/anna/w/genome-annotation-with-maker"
view_link: ''
view_tip: ''
button_tip: Run in Galaxy AU
- title_html: Annotation with Funannotate
description_html: >
<p>
Annotates a genome using Funannotate, includes RNAseq data with RNAstar, and protein predictions from EggNOG. <br> <br> Tools: <code>RNAstar</code> <code>funannotate</code> <code>eggnog</code> <code>busco</code> <code>jbrowse</code> <code>aegean parseval</code>
</p>
inputs:
- datatypes:
- fasta
label: Genome assembly (soft-masked)
- datatypes:
- fastq
label: RNAseq Illumina reads
- datatypes:
- gff3
label: Alternative annotation
- datatypes:
- gbk
label: Alternative annotation
button_link: "{{ galaxy_base_url }}/u/anna/w/annotation-funannotate"
view_link: ''
view_tip: ''
button_tip: Run in Galaxy AU
- id: help
title: Help
content:
- title_html: What is genome annotation?
description_html: >
<p>
These <a href="https://training.galaxyproject.org/training-material/topics/genome-annotation/tutorials/introduction/slides.html#1" target="_blank">slides</a> from the Galaxy training network explain the process of genome annotation in detail. You can use the <code style="font-size: 1.5rem;">←</code> and <code style="font-size: 1.5rem;">→</code> keys to navigate through the slides.
</p>
- title_html: Genome annotation overview
description_html: >
<p>
The flowchart below shows how you might use your input data (in green) with different Galaxy tools (in blue) to annotate a genome assembly. For example, one pathway would be to take an assembled genome, plus information about repeats and data from RNA-seq, and run them through the MAKER pipeline. The annotations can then be viewed in JBrowse.
</p>
<img class="img-fluid" src="/static/home/labs/genome/static/annotation-overview.png" alt="Genome annotation flowchart">
<p class="text-center">
A graphical representation of genome annotation
</p>
- title_html: Can I use Fgenesh++ for annotation?
description_html: >
<p>
<a href="http://www.softberry.com/berry.phtml?group=help&subgroup=pipelines&topic=fgenesh_plus_plus" target="_blank"> Fgenesh++ </a> is a bioinformatics pipeline for automatic prediction of genes in eukaryotic genomes. It is not currently installed in Galaxy Australia, but the Australian BioCommons and partners have licensed the software and made it available via the command line. Australian researchers can apply for access through the Australian BioCommons.
</p>
button_html: Apply
button_link: https://www.biocommons.org.au/fgenesh-plus-plus
button_tip: Apply for access to Fgenesh++
- title_html: Can I use Apollo to share and edit the annotated genome?
description_html: >
<p>
Apollo is a web-browser-accessible system that lets you conduct real-time collaborative curation and editing of genome annotations.
</p>
<p>
The Australian BioCommons and our partners at QCIF and Pawsey provide a hosted <a href="https://apollo-portal.genome.edu.au/" target="_blank"> Apollo Portal service </a> where your genome assembly and supporting evidence files can be hosted. All system administration is taken care of, so you and your team can focus on the annotation curation itself.
</p>
<p>
This <a href="https://training.galaxyproject.org/training-material/topics/genome-annotation/tutorials/apollo-euk/tutorial.html" target="_blank"> Galaxy tutorial </a> provides a complete walkthrough of the process of refining eukaryotic genome annotations with Apollo.
</p>
button_html: More info
button_link: https://support.biocommons.org.au/support/solutions/articles/6000244843-apollo-for-collaborative-curation-and-editing
- title_html: Tutorials
description_html: >
<p class="lead">
Genome annotation with Maker
</p>
<p>
Genome annotation of eukaryotes is a little more complicated than for prokaryotes: eukaryotic genomes are usually larger than prokaryotic ones, with more genes. The sequences determining the beginning and the end of a gene are generally less conserved than in prokaryotes. Many genes also contain introns, and the limits of these introns (acceptor and donor sites) are not highly conserved. This <a href="https://training.galaxyproject.org/training-material/topics/genome-annotation/tutorials/annotation-with-maker/tutorial.html" target="_blank"> Galaxy tutorial </a> uses MAKER to annotate the genome of a small eukaryote: <em>Schizosaccharomyces pombe</em> (a yeast).
</p>
<hr>
<p class="lead">
Genome annotation with Funannotate
</p>
<p>
This <a href="https://training.galaxyproject.org/training-material/topics/genome-annotation/tutorials/funannotate/tutorial.html" target="_blank"> Galaxy tutorial </a> provides a complete walkthrough of the process of annotation with Funannotate, including the preparation of RNAseq data, structural annotation, functional annotation, visualisation, and comparing annotations.
</p>
- title_html: Galaxy Australia support
description_html: >
<p>
Any user of Galaxy Australia can request support through an online form.
</p>
button_html: Request support
button_link: /request/support