Workflows
What is a Workflow?Filters
Assembly polishing; can run alone or as part of a combined workflow for large genome assembly.
- What it does: Polishes (corrects) an assembly, using long reads (with the tools Racon and Medaka) and short reads (with the tool Racon). (Note: medaka is only for nanopore reads, not PacBio reads).
- Inputs: assembly to be polished: assembly.fasta; long reads - the same set used in the assembly (e.g. may be raw or filtered) fastq.gz format; short reads, R1 only, in fastq.gz format
- Outputs: ...
Fgenesh Annotation - TSI Workflow Description
Overview
One of a series of workflows to annotate a genome, tagged TSI-annotation. Based on command-line code by Luke Silver, converted into Galaxy Australia workflows.
Workflow Sequence
Run in this order:
- Repeat masking
- RNAseq QC and read trimming
- Find transcripts
- Combine transcripts
- Extract transcripts
- Convert formats
- Fgenesh annotation (this workflow)
Inputs Required
Files uploaded by the user:
assembled_genome.fasta...
Post-genome assembly quality control workflow using Quast, BUSCO, Meryl, Merqury and Fasta Statistics, with updates November 2024.
Workflow inputs: reads as fastqsanger.gz (not fastq.gz), and primary assembly.fasta. (To change reads format: click on the pencil icon next to the file in the Galaxy history, then "Datatypes", then set "New type" as fastqsanger.gz). Note: the reads should be those that were used for the assembly (i.e., the filtered/cleaned reads), not the raw reads.
What it does: ...
Type: Galaxy
Creators: Kate Farquharson, Gareth Price, Simon Tang, Anna Syme
Submitters: Johan Gustafsson, Anna Syme
ORBiT
A Nextflow workflow for analysing Oxford Nanopore Technologies (ONT) RNAseq direct read sequening (DRS) or cDNA data.
This workflow emphasises sensitivity to detect rare and novel features within the data. Multiple aspects of this workflow are tailored to enhance sensitivity:
- Alignment to reference genome rather than transcriptome
- Multiple tools per analysis type (n = 2 isoforms, n = 3 fusions)
- Reads quantification tools capable of detecting novel isoforms, and counting at the isoform ...
Type: Nextflow
Creators: Cali Willet, Amarinder Thind, Michael Geaghan, Mitchell O'Brien, Madison Gonebale, Marina Kennerson
Submitter: Georgina Samaha
This Workflow transcribes a video or audio with multiple speakers. After transcription, it allocates the names of the speakers and groups and cleans passages from the two main speakers for further analysis.
Associated Tutorial
This workflows is part of the tutorial Transcribing Audio and Video files with Automated Speech Recognition, available in the ...
EXCON (v2.3.1)
A Nextflow pipeline for gene family EXpansion and CONtraction analysis across multiple species using CAFE5.
Given a set of genome assemblies and annotations, EXCON builds orthogroups with OrthoFinder, fits and compares multiple CAFE models to identify gene families evolving at significantly different rates, and automatically selects the best-fitting model for downstream analysis. Optionally, GO enrichment analysis can be run on expanded and contracted gene families, and ...
Using:
- vadr annotation (model to select)
- vardict variant caller
- coverage depth
Provides summarizing files:
- png image of variant calling with annotations and coverage depths
- tsv file with all information of significant variants only
- vcf file with all information of significant variants only (to allow downstream NextStrain analyses)
Type: Galaxy
Creators: Fabrice Touzain, This study was founded by the French National Research Agency and by Santé publique France as part of the project "EMERGEN". Anses Ploufragan research was also supported by Agglomération de Saint-Brieuc, Département des Côtes d'Armor and Région Bretagne
Submitter: Fabrice Touzain
Using:
- vadr annotation (model to select)
- vardict variant caller
- coverage depth
Provides summarizing files:
- png image of variant calling with annotations and coverage depths
- tsv file with all information of significant variants only
- vcf file with all information of significant variants only (to allow downstream NextStrain analyses)
Type: Galaxy
Creators: Fabrice Touzain, This study was founded by the French National Research Agency and by Santé publique France as part of the project "EMERGEN". Anses Ploufragan research was also supported by Agglomération de Saint-Brieuc, Département des Côtes d'Armor and Région Bretagne
Submitter: Fabrice Touzain
Using:
- vadr annotation (model to select)
- vardict variant caller
- coverage depth
Provides summarizing files:
- png image of variant calling with annotations and coverage depths
- tsv file with all information of significant variants only
- vcf file with all information of significant variants only (to allow downstream NextStrain analyses)
Type: Galaxy
Creators: Fabrice Touzain, This study was founded by the French National Research Agency and by Santé publique France as part of the project "EMERGEN". Anses Ploufragan research was also supported by Agglomération de Saint-Brieuc, Département des Côtes d'Armor and Région Bretagne
Submitter: Fabrice Touzain
Using:
- vadr annotation (model to select)
- vardict variant caller
- coverage depth
Provides summarizing files:
- png image of variant calling with annotations and coverage depths
- tsv file with all information of significant variants only
- vcf file with all information of significant variants only (to allow downstream NextStrain analyses)
Type: Galaxy
Creators: Fabrice Touzain, This study was founded by the French National Research Agency and by Santé publique France as part of the project "EMERGEN". Anses Ploufragan research was also supported by Agglomération de Saint-Brieuc, Département des Côtes d'Armor and Région Bretagne
Submitter: Fabrice Touzain