Workflows
What is a Workflow?Filters
This instance of the HANAMI Reproducibility Workflow executed 22 members of the NICAM-DC model running the ICOMEX_JW test with 10 MPI ranks.
Its purpose was to test the ability of Flux in handling the execution of multiple concurrent MPI processes with a climate model in two nodes of MareNostrum ...
Type: Autosubmit
Creator: Manuel Giménez de Castro Marciani
Submitter: Manuel Giménez de Castro Marciani
Workflow for short paired end reads quality control, trimming and filtering. Multiple paired datasets will be merged into single paired dataset. Summary:
- Sequali QC on raw data files
- fastp for read quality trimming
- BBduk for phiX and rRNA filtering (optional)
- Filter human reads using Hostile (optional)
- Custom read filtering using Hostile (optional)
- Sequali QC on filtered (merged) data
Other UNLOCK workflows on WorkflowHub: https://workflowhub.eu/projects/16/workflows?view=default ...
Type: Common Workflow Language
Creators: Bart Nijsse, Jasper Koehorst, Changlin Ke
Submitter: Bart Nijsse
3D genome builder (3DGB)
3D genome builder (3DGB) is a workflow to build 3D models of genomes from HiC raw data and to integrate omics data on the produced models for further visual exploration. 3DGB bundles HiC-Pro, PASTIS and custom Python scripts into a unified Snakemake workflow with limited inputs (see Preparing Required Files). 3DGB produces ...
ENA Reads & Assembly Submission Workflow
Originally developed within the EVORA project, this two-step Galaxy workflow streamlines submissions to the European Nucleotide Archive (ENA). The workflow first submits raw sequencing reads via the Galaxy ENA upload tool, then submits assembled sequences using the Galaxy ENA Webin CLI tool. The process is fully interactive and GUI-driven while retaining ENA’s required validations ...
📄 Generalizable machine learning models for rapid antimicrobial resistance prediction in unseen healthcare settings
This repository contains the code used for the experiments in the paper:
Generalizable machine learning models for rapid antimicrobial resistance prediction in unseen healthcare settings by Diane Duroux, Paul P. Meyer, Giovanni Visonà, and Niko Beerenwinkel.
⚙️ Install the dependencies
You can set up the project with either pip or uv.
Option A - pip:
Install the ...
The bioinformatic workflow presented here enables the analysis of RNA sequencing data obtained from human reproductive tissues in unexplained recurrent pregnancy loss (uRPL) research. This pipeline requires a sample sheet containing the sample information (example_input_data.csv) and gene expression matrices generated using the Salmon tool in the nf-core/rnaseq bioinformatics pipeline (example_count_data.csv). For more information on how to use the nf-core/rnaseq pipeline including the required ...
Workflow for Metagenomics binning from assembly.
Minimal inputs are: Identifier, assembly (fasta) and an associated sorted BAM file
Summary
- MetaBAT2 (binning)
- MaxBin2 (binning)
- SemiBin2 (binning)
- Binette (bin merging)
- EukRep (eukaryotic classification)
- CheckM2 (bin completeness and contamination)
- BUSCO (bin completeness)
- GTDB-Tk (bin taxonomic classification)
- CoverM (bin abundances)
Including: Bin annotation (workflow: https://workflowhub.eu/workflows/1170):
- Bakta
...
Type: Common Workflow Language
Creators: Jasper Koehorst, Bart Nijsse
Submitters: Jasper Koehorst, Bart Nijsse
MAGNETO is an automated snakemake workflow dedicated to MAG (Metagenome-Assembled Genomes) reconstruction from metagenomic data.
It includes a fully-automated coassembly step informed by optimal clustering of metagenomic distances, and implements complementary genome binning strategies, for improving MAG recovery.
Key Features
- Quality Control (QC): Automatically assesses the quality and the contamination of input reads, ensuring that low-quality data are filtered out to improve downstream ...
Type: Snakemake
Creators: Samuel Chaffron, Audrey Bihouee, Benjamin Churcheward, Maxime Millet, Guillaume Fertin, Hugo Lefeuvre
Submitter: Hugo Lefeuvre
This project is an analysis pipeline using Snakemake for RNAseq analysis in order to find differentially expressed genes. It has been widely tested on human RNA sequencing from an Illumina HiSeq but should work on most systems and many other species, provided the necessary resource files can be downloaded.
Description
This pipeline is set for paired-end data only from Illumina HiSeq output files.
The main steps of the pipeline are:
- optionnal cleaning data with ...
Soil Metagenome Pipeline
Soil Metagenome Pipeline is a modular, Nextflow DSL2 workflow for assembling, polishing, binning, annotating, and functionally characterizing complex soil metagenomes. It orchestrates state-of-the-art tools for long- and short-read metagenomics, generates high-quality MAGs, assigns taxonomy, and screens for biosynthetic gene clusters (BGCs).
What it does
- Assembles long-read metagenomes (e.g., ONT) with Flye and optionally polishes with Medaka and/or NextPolish using ...