GenPipes is a flexible Python-based framework that facilitates the development and deployment of multi-step workflows optimized for High-Performance Computing clusters and the cloud. GenPipes comes with 13 validated and scalable pipelines for various genomics applications:
- Amplicon Sequencing Pipeline
- ChIP Sequencing Pipeline
- CoV Sequencing Pipeline
- DNA Sequencing Pipeline
- DNA Sequencing High Coverage Pipeline
- HiC Sequencing Pipeline
- Illumina Run Processing Pipeline
- Methylation Sequencing Pipeline
- Nanopore Pipeline
- RNA Sequencing Pipeline
- De-Novo RNA Sequencing Pipeline
- RNA Sequencing Light Pipeline
- Tumor Pair Sequencing Pipeline
This proof of concept pipeline is implemented in the following branch.
To run this proof of concept, clone the repository and switch to the appropriate branch. The location of the cloned directory will be referred to as $GENPIPES in the following instructions.
- From the cloned location, launch the pipeline using the attached readset.txt file:
python $GENPIPES/pipelines/goeke_rnaseq.py -c $GENPIPES/pipelines/goeke_rnaseq.slurm.ini --steps 1-3 --output pipeline_output --readsets readset.txt --job-scheduler slurm > run_commands.sh
- To submit jobs to the scheduler just run the following command:
bash run_commands.sh
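The launch command above can also be assembled from a script, which helps when the same run is repeated with different readset files or output directories. The following is a minimal sketch, not part of GenPipes itself: the helper name `build_genpipes_command` and the placeholder path `/path/to/genpipes` are hypothetical, while the script name and flags are copied from the command shown above.

```python
import os
import shlex


def build_genpipes_command(genpipes_dir, readsets="readset.txt",
                           output_dir="pipeline_output", steps="1-3",
                           scheduler="slurm"):
    """Return the launch command from this README as a list of arguments.

    Hypothetical helper: only the script name and flags come from the README.
    """
    pipelines = os.path.join(genpipes_dir, "pipelines")
    return [
        "python",
        os.path.join(pipelines, "goeke_rnaseq.py"),
        "-c", os.path.join(pipelines, "goeke_rnaseq.slurm.ini"),
        "--steps", steps,
        "--output", output_dir,
        "--readsets", readsets,
        "--job-scheduler", scheduler,
    ]


# "/path/to/genpipes" is a placeholder for the cloned directory ($GENPIPES).
command = build_genpipes_command("/path/to/genpipes")

# Print a shell-ready line; redirect it to run_commands.sh as in the README.
print(" ".join(shlex.quote(part) for part in command) + " > run_commands.sh")
```

The printed line can be pasted into a shell. Running it writes the job submission commands to run_commands.sh, which is then executed with bash as described above.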
Notes:
- This proof of concept pipeline requires decompressed FASTQ files to work properly.
- When running on Compute Canada or another supported GenPipes cluster, the pipeline loads software using Linux modules. When running from any other location, use the containerized version of GenPipes.
- GenPipes currently runs on Python 2.7. A Python 3 release is coming soon.
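Because the pipeline expects decompressed FASTQ files, gzip-compressed inputs need to be expanded before they are listed in the readset file. The following is a minimal sketch of one way to do that with the Python 3 standard library. It is a standalone helper, independent of GenPipes and its Python 2.7 requirement, and the function name `decompress_fastq` and the example file name are hypothetical.

```python
import gzip
import shutil
from pathlib import Path


def decompress_fastq(gz_path):
    """Decompress a .gz FASTQ file next to the original and return the new path.

    Files that do not end in .gz are assumed to be decompressed already
    and are returned unchanged.
    """
    gz_path = Path(gz_path)
    if gz_path.suffix != ".gz":
        return gz_path

    # sample.fastq.gz -> sample.fastq
    out_path = gz_path.with_suffix("")

    # Stream the data so large FASTQ files are not loaded into memory at once.
    with gzip.open(gz_path, "rb") as src, open(out_path, "wb") as dst:
        shutil.copyfileobj(src, dst)
    return out_path


if __name__ == "__main__":
    # Hypothetical file name; replace with the FASTQ files for your readsets.
    example = Path("sample_R1.fastq.gz")
    if example.exists():
        print(decompress_fastq(example))
```

The compressed original is left in place, so the paths in the readset file should point to the decompressed copies.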
This pipeline is a minimal example of a Salmon-based pipeline using GenPipes. We welcome contributions to the documentation and workflow. Please create an issue or submit a pull request!
Bourgey M, Dali R, Eveleigh R, Chen KC, Letourneau L, Fillon J, Michaud M, Caron M, Sandoval J, Lefebvre F, Leveque G, Mercier E, Bujold D, Marquis P, Van PT, Anderson de Lima Morais D, Tremblay J, Shao X, Henrion E, Gonzalez E, Quirion PO, Caron B, Bourque G. GenPipes: an open-source framework for distributed and scalable genomic analyses. Gigascience. 2019 Jun 1;8(6):giz037. doi: 10.1093/gigascience/giz037. PMID: 31185495; PMCID: PMC6559338.