This repository contains information for the EMBL-ABR webinar on "Snakemake: Repoducible and Scalable Bioifnormatic Workflows" presented by Nathan Watson-Haigh on 7th Feb 2019.
Abstract: Recent years have seen a groundswell of support in the bioscience community for improved reproducibility of data analyses. One solution used by bioinformaticians to achieve this is the use of the Snakemake workflow management system. It is a tool for creating reproducible and scalable data analyses.
This 1-hour webinar will take a look into Snakemake, cover the core concepts and examples required for making your first steps into reproducible Snakemake workflows. Building on this foundation, Nathan will introduce more advanced features which enable workflows to scale seamlessly to server, cluster, grid and cloud environments as well as different software execution environments.
Date/Time: Thursday 07 February 2019 12:30-13:30 ACDT
Registration Page: https://attendee.gotowebinar.com/register/7076869548376639747
Presenter: Nathan Watson-Haigh
Presentation: PowerPoint Show
Video Link(s): EMBL-ABR YouTube Channel
For the purpose of demonstrating a Snakemake workflow in reasonable time, we will be working on a subset of public WGS data from wheat. Specifically, we will be looking at data from a small (58Kbp) region on the long arm of chromosome 4A (chr4A_part2:235500000-235558000). This region contains 2 genes affected by a deletion which is present in some wheat accessions:
DAWN URL: chr4A_part2:235500000-235558000