Group Nextflow Assignment

Assignment Description

Produce a working Hi-C nextflow script based off of the NF-core Hi-C pipeline steps and also include a singularity container for reproducibility. https://github.com/nf-core/hic

Note: all code is strongly based off of the nf-core Hi-C script linked

Description of files for runnning Hi-C

1. Nextflow Script

'hi-c.nf' Runs Hi-C analysis in conjunction with Singularity container. Sample input FASTQ files present for test run.

Nextflow Dependency Files

1.1 FASTQ Input Samples

For use in the Hi-C pipeline to produce an output Test dataset reference source: https://github.com/nf-core/test-datasets/tree/hic

1.2 Alignment scripts

align.nf -Run “align.nf” to do Step 1 of the pipeline (mapping using a two steps strategy to rescue reads spanning the ligation sites).

1.3 Python Merging Script

mergeSAM.py (any other required python scripts) -"mergeSAM.py" merges the SAM files (which have undergone the 2-step alignment) into one paired-end BAM file – the final output of Step 1. This script and others located here: https://github.com/nf-core/hic/blob/master/bin.

1.4 NF Step one Output File Dependency

Final output file of step 1 is ${sample}_bwt2pairs.bam (SRR4292758_00_bwt2pairs.bam).

2. Singularity Container Directories and Descriptions

Base availability: Dockerfile and environment YAML: https://github.com/nf-core/hic

Modified: Index files placed in '/results/index' directory, all other files are placed in '/results/align' directory. '/data' directory: contains the raw fastq.gz files (SRR4292758_00_R1.fastq.gz, SRR4292758_00_R2.fastq.gz) '/reference' directory: contains the reference genome (W303_SGD_2015_JRIU00000000.fsa). N.B. - reference used to be .fsa.txt

Name		Name	Last commit message	Last commit date
Latest commit History 42 Commits
Group 1 Nextflow Script.pdf		Group 1 Nextflow Script.pdf
README.md		README.md
SRR4292758_00_bwt2pairs.bam		SRR4292758_00_bwt2pairs.bam
align.nf		align.nf
align_no_trim.nf		align_no_trim.nf
hi-c.nf		hi-c.nf
mergeSAM.py		mergeSAM.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Group Nextflow Assignment

Assignment Description

Description of files for runnning Hi-C

1. Nextflow Script

Nextflow Dependency Files

1.1 FASTQ Input Samples

1.2 Alignment scripts

1.3 Python Merging Script

1.4 NF Step one Output File Dependency

2. Singularity Container Directories and Descriptions

About

Releases

Packages

Contributors 2

Languages

gavinf97/groupnextflow

Folders and files

Latest commit

History

Repository files navigation

Group Nextflow Assignment

Assignment Description

Description of files for runnning Hi-C

1. Nextflow Script

Nextflow Dependency Files

1.1 FASTQ Input Samples

1.2 Alignment scripts

1.3 Python Merging Script

1.4 NF Step one Output File Dependency

2. Singularity Container Directories and Descriptions

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages