Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

mgatk tenx endless runtime #98

Open
AdrianParrilla opened this issue Aug 16, 2024 · 3 comments
Open

mgatk tenx endless runtime #98

AdrianParrilla opened this issue Aug 16, 2024 · 3 comments
Assignees
Labels

Comments

@AdrianParrilla
Copy link

Hi,
I was trying to process a bam file coming from a scATAC sequencing, but after waiting for hours the process seems to be stuck at the beginning (see image attached). The input bam file is 33 Gb and has around 500,000,000 reads in total (246,780 reads per cell) and 11% of the reads covering the mt DNA.

mgatk issue

The command I used was:

mgatk tenx -i /mnt/smb/TDA/scATAC_files/results/test_scARC_possorted_downsampled_bam.bam -n test_scARC -o test_scARC_mgatk -c 60 -bt CB -b /mnt/smb/TDA/scATAC_files/results/barcodes.tsv -g /mnt/smb/TDA/scATAC_files/masked_genome.fa --keep-temp-files

After 2 hours of running, the only output I got was empty folders:

mgatk issue 2

Why is the process stuck at the first step? Is the bam file too big?

Thanks in advance for your help,

@caleblareau
Copy link
Owner

caleblareau commented Aug 16, 2024 via email

@AdrianParrilla
Copy link
Author

Hi, thanks for your reply. I tried that but I got an error saying: "User specified mitochondrial genome does NOT match .bam file; correctly specify reference genome or .fasta file". I tried also to extract the mitochondrial genome from our masked_genome.fa and input it as custom with --mito-genome, but I get the same error.
Does the sample bam file need to be only the mitocondrial chromosome?

@caleblareau
Copy link
Owner

The bam file can have more than just the mitochondrial chromosome contig.

The issue is probably that the text following the > should match exactly the chromosome name (e.g., >chrM).

What is the contig name in the bam file and does cat <your fasta file> | grep ">" yield?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants