OptiType fails sometimes with BAM not found #419
So I don't think it's related to getting BAMs as input, since it's not following that code path. Rather, it appears to do this: https://github.com/FRED-2/OptiType/blob/master/OptiTypePipeline.py#L286 Then: https://github.com/FRED-2/OptiType/blob/master/OptiTypePipeline.py#L294 Then, I think it fails at: https://github.com/FRED-2/OptiType/blob/master/OptiTypePipeline.py#L298
For whatever reason, it looks like https://github.com/FRED-2/OptiType/blob/master/OptiTypePipeline.py#L288 didn't result in a BAM being created? I also see that the BAMs get removed when done, which explains why the other successful runs don't have BAMs there.
This is a duplicate of what @armish hit in RCC: https://github.com/hammerlab/rcc-analyses/issues/104 Leaving it open since this is the more general repo.
Tried running this manually in the VM. Some more information:
Seems like an OOM situation. At first glance, the 10 that failed were relatively large.
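For what it's worth, a wrapper along these lines might make this kind of failure show up at the mapping step instead of later as "BAM not found". This is only a rough sketch, not what OptiType or Biokepi actually do: `run_and_check_output` is a made-up helper name, and it assumes a POSIX system, where a negative return code means the child process was killed by a signal (which is what the kernel OOM killer does with SIGKILL).

```python
import os
import signal
import subprocess


def run_and_check_output(cmd, expected_output):
    """Run cmd (a list of arguments) and verify that it produced expected_output.

    Hypothetical helper for illustration; not part of OptiType.
    """
    result = subprocess.run(cmd)
    if result.returncode < 0:
        # On POSIX, a negative return code is the negated signal number.
        sig = signal.Signals(-result.returncode).name
        raise RuntimeError(f"{cmd[0]} was killed by {sig}; possibly out of memory")
    if result.returncode != 0:
        raise RuntimeError(f"{cmd[0]} exited with status {result.returncode}")
    # A clean exit is not enough: also require a non-empty output file.
    if not os.path.isfile(expected_output) or os.path.getsize(expected_output) == 0:
        raise RuntimeError(
            f"{cmd[0]} exited cleanly but {expected_output} is missing or empty"
        )
    return expected_output
```

The last check is there because a zero exit status alone does not prove that the mapper wrote its BAM.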
You could try remaking the cluster with bigger nodes? Or is there an argument you can pass to OptiType that tells it to use all 52GB of the default nodes?
@ihodes I'm first trying manually on a beefed-up node. If that works, how do I remake the cluster with bigger nodes?
I'm not sure, to be honest; you might be able to change it from the GCloud GKE interface, or you could take down the cluster you have and start a new one with a different node type… @smondet do you know?
@ihodes I've never tried to change the machine type "live". The machine-type is an option of
Confirmed that this is a memory issue: when running the same commands manually with 30GB vs. 120GB of memory, it fails on the former and succeeds on the latter.
Do we know if we can filter reads to the MHC locus and save a lot of space? If so, we should add this filtering step to the pipeline in Biokepi.
Fair enough; I wonder if we could use BWA-mem to do this filtering instead?
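To make the BWA-mem idea concrete, here is a rough sketch of what such a pre-filter could look like. Nothing here is tested against OptiType: `mhc_prefilter_command` is a hypothetical helper, the reference path is whatever HLA FASTA you choose (it is assumed to be indexed with `bwa index` already), and the input is assumed to be single-end reads. It only builds the shell pipeline: `bwa mem` aligns, `samtools view -F 4` drops unmapped reads, and `samtools fastq` converts the survivors back to FASTQ.

```python
import shlex


def mhc_prefilter_command(reference_fasta, fastq_in, fastq_out, threads=4):
    """Build a shell pipeline that keeps only reads aligning to reference_fasta.

    Hypothetical helper; assumes single-end reads and a bwa-indexed reference.
    """
    ref, fq_in, fq_out = (
        shlex.quote(p) for p in (reference_fasta, fastq_in, fastq_out)
    )
    return (
        f"bwa mem -t {int(threads)} {ref} {fq_in}"
        " | samtools view -b -F 4 -"  # -F 4 excludes unmapped reads
        f" | samtools fastq - > {fq_out}"
    )


if __name__ == "__main__":
    # Placeholder file names, for illustration only.
    print(mhc_prefilter_command("hla_reference.fasta", "sample.fastq", "sample.hla.fastq"))
```

Whether typing results on BWA-filtered reads match the RazerS3-based ones would need to be checked before relying on this.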
@ihodes probably, though it's not OptiType's recommendation:
Per @smondet's instructions, I ran on larger cluster nodes as follows:
Replaced my And then:
Confirmed that my GCP instance group had the right node type. Then re-ran my jobs. We'll see if that works!
Success!
Spoke too soon. 1 out of the 9 remaining jobs still failed with the same error :(.
Is that a larger FASTQ than the others, by any chance?
@ihodes it's 81GB, which I didn't think was particularly large compared to the others, but I could be misremembering.
@ihodes I was wrong; it is the largest one. Sigh. At least the problem is clear, but I'm becoming more convinced by your suggestion to filter using something other than RazerS3.
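Since the failures seem to track input size, a quick way to stop guessing is to rank the FASTQs by size before submitting and send the biggest ones to larger nodes. A minimal sketch, assuming the files are on a local filesystem (`fastqs_by_size` is just an illustrative name, and on-disk size is only a proxy for the mapper's memory use):

```python
import os


def fastqs_by_size(paths):
    """Return (path, size_in_GB) pairs, largest first."""
    sized = [(p, os.path.getsize(p) / 1e9) for p in paths]
    return sorted(sized, key=lambda item: item[1], reverse=True)
```

Calling it on the list of input paths and looking at the first few entries shows which samples are most likely to hit the memory limit.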
It may be the only way forward… or you switch to 250+GB machines for extremely expensive runs. hammerlab/coclobas#19 will also help with degenerate cases like these in the future.
@ihodes yeah, I already kicked off a 208GB machine run. Let's see if that works.
It worked!
I've been experiencing the same error that @tavinathanson described here with a set of files I'm working with, but it doesn't appear to be a memory issue: requesting a machine with more memory does not eliminate the problem, and I've been able to run OptiType without error on larger FASTQ files from a different dataset. Further, when I run the RazerS3 command on the command line, it doesn't return an error, but it still doesn't produce a BAM file. I'm at a bit of a loss for what to do. Any ideas as to what the problem may be?
@maryawood: unfortunately this still sounds like a memory issue or something related to it. Depending on the depth/coverage of your sequencing data, the memory requirements for RazerS3 can go through the roof, and since this comes from the way RazerS3 keeps the data in memory, there is very little you can do. I have been experimenting with different approaches and I found that using Here is the modified pipeline:
@armish thanks so much for the suggestion! I will give this a try.
Trying with @armish's setup (since mine didn't work; see #418), I get some of these, which I believe are issues with OptiType itself:
Digging a little deeper, I noticed: