Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

abPOA puts big gap in middle of sequence #36

Open
glennhickey opened this issue Mar 23, 2022 · 2 comments
Open

abPOA puts big gap in middle of sequence #36

glennhickey opened this issue Mar 23, 2022 · 2 comments

Comments

@glennhickey
Copy link
Contributor

I'm aligning a short sequence to a very long one, and they look like

ATATA
ATATATATATA<...>A

When I align them with abpoa, it puts the gap in the middle

ATAT------------<...>A
ATATATATATA<...>A

But I'd much rather have an alignment like

ATATA-----------<...>
ATATATATATA<....>

I can understand that, strictly speaking, with the global alignment scoring model, there isn't much difference between these two scenarios, in practice I think the second one is always preferred. Do you think it would be possible to add some way to left shift or favour gaps at the ends in global alignment mode?

Thanks, as always.

to reproduce

wget http://public.gi.ucsc.edu/~hickey/debug/abpoa_fail_mar23.fa
abpoa abpoa_fail_mar23.fa -m 0 -r 1
yangao07 pushed a commit that referenced this issue Mar 28, 2022
@yangao07
Copy link
Owner

A quick fix has just been pushed. Try it out @glennhickey

@glennhickey
Copy link
Contributor Author

Thanks!! this does indeed solve some of the simpler cases I've found. Will test it on a larger dataset now.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants