Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Mapping rate decreasing with time #172

Open
cgroza opened this issue Apr 19, 2023 · 3 comments
Open

Mapping rate decreasing with time #172

cgroza opened this issue Apr 19, 2023 · 3 comments

Comments

@cgroza
Copy link

cgroza commented Apr 19, 2023

Hello,

I have a question about something I noticed.
Initially, wfmash aligns very fast, but then the number of aligned bp per second slowly declines.
This continues until it is almost done, with the last 10% taking the most time.
I assume this happens because easy to align sequences are dispatched fast, until all the working threads are occupied by difficult to align sequences that take a long time (low complexity sequence)?

Is there a way to make wfmash spend less time on such sequences and output a rougher alignment and save time?

@AndreaGuarracino
Copy link
Member

Hi @cgroza, I am getting similar problems with harder species to align like potato, primates, etc... I am working on reducing such a problem.

However, do you think you can share just a few sequence pairs that are the slowest to be aligned for you? I would like to verify that the high runtime is due to the same reasons I've seen in our tests.

@AndreaGuarracino
Copy link
Member

Ah, I am assuming you meant alignment rate and not mapping rate.

@cgroza
Copy link
Author

cgroza commented Jun 14, 2023

Yes that's correct.
In my case, these were primate genomes assembled with nanopore and polished with short reads.
I suspect the culprits are low complexity sequences.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants