Mapping rate decreasing with time #172

cgroza · 2023-04-19T01:22:51Z

Hello,

I have a question about something I noticed.
Initially, wfmash aligns very fast, but then the number of aligned bp per second slowly declines.
This continues until it is almost done, with the last 10% taking the most time.
I assume this happens because easy to align sequences are dispatched fast, until all the working threads are occupied by difficult to align sequences that take a long time (low complexity sequence)?

Is there a way to make wfmash spend less time on such sequences and output a rougher alignment and save time?

AndreaGuarracino · 2023-06-14T11:06:46Z

Hi @cgroza, I am getting similar problems with harder species to align like potato, primates, etc... I am working on reducing such a problem.

However, do you think you can share just a few sequence pairs that are the slowest to be aligned for you? I would like to verify that the high runtime is due to the same reasons I've seen in our tests.

AndreaGuarracino · 2023-06-14T11:07:43Z

Ah, I am assuming you meant alignment rate and not mapping rate.

cgroza · 2023-06-14T19:32:05Z

Yes that's correct.
In my case, these were primate genomes assembled with nanopore and polished with short reads.
I suspect the culprits are low complexity sequences.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Mapping rate decreasing with time #172

Mapping rate decreasing with time #172

cgroza commented Apr 19, 2023

AndreaGuarracino commented Jun 14, 2023

AndreaGuarracino commented Jun 14, 2023

cgroza commented Jun 14, 2023

Mapping rate decreasing with time #172

Mapping rate decreasing with time #172

Comments

cgroza commented Apr 19, 2023

AndreaGuarracino commented Jun 14, 2023

AndreaGuarracino commented Jun 14, 2023

cgroza commented Jun 14, 2023