Add a new iterator to skip "bad" bases without canonicalization #40
Comments
Hi Antônio! You're welcome and thanks for opening this issue. I think you're right that there's a gap between Perhaps a |
Hi @bovee Yes, I think a I'm unsure whether |
I like the idea of an |
Hi Roderick!
I'm developing a k-mer counting Python package for internal use, with needletail as the backend. While developing it, I noticed that `Kmers` and `CanonicalKmers` are inconsistent regarding non-ATCG characters: `Kmers` counts them, while `CanonicalKmers` skips them (understandably so).

Because of that, my function only uses `CanonicalKmers`, even when counting non-canonical k-mers (I just reverse complement the sequence if the `canonical` boolean is true), which causes additional computational burden.

I don't know if this decision was made by design, but maybe `Kmers` should include an argument that allows the user to choose whether non-ATCG characters should be ignored.

Thank you for all your work in needletail!
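
To make the requested behaviour concrete, here is a minimal pure-Python sketch of an iterator that skips any k-mer containing a non-ACGT character without canonicalizing the rest. It is not needletail's implementation or API: the function name `valid_kmers`, its signature, and the `VALID_BASES` set are hypothetical and exist only for illustration.

```python
from typing import Iterator, Tuple

# Assumption: only A, C, G and T (either case) count as "good" bases.
VALID_BASES = frozenset("ACGTacgt")


def valid_kmers(seq: str, k: int) -> Iterator[Tuple[int, str]]:
    """Yield (start, k-mer) for each window of length k made only of valid bases.

    K-mers are returned as they appear in `seq`; no reverse complement
    or canonicalization is applied.
    """
    if k <= 0:
        raise ValueError("k must be positive")
    run = 0  # length of the current run of valid bases ending at index i
    for i, base in enumerate(seq):
        # A "bad" base resets the run, so every window overlapping it is skipped.
        run = run + 1 if base in VALID_BASES else 0
        if run >= k:
            start = i - k + 1
            yield start, seq[start:i + 1]
```

For example, `list(valid_kmers("ACGNTTA", 3))` gives `[(0, "ACG"), (4, "TTA")]`: every window overlapping the `N` is dropped, and `TTA` is reported as-is rather than as its canonical form.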