-
Notifications
You must be signed in to change notification settings - Fork 95
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
just align (not global alignment) PE reads (without FLASH) to the amplicon sequence and quantify indel rate? #120
Comments
BTW, I think crispresso can take bam as input, instead of fastq, but looking at the code, I think what the program really does is extracting the reads from the bam file and then do a global alignment again, right? |
Hi Yichao, Thanks for using CRISPResso -- we've been thinking about this as well, and recognize that flash is a potential source of errors. You are correct that CRISPResso realigns every read even if provided a bam input. Our biologically-informed alignment is our special sauce that makes the quantification accurate, and we like to think it's a lot better than bwa or another aligner. I think it's probably best to do trimming before CRISPResso or to use trimmomatic or the filtering within crispresso to get rid of the AAAAA or TTTTTT sequences as well as sequences that are the wrong length. As we work on the flash-less alignment, do you have a case of reads and a reference that are particularly challenging? Just to make sure that there is nothing we are missing? |
Thanks for the quick reply! I don't have a very challenging dataset, I can just solve my problems by some data cleaning. Looking forward to this FLASH-less version! Thanks, |
Hi Kendell, |
Try the parameter |
Thank you, Kendell, I tried running with this parameter, but it returned error with the following: Also, regarding the "force_merge_pairs" - I would imaging using this option or relaxing "min_pair_end_reads_overlap" to <10 could introduce some artefacts. Would you know what type of issues to watch out for? Thank you! |
Sorry -- I just pushed a fix for the force_merge_pairs error. You can either wait for a next release in a week or two, or install from source. The artifacts would be that if reads were merged, there could be deletions or insertions at the merge site. For example, for:
In this case, there is an apparent deletion that is induced by the merging. |
Is your feature request related to a problem? Please describe.
I think it's quite common that FLASH can't really merge the PE reads, For example:
Describe the solution you'd like
Just use BWA to align PE reads to the amplicon sequence and extract indel frequency?
Thanks,
Yichao
The text was updated successfully, but these errors were encountered: