Replies: 5 comments
-
CRAM is a good idea, thank you. Compression of pairs is fully supported. Simply giving the output name a .gz extension will compress the file (and .gz input files are automatically decompressed).
-
Actually, since we now use pysam for parsing alignment files, possibly CRAM would just work already? Have you tried? @agalitsyna I think you implemented the pysam parsing code, do you know whether CRAM files would also be parsed?
-
Hi! Indeed, outputs are automatically compressed, based on the extension. You could also output into stdout and pipe into a compression tool of your choice!
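To illustrate the idea of choosing compression from the file extension, here is a minimal Python sketch. It is not the pairtools implementation; `open_output` is a hypothetical helper, and it assumes "-" is used to mean stdout.

```python
import gzip
import sys


def open_output(path):
    """Return a text-mode handle for `path` (hypothetical helper).

    "-" means stdout, so the stream can be piped into any compressor;
    a name ending in ".gz" is gzip-compressed on the fly.
    """
    if path == "-":
        return sys.stdout
    if path.endswith(".gz"):
        # "wt" opens the gzip stream for writing text rather than bytes.
        return gzip.open(path, "wt")
    return open(path, "w")
```

Writing through `open_output("sample.pairs.gz")` would then produce a gzip file without any separate compression step, while `open_output("-")` leaves the choice of compressor to the shell pipeline.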
-
Ah, for CRAM, pysam needs a different reading mode: "rc" instead of the "rb" used for BAM. But is that the only change we would need to implement?
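A minimal sketch of what that mode switch could look like, assuming the format is inferred from the file extension. `alignment_mode` and `open_alignment` are hypothetical names, not existing pairtools functions, and whether this is the only change needed is exactly the open question. CRAM decoding may also need the reference FASTA, which pysam accepts via `reference_filename`.

```python
def alignment_mode(path):
    """Pick a pysam read mode from the file extension (hypothetical helper)."""
    lower = path.lower()
    if lower.endswith(".cram"):
        return "rc"  # CRAM
    if lower.endswith(".bam"):
        return "rb"  # BAM
    return "r"       # plain-text SAM


def open_alignment(path, reference=None):
    """Open an alignment file with the mode chosen above.

    pysam is imported lazily so the mode logic can be used without it.
    `reference` is an optional FASTA path, which CRAM files may require.
    """
    import pysam  # third-party dependency

    return pysam.AlignmentFile(
        path, alignment_mode(path), reference_filename=reference
    )
```

For example, `alignment_mode("sample.cram")` returns `"rc"`, while `alignment_mode("sample.bam")` returns `"rb"`.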
-
Thanks for the reply. I'm now using BAM and CRAM files, which I am piping. I'm also outputting the files as .gz to save disk space!
-
Dear pairtools Team,
First and foremost, I would like to extend my gratitude to the development team for creating such an incredible tool. I am currently working with very large files generated by bwa. For example, my generated BAM file is approximately 600GB in size. Additionally, the file parsed using pairtools parse is exceeding 1.6TB.
It would be highly beneficial if pairtools could support lossless CRAM files as input. Furthermore, the ability to compress .pairs files on the fly to save disk space would be immensely valuable. Is this functionality feasible?
Thank you for your consideration.