Parse a FASTA file into a pandas DataFrame efficiently. Some searching on Stack Overflow and the interwebs did not readily reveal an efficient solution to use Pandas for FASTA file data. Pandas is already a great Python module. I wanted to find the fastest way to get FASTA data into a dataframe with the least number of manipulations. Pandas can open compressed files directly from an FTP site or an S3 bucket, so I focussed on finding a solution that would only use the Pandas module. Perhaps this will allow it have a longer useful life. I spent more time than I probably should have chipping away at the number of methods, trying to reduce the syntax, and finding incremental performance gains. Feel free to use it, and please let me know if there is a faster or more efficient way.
-
Notifications
You must be signed in to change notification settings - Fork 0
Parse a FASTA file into a pandas DataFrame efficiently
License
wiebepo/Pandas-FASTA
About
Parse a FASTA file into a pandas DataFrame efficiently
Resources
License
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published