Remove dependency on pandas #44
Comments
So remove pandas and replace it with xlrd and openpyxl for now?
Should be able to use just xlrd. I don't think openpyxl can do xls files, but xlrd can do both xls and xlsx files.
I have replaced pandas with xlrd in the tests.
To replace
Currently pandas is used purely to read the Excel file, and pandas uses xlrd to do all the work anyway. Installing pandas on some computers is difficult (e.g. a Raspberry Pi running Raspbian), so if we could remove the requirement for pandas, the software could be deployed in more places and would need fewer resources. The xlrd documentation says it is best for .xls files, while openpyxl is best for .xlsx files. Unfortunately, the source data file can currently be in either format, so we may need to use both libraries until the government stops using .xls files.
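A minimal sketch of what reading either format without pandas might look like, picking the library from the file extension. The helper names `pick_reader` and `read_rows` are hypothetical and not taken from this project, and the sketch assumes the data sits in the first sheet. Note that xlrd 2.0 and later reads only .xls files, so an .xlsx reader such as openpyxl is needed alongside it on current versions.

```python
from pathlib import Path


def pick_reader(path):
    """Return the name of the library to use for this file, based on its extension."""
    suffix = Path(path).suffix.lower()
    if suffix == ".xls":
        return "xlrd"
    if suffix == ".xlsx":
        return "openpyxl"
    raise ValueError(f"unsupported spreadsheet format: {suffix!r}")


def read_rows(path, sheet_index=0):
    """Return the rows of one sheet as a list of lists of cell values.

    Hypothetical helper: sheet_index=0 assumes the data is in the first sheet.
    """
    reader = pick_reader(path)
    if reader == "xlrd":
        # Imported lazily so only the library that is actually needed must be installed.
        import xlrd

        book = xlrd.open_workbook(str(path))
        sheet = book.sheet_by_index(sheet_index)
        return [sheet.row_values(i) for i in range(sheet.nrows)]

    import openpyxl

    # read_only avoids loading the whole workbook into memory;
    # data_only returns cached formula results instead of formula text.
    book = openpyxl.load_workbook(str(path), read_only=True, data_only=True)
    try:
        sheet = book.worksheets[sheet_index]
        return [list(row) for row in sheet.iter_rows(values_only=True)]
    finally:
        book.close()
```

The two libraries may not return identical values for the same cell (empty cells and dates in particular are likely to differ), so any tests comparing .xls and .xlsx input would probably need to normalise the rows first.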