Most of the python MsOffice processing libraries, like openpyxl,python-docx,python-pptx, are dealing with new office 2007 file formate (ie: xlsx,pptx,docx). This package can change old office 2003 to new,ie: doc2docx,xls2xlsx,ppt2pptx.
Pywin32 must be pre-installed and python3 is required.
If you have difficulty on installing pywin32, go to Christoph Gohlke for wheel package,download whl file and pip install file name.
pip install changeOffice
Usage is simple:
- step1: tell converter where your data is:
from changeOffice import Change
c=Change("./data")
./data is the root dir path you put your data in ,nested dirs works` - step2: change formate and the api name is obvious self-explanation:
c.doc2docx()
c.et2xls()# .et file must be converted to xls before and then convert xls to xlsx
c.xls2xlsx()
c.ppt2pptx()
- step3: to see the effect:
print (c.get_allPath())