Reading/Writing MS Word files in Python
Solution 1:
See python-docx, its official documentation is available here.
This has worked very well for me.
Solution 2:
If you only what to read, it is simplest to use the linux soffice command to convert it to text, and then load the text into python: