For customers who do not use Libre office, the java/poi extractor can be used to extract content from the most common office document formats. The supported formats are: doc, docx, ppt, pptx, xls, xlsx.
Enable the java/poi extractor for office documents:
In HQ, select the Formats menu on the left
Search for doc
Select Microsoft Word Document
Select the Extractors tab
For java/poi, select the move up option until it is at the top (if java/poi is missing, see below)
Repeat this process for the following formats: docx, ppt, pptx, xls, xlsx
Select the System menu on the left
Select, Restart/Shutdown
Restart HQ
NOTE: If the java/poi option is missing from the menu, use the following procedure:
On the file system, navigate to HQ_HOME/config
With HQ stopped, delete or rename mimes.json
Start HQ and enable java/poi