For customers who do not use LibreOffice, the java/poi extractor can extract content from the most common office document formats. The supported formats are doc, docx, ppt, pptx, xls, and xlsx.

Enable the java/poi extractor for office documents:

  1. In HQ, select the Formats menu on the left

  2. Search for doc

  3. Select Microsoft Word Document

    Formats.png
  4. Select the Extractors tab

    Extractors.png
  5. For java/poi, select the move up option until it is at the top (if java/poi is missing, see below)

  6. Repeat this process for the following formats: docx, ppt, pptx, xls, xlsx

  7. Select the System menu on the left

  8. Select Restart/Shutdown

  9. Restart HQ

    Restart.png

NOTE: If the java/poi option is missing from the menu, use the following procedure:

  1. On the file system, navigate to HQ_HOME/config

  2. With HQ stopped, delete or rename mimes.json

  3. Start HQ, then enable java/poi for each format as described in the procedure above
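
If you prefer to script the rename rather than do it by hand, a minimal Python sketch follows. It assumes HQ_HOME is available as an environment variable, which this page does not confirm; the function name and the timestamped backup naming are hypothetical. The script does not check whether HQ is running, so stop HQ before using it.

```python
import os
import time
from pathlib import Path
from typing import Optional


def backup_mimes_json(hq_home: str) -> Optional[Path]:
    """Rename HQ_HOME/config/mimes.json to a timestamped backup file.

    Returns the backup path, or None if mimes.json does not exist.
    This does NOT verify that HQ is stopped; stop HQ first.
    """
    mimes = Path(hq_home) / "config" / "mimes.json"
    if not mimes.is_file():
        return None
    # Hypothetical naming scheme: mimes.json.<timestamp>.bak
    stamp = time.strftime("%Y%m%d-%H%M%S")
    backup = mimes.with_name(f"mimes.json.{stamp}.bak")
    mimes.rename(backup)
    return backup


if __name__ == "__main__":
    # Assumption: HQ_HOME is set in the environment to the HQ install directory.
    hq_home = os.environ.get("HQ_HOME")
    if hq_home is None:
        print("Set HQ_HOME to the HQ installation directory first.")
    else:
        result = backup_mimes_json(hq_home)
        if result is None:
            print("No mimes.json found; nothing to do.")
        else:
            print(f"Renamed mimes.json to {result}")
```

Renaming rather than deleting keeps the previous file available in case it needs to be restored.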
