Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

The indexing phase consists of two processes: Scanning and Extraction:

  • Scanning is done by Connectors, which crawl the repository and gather easily accessible information from each document, such as file name, size, last modification date etc. This process does not involve opening any files.

  • Extraction is done by Extractors which open data files and extract additional information from data crawled during the scanning phase. This might involve reading text from a Word document or reading metadata tags from an image file. Voyager chooses the extractor based on the format (mime type) of the data. An Extractor can often operate on multiple related file types, and a particular type of file may have more than one associated Extractor.  

...

  • Metadata Extraction allows manipulating specific entries from a data set's associated XML metadata. For more information, see Metadata Extraction.

  • Document Transformers can be used to update data properties directly in the index. For more information, see Document Transformers.

  • Geotagging associates geographic location with non-spatial data. For more information, see Geotagging.

Searching

After Indexing and Enrichment are complete, users can search within the data using Voyager's powerful and flexible search tools.  Search  Search results are displayed in Navigo.