Adding Pipeline Steps in HQ
There are four places you can add Pipeline steps, shown in the following diagram:
Â
Post-Scan - steps here are executed immediately after Scanning is complete
Pre-Extraction - steps here are executed just prior to Extraction
Post-Extraction - steps here are executed immediately after Extraction is complete
Pre-Index -steps here are executed just prior to the information being written to the Index
See Creating Pipelines for details on how to create and save Pipelines.
Pipeline Steps
You can add one or more of the following steps to different points in the Pipeline sequence:
S3 Thumbnail Upload
Uploads thumbnails to Amazon S3Calculate MD5 ChecksumCalculates an MD5 checksum of the source file contentIndex Debug Information
Indexes raw debug information about the indexing jobIndex Children
Indexes child entriesIndex Links
Indexes information about linked entitiesCopy Field
This transformer copies the contents of an existing index field (source) into another field (target). The source field is not modified.Rename Field
Renames one field (Source Field) to another (Destination Field). The source field is then removed from the index.ÂRemove Field
Removes a field from the IndexTransform Field Value
Transforms a field value to a different valueConvert from 2.x to 1.x Index
Converts documents from Voyager 2.x to 1.x index schemaRemove Temp Files
Removes temporary files downloaded or generated during indexingCheck Blob Store Files
Checks for meta files in Microsoft Azure blob storageSet Geo Field
Sets the geo field for the entry from spatial informationVoyager Server Thumbnail Upload
Uploads thumbnails generated by the agent to a Voyager Server instancePropagate Fields
Propagates fields to childrenAdd Format Tags
Adds format tags based on a record's mime typeSet Value for Field
Sets a field to a specific value. Useful for grouping.Append Value to Field
Appends a value to a field If the field is empty, behaves like Set Value for FieldExtract Entities with NLP
Uses Natural Language Processing to extract categorized entities from the text contentCreate Thumbnail with Base Map
Create Thumbnail using the current Base MapGeoTag Standard
Geotags the document content with the standard GazetteerGeotag Custom
Geotags the document content with a custom GazetteerSet Extent from PRJ File
Transforms spatial extent based on projection information found in a component prj fileAdd meta XML tags
Adds meta XML tags