Voyager Search Glossary
Term | Definition |
Agent | The part of Voyager 2.0 that handles searching and indexing of Repositories (also Indexing Agent) |
ArcGIS | A Geographic Information System (GIS) created by Esri for using and analyzing maps and geographic data |
ArcGIS Online | Web application created by Esri for using and sharing geographic data |
ArcMap | Esri application to display, edit and query geographic data |
Authentication | The process of verifying a user and their privileges (permissions) |
Base Map | A map on which geographic information is displayed |
Bitmap | An image based on pixels. Commonly used Bitmap formats include PNG, JPG and GIF. GIS programs use bitmaps that encode geographic data in the pixel values as well as the pixel locations. |
Boosting | Boosting involves promoting specific query results so that they show up higher in the search results - includes Boost Queries and Boost Functions |
Bounding Box | The geographic coordinates for a rectangle that contains specific GIS data |
Cart | Contains one or more records that are available for Processing |
Connector | The software that scans files in a repository and reads basic information from specific file types |
Coordinate System | A system to help define where a location is in space. There are many types of Coordinate Systems, but the most commonly used are Geographic Coordinate System (GCS) and Projected Coordinate System (PCS) |
Discovery | See Scanning |
Display Templates | Customizable templates that control the appearance of different aspects of the Navigo UI, including the Search Results page, Details page, Filters and Sorting |
Esri | Creator of ArcGIS, ArcGIS Online and ArcMap software to use, display and analyze geographic information |
Extractor | The software that opens files in a repository and extracts detailed information to add to the Index |
Federated Search | Federation uses multiple, separate Voyager Indexes to create a single, searchable Federated Index |
File Type | See MIME Types |
Filters | Filters allow refining of search results by Format, Spatial Reference, Location etc. |
Flex Index | In Voyager 2.0, the Flex Index (based on SolrCloud) is centrally managed (by HQ) and comes with built-in failover and load-balancing. Content is organized into multiple logical pieces, or shards, that can be hosted on multiple machines. Shards are managed by Zookeeper. Indexing is handled by one or more Indexing Agents. The Flex Index can easily scale to contain very large data sets (10+ million records). |
FME | Feature Manipulation Engine - a platform for translating spatial data into other formats that can be shared by different applications |
Format | See MIME Types |
Gazetteer | A Gazetteer provides location information used to Geotag non-spatial data |
GCS | Geographic Coordinate System - GCS uses a grid on the surface of a 3D globe to identify locations. There are many types of GCS. See also PCS. |
Geodatabase | A database storing GIS information. Geodatabase files have .gdb or .mdb extensions. |
GeoPDF | A geospatial extension to the PDF file format, used to present GIS and mapping data in a standard PDF file |
Geospatial | Geospatial refers to data with a geographic or location component |
Geotagging | Geotagging adds location information (spatial references) to non-spatial data |
GIS | In the most basic sense, Geographic Information Systems use geographic data to create maps and produce spatial analysis |
GML | Geography Markup Language, based on XML |
HQ | The central component of Voyager 2.0 that manages Discovery, Indexing Agents and data repositories. Currently requires the presence of Voyager Server for Security and other management controls. |
Index | The set of searchable data derived from cataloging Repositories |
Index Shards | Logical partitions of the Index in SolrCloud and Voyager 2.0, composed of one or more replicas |
Indexing | The process of adding content to the searchable Index - includes Scanning and Extraction |
Indexing Agent | The part of Voyager 2.0 that handles searching and indexing of Repositories |
Indexing Pipeline | See Pipeline |
Java | Refers either to the Java programming language or the Java platform, a set of tools allowing for easy cross-platform application development |
JSON | JavaScript Object Notation is a minimal, text-based format used for exchanging data |
Layer | A data source such as a shapefile, geodatabase feature class or raster that defines how data should be symbolized on a map |
LDAP | Lightweight Directory Access Protocol - a client-server protocol for accessing Directory Services over TCP/IP |
Location | A set of data, usually geographic in nature (Repository in Voyager 2.0) |
Lucene | An open-source Java library that implements indexing, analyzing and searching |
Metadata | Information that describes the content, quality, condition and other characteristics of data, separate from the data itself |
MIME Types | Multipurpose Internet Mail Extensions - a standardized way of identifying files on the Internet based on their nature and format |
NLP | Natural Language Processing is a way of using computer systems to try to interpret text information in the same way that a person would |
OCR | Optical Character Recognition is the process of reading text from files such as scanned documents or images |
PCS | Projected Coordinate System - PCS transforms a 3D model and projects it onto a 2D surface such as a map or computer screen. There are many types of PCS, such as Mercator, Transverse Mercator and Gnomonic. All projections of a 3D globe onto a 2D surface introduce distortions, so the choice of projection depends on which properties (such as area, shape, distance or direction) matter most for the map's purpose. See also GCS. |
Pipeline | The Pipeline consists of a series of customizable functions that can be used to manipulate and transform data during the Indexing process |
Portal for ArcGIS | A map-centric content management system deployed within an organization's infrastructure |
Processing | Processing refers to the set of tasks that can modify, refine, delete or export input data in the Cart |
Query | A Query is a search request applied to the Solr Index |
Query Expansion | Query Expansion takes search term(s) and calculates other, similar terms using NLP. It then uses these expanded terms as well as your original search term when it executes a query. |
Raster | See Bitmap |
Record | Refers to a single file found in a Repository - the data on which an Index is built |
Regex/Regular Expressions | A Regular Expression (sometimes called regex) is a special text string for describing a search pattern |
Relevancy | When Voyager executes a query against an index, it lists more-relevant items at the top of the search results and less-relevant items at the bottom. |
Replication | A master-slave configuration that can improve reliability and fault tolerance, where the contents of a Master Index are replicated in Slave Indexes |
Repository | See Location |
SAML | The Security Assertion Markup Language is an open standard that allows security credentials to be shared by multiple computers across a network |
Saved Search | A saved Index query that can be re-used and shared with other users |
Scanning | The process of examining records to read basic information such as file name and file type. This is the initial phase of Indexing and is followed by Extraction. Also referred to as Discovery in earlier Voyager versions. |
Shapefile | A format for storing vector GIS data. Geographic features in a shapefile can be represented by points, lines, or polygons (areas). Shapefiles do not store topology, only feature geometry and attributes. Each Shapefile has three required files: .shp (the main file), .shx (index file) and .dbf (dBASE table file) |
Single Sign-on | A process where the user only needs to sign on once to access multiple systems |
Snapshot | A copy of the information in the Index directory at a specific time, which can include additional information such as Saved Searches or Discovery Settings. Similar to a Backup. |
Solr | The open-source search platform underlying Voyager Search, built on the Lucene Java library |
SolrCloud | Distributed search and indexing solution using Index Shards managed by Zookeeper. Includes built-in failover and load-balancing |
Spatial Settings | Determines whether Voyager stores only bounding boxes or bounding boxes plus spatial geometry for Index records |
Vector Graphics | Vector Graphics are images described by mathematical formulas instead of individual pixels. Vector images are commonly in SVG (Scalable Vector Graphics) format |
WFS | Web Feature Service - provides ability to query and manipulate geographic features. Transactions are in GML. |
WMS | Web Map Service - serves georeferenced maps (usually Bitmaps or Vector Graphics) across the network |
Worker | A Worker is a Java thread used to execute a task such as scanning or indexing |
XML | eXtensible Markup Language - XML uses markup to describe data content (vs data presentation) and does not use pre-defined tags as does HTML |
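
To make the Bounding Box entry above more concrete, here is a minimal Python sketch of how a bounding box might be derived from a set of coordinates and tested for overlap with another box. The names `BoundingBox`, `bounding_box` and `intersects` are hypothetical and are not part of any Voyager API; the sketch assumes (longitude, latitude) pairs in decimal degrees and does not handle boxes that cross the antimeridian.

```python
from typing import Iterable, NamedTuple, Tuple


class BoundingBox(NamedTuple):
    """Hypothetical container: the rectangle enclosing some GIS data."""
    min_lon: float
    min_lat: float
    max_lon: float
    max_lat: float


def bounding_box(points: Iterable[Tuple[float, float]]) -> BoundingBox:
    """Return the smallest axis-aligned rectangle containing all points.

    Each point is a (longitude, latitude) pair in decimal degrees.
    """
    pts = list(points)
    if not pts:
        raise ValueError("at least one point is required")
    lons = [lon for lon, _ in pts]
    lats = [lat for _, lat in pts]
    return BoundingBox(min(lons), min(lats), max(lons), max(lats))


def intersects(a: BoundingBox, b: BoundingBox) -> bool:
    """Return True if the two rectangles overlap or touch."""
    return (
        a.min_lon <= b.max_lon
        and b.min_lon <= a.max_lon
        and a.min_lat <= b.max_lat
        and b.min_lat <= a.max_lat
    )


if __name__ == "__main__":
    # Three illustrative points; the values are made up for the example.
    box = bounding_box([(-122.4, 37.8), (-118.2, 34.1), (-117.2, 32.7)])
    print(box)
    print(intersects(box, BoundingBox(-120.0, 33.0, -119.0, 35.0)))
```

A rectangle comparison like this is cheap to compute. Exact spatial geometry, which the Spatial Settings entry lists as the alternative to storing bounding boxes alone, requires more data per record.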