Adding a Solr Repository in HQ
HQ can not only index many different types of data sources, it can also index an existing Solr index. This incorporates a new Solr index into the existing Index. To add the contents of a remote Solr index:
Go to HQ > Repositories
Click Add New
Select Database
Click Next
Choose Solr Search Index from the Database Type drop-down menu
Enter the details for the Solr index in the dialog that appears
Â
Solr URLEnter the URL of a Solr server
Collection
Enter the name of the Solr collection to index. When this is a Voyager index this defaults to v0.v0Enter the location of a Voyager Solr Index
FieldsEnter a comma-separated list of fields to fetch from Solr (the Solr fl query parameter)
FiltersEnter a comma-separated list of filters to apply to Solr query fetching content (the Solr fq query parameter)
ParametersEnter any additional parameters to apply to the Solr query in query string format, ie. param1=value2&m2=value2&...
Page SizeEnter number of documents to fetch from Solr in a single request (default value is 1000)
ID FieldEnter the name of the field in the remote Solr index to treat as a unique identifier. If the remote Solr index is a Voyager index this defaults to id
Name FieldEnter the name of the field in the remote Solr index to treat as a name. If the remote Solr index is a Voyager index this defaults to name
Remap LocationThis flag controls how to handle documents in the remote Solr index that have a location field. When this is set, the remote value will be overwritten with the local value (this is the default).Â
Index DirectlyThis flag controls whether to skip the standard pipeline when indexing documents from the remote Solr index. When this is set, documents will be copied directly into the local Solr index. Direct Indexing mode is faster than indexing through the Voyager pipeline but disregards any pipeline logic (unless Apply Pipeline is set, see below).
Apply PipelineThis flag applies in Direct Indexing mode (see previous setting). When this is set, the pipeline logic is partially applied to indexed documents. This includes pre-pipeline steps, document transformers, and post-pipeline steps but not other aspects of the pipeline. While this adds overhead to indexing documents, it is still faster than indexing in non-direct mode.