One or more Dgidx components are defined, depending on the deployment type you choose.
If an Agraph deployment type is chosen, an indexing cluster
component is defined. This object is used to apply actions to an
entire cluster of Dgidxs, rather than manually iterating over a
number of Dgidxs. In addition, the object contains logic associated
with executing Dgidxs in parallel based on Dgidx groups, which are
described below. Multiple indexing clusters can be defined, with no
restriction on which Dgidxs belong to each cluster or on how many
clusters a Dgidx belongs to.
An indexing cluster is configured with references to all Dgidxs
that belong to that cluster. In addition, the cluster can be
configured to copy data in parallel or serially through its
getDataInParallel attribute. This setting applies to the copies that
deliver source data and configuration to each server that hosts a
Dgidx component. By default, the template sets getDataInParallel to true.
<!--
########################################################################
# Indexing Cluster
#
-->
<indexing-cluster id="IndexingCluster" getDataInParallel="true">
  <agidx ref="Agidx1" />
  <dgidx ref="Dgidx1" />
  <dgidx ref="Dgidx2" />
</indexing-cluster>
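The effect of getDataInParallel can be illustrated with a minimal Python sketch. It is not the Deployment Template's implementation: `read_cluster`, `copy_data_to`, and `get_data` are hypothetical names, the copy step is a stand-in that only returns a string, and treating a missing attribute as serial is an assumption.

```python
import xml.etree.ElementTree as ET
from concurrent.futures import ThreadPoolExecutor

# The sample cluster definition shown above.
CLUSTER_XML = """
<indexing-cluster id="IndexingCluster" getDataInParallel="true">
  <agidx ref="Agidx1" />
  <dgidx ref="Dgidx1" />
  <dgidx ref="Dgidx2" />
</indexing-cluster>
"""


def read_cluster(xml_text):
    """Return (list of Dgidx refs, parallel flag) for one indexing cluster."""
    root = ET.fromstring(xml_text)
    dgidx_ids = [el.get("ref") for el in root.findall("dgidx")]
    # Assumption: anything other than the literal "true" means serial copies.
    parallel = root.get("getDataInParallel") == "true"
    return dgidx_ids, parallel


def copy_data_to(dgidx_id):
    # Hypothetical stand-in for copying source data and configuration
    # to the server that hosts this Dgidx.
    return f"data copied for {dgidx_id}"


def get_data(dgidx_ids, parallel):
    """Run the copy for every Dgidx, concurrently or one at a time."""
    if parallel and dgidx_ids:
        with ThreadPoolExecutor(max_workers=len(dgidx_ids)) as pool:
            # map() returns results in input order even though the
            # copies themselves run concurrently.
            return list(pool.map(copy_data_to, dgidx_ids))
    return [copy_data_to(dgidx_id) for dgidx_id in dgidx_ids]


dgidx_ids, parallel = read_cluster(CLUSTER_XML)
results = get_data(dgidx_ids, parallel)
```

Both modes produce the same set of copies; only the scheduling differs, which matters mainly when several Dgidx hosts pull large data sets at the same time.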
In addition to standard Dgidx configuration settings and process arguments, the Deployment Template uses several configurable properties and custom directories during processing:
- numLogBackups - Number of log directory backups to store.
- numIndexBackups - Number of index backups to store.
- incomingDataHost - Host to which source data files are extracted.
- incomingDataDir - Directory to which source data files are extracted.
- incomingDataFileName - Filename of the source data files that are extracted.
- configHost - Host from which configuration files and dimensions are retrieved for Dgidx to process.
- configDir - Directory from which configuration files and dimensions are retrieved for Dgidx to process.
- configFileName - Filename of the configuration files and dimensions that are retrieved for Dgidx to process.
- skipTestingForFilesDuringCleanup - Used for directory-cleaning operations. If set to "true", the cleanup skips the directory-contents test and proceeds directly to cleaning the directory. The default behavior is to test the directory contents and skip cleanup if the directory is not empty.
- The properties documented in the "Fault tolerance and polling interval properties" topic.
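As an illustration of what a backup-count property such as the ones listed above implies, the following Python sketch keeps the newest N backups and reports the rest for removal. This is a sketch under stated assumptions, not the template's actual rotation logic: `prune_backups` is a hypothetical helper, and the backup names are invented so that sorting them alphabetically also sorts them by age.

```python
def prune_backups(backup_names, num_backups):
    """Split backups into (kept, removed), keeping the newest num_backups.

    Assumption: names sort oldest-first, for example because they end
    in an ISO-style date. Real backup naming may differ.
    """
    ordered = sorted(backup_names)
    # Number of old backups that exceed the configured limit.
    cut = max(len(ordered) - num_backups, 0)
    return ordered[cut:], ordered[:cut]


# Hypothetical backup directory names, with a limit of three backups.
kept, removed = prune_backups(
    [
        "index.2024-01-03",
        "index.2024-01-01",
        "index.2024-01-02",
        "index.2024-01-04",
    ],
    3,
)
```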
In addition to standard Dgidx configuration and process
arguments, each Dgidx component accepts a custom property that defines
which Dgidx processes run in parallel with each other when they
belong to an indexing cluster:
- dgidxGroup - Indicates the Dgidx's membership in a Dgidx group. When the run method on an indexing cluster is executed, Dgidx processes within the same Dgidx group are run in parallel. Dgidx group values are arbitrary strings. The indexing cluster iterates through the groups in alphabetical order, though non-standard characters may result in groups being updated in an unexpected order.
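The group-by-group behavior described for dgidxGroup can be sketched in Python as follows. The sketch assumes that groups are ordered by plain string comparison, which the text above does not confirm; `plan_runs` and `run_cluster` are hypothetical names, and `run_one` stands in for whatever actually starts a Dgidx.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor


def plan_runs(dgidx_groups):
    """Turn {Dgidx id: dgidxGroup value} into an ordered list of batches."""
    by_group = defaultdict(list)
    for dgidx_id, group in dgidx_groups.items():
        by_group[group].append(dgidx_id)
    # sorted() compares strings by code point; this is an assumption
    # about the ordering, not the template's documented comparison.
    return [(group, sorted(by_group[group])) for group in sorted(by_group)]


def run_cluster(dgidx_groups, run_one):
    """Run groups one after another, and members of a group in parallel."""
    finished = []
    for _group, members in plan_runs(dgidx_groups):
        with ThreadPoolExecutor(max_workers=len(members)) as pool:
            # Leaving the with-block waits for the whole group to finish
            # before the next group starts.
            finished.extend(pool.map(run_one, members))
    return finished


# Two Dgidxs share group "A" and run together; group "B" runs afterwards.
plan = plan_runs({"Dgidx1": "A", "Dgidx2": "A", "Dgidx3": "B"})

# Mixed case, digits, and punctuation do not sort the way a person
# might expect under code-point comparison.
odd_order = [
    group
    for group, _members in plan_runs(
        {"d1": "group10", "d2": "group2", "d3": "Group3", "d4": "_last"}
    )
]
```

Under this assumption, "group10" is processed before "group2" and "Group3" before both, which is one way groups can end up in an unexpected order. Short, uniformly cased values such as "A", "B", "C" avoid the ambiguity.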