Dgraph baseline update script

The baseline update script defined in the AppConfig.xml document for a Dgraph deployment is included in this section, with numbered steps indicating the actions performed at each point in the script.

<script id="BaselineUpdate">
    <![CDATA[ 
  log.info("Starting baseline update script.");
  1. Obtain lock. The baseline update attempts to set an "update_lock" flag in the EAC to serve as a lock or mutex. If the flag is already set, this step fails, ensuring that the update cannot be started more than once simultaneously, as this would interfere with data processing. The flag is removed in the case of an error or when the script completes successfully.
        // obtain lock
        if (LockManager.acquireLock("update_lock")) {
    
  2. Validate data readiness. Check that a flag called "baseline_data_ready" has been set in the EAC. This flag is set as part of the data extraction process to indicate that files are ready to be processed (or, in the case of an application that uses direct database access, the flag indicates that a database staging table has been loaded and is ready for processing). This flag is removed as soon as the script copies the data out of the data/incoming directory, indicating that new data may be extracted.
        // test if data is ready for processing
        if (Forge.isDataReady()) {
    
  3. If Workbench integration is enabled, download and merge Workbench configuration. The ConfigManager copies all Developer Studio config files to the complete_index_config directory. Then, all Workbench-maintained configuration files are downloaded. Any files that are configured in the ConfigManager component to be maintained by the Oracle Endeca Workbench are copied to the complete_index_config directory, overwriting the Developer Studio copy of the same file, if one exists. The final result is a complete set of configuration files for Forge to use. If Workbench integration is not enabled, the ConfigManager copies all Developer Studio config files to the complete_index_config directory.
        if (ConfigManager.isWebStudioEnabled()) {
           // get Web Studio config, merge with Dev Studio config
           ConfigManager.downloadWsConfig();
           ConfigManager.fetchMergedConfig();
        } else {
           ConfigManager.fetchDsConfig();
        }
    
  4. Clean processing directories. Files from the previous update are removed from the data/processing, data/forge_output, data/temp, data/dgidx_output and data/partials/cumulative_partials directories.
        // clean directories
        Forge.cleanDirs();
        PartialForge.cleanCumulativePartials();
        Dgidx.cleanDirs();
    
  5. Copy data to processing directory. Extracted data in data/incoming is copied to data/processing.
        // fetch extracted data files to forge input
        Forge.getIncomingData();
    
  6. Release Lock. The "baseline_data_ready" flag is removed from the EAC, indicating that the incoming data has been retrieved for baseline processing.
        LockManager.releaseLock("baseline_data_ready");
  7. Copy config to processing directory. Configuration files are copied from data/complete_index_config to data/processing.
        // fetch config files to forge input
        Forge.getConfig();
    
  8. Archive Forge logs. The logs/forges/Forge directory is archived, to create a fresh logging directory for the Forge process and to save the previous Forge run's logs.
        // archive logs
        Forge.archiveLogDir();
    
  9. Forge. The Forge process executes.
        Forge.run();
  10. Archive Dgidx logs. The logs/dgidxs/Dgidx directory is archived, to create a fresh logging directory for the Dgidx process and to save the previous Dgidx run's logs.
        // archive logs
        Dgidx.archiveLogDir();
    
  11. Dgidx. The Dgidx process executes.
        Dgidx.run();
  12. Distribute index to each server. A single copy of the new index is distributed to each server that hosts a Dgraph. If multiple Dgraphs are located on the same server but specify different srcIndexDir attributes, multiple copies of the index are delivered to that server.
  13. Update MDEX Engines. The Dgraphs are updated. Engines are updated according to the restartGroup property specified for each Dgraph. The update process for each Dgraph is as follows:
    1. Create dgraph_input_new directory.
    2. Create a local copy of the new index in dgraph_input_new.
    3. Stop the Dgraph.
    4. Archive Dgraph logs (e.g. logs/dgraphs/Dgraph1) directory.
    5. Rename dgraph_input to dgraph_input_old.
    6. Rename dgraph_input_new to dgraph_input.
    7. Start the Dgraph.
    8. Remove dgraph_input_old.
    This somewhat complex update functionality is implemented to minimize the amount of time that a Dgraph is stopped. This restart approach ensures that the Dgraph is stopped just long enough to rename two directories.
        // distributed index, update Dgraphs
        DistributeIndexAndApply.run();
    
    <script id="DistributeIndexAndApply">
        <bean-shell-script>
          <![CDATA[ 
        DgraphCluster.cleanDirs();
        DgraphCluster.copyIndexToDgraphServers();
        DgraphCluster.applyIndex();
          ]]>
        </bean-shell-script>
      </script>
    
  14. If Workbench integration is enabled, upload post-Forge dimensions to Oracle Endeca Workbench. The latest dimension values generated by the Forge process are uploaded to Oracle Endeca Workbench, to ensure that any new dimension values (including values for autogen dimensions and external dimensions) are available to Oracle Endeca Workbench for use in, for example, dynamic business rule triggers.
    Note: This action does not add new dimensions or remove existing dimensions. These changes can be made by invoking the update_web_studio_config.[bat|sh] script.
        // if Workbench is integrated, update Workbench with latest 
        // dimension values
        if (ConfigManager.isWebStudioEnabled()) {
           ConfigManager.cleanDirs();
           Forge.getPostForgeDimensions();
           ConfigManager.updateWsDimensions();
        }
    
  15. Archive index and Forge state. The newly created index and the state files in Forge's state directory are archived on the indexing server.
        // archive state files, index
        Forge.archiveState();
        Dgidx.archiveIndex();
    
  16. Cycle LogServer. The LogServer is stopped and restarted. During the downtime, the LogServer's error and output logs are archived.
        // cycle LogServer
        LogServer.cycle();
    
  17. Release Lock. The "update_lock" flag is removed from the EAC, indicating that another update may be started.
        // release lock
        LockManager.releaseLock("update_lock");
    
        log.info("Baseline update script finished.");
          } else {
        log.warning("Failed to obtain lock.");
          }
        ]]>
      </bean-shell-script>
    </script>