The [appdir]/config/script/fetchCasCrawlDataConfig.xml file is the global CAS crawl configuration for the application.
To edit fetchCasCrawlDataConfig.xml for your crawling environment, first locate the CAS custom-component definition:
<custom-component id="CAS" host-id="CASHost" class="com.endeca.eac.toolkit.component.cas.ContentAcquisitionServerComponent">
  <properties>
    <property name="casHost" value="WEB009.mycompany.com" />
    <property name="casPort" value="8500" />
    <property name="casCrawlFullOutputDestDir"
        value="./data/complete_cas_crawl_output/full" />
    <property name="casCrawlIncrementalOutputDestDir"
        value="./data/complete_cas_crawl_output/incremental" />
    <property name="casCrawlOutputDestHost" value="CASHost" />
  </properties>
</custom-component>
If the CAS Service was started on a port other than the default (8500), change the value of the casPort property to that port number.
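As an illustration only, the changed property might look like the following fragment. The port number 8501 is a hypothetical value chosen for this example; substitute the port on which your CAS Service actually listens.

```xml
<!-- Hypothetical example: CAS Service started on port 8501 instead of the default 8500 -->
<property name="casPort" value="8501" />
```

All other properties in the CAS custom-component can stay as they are unless your host name or output directories differ.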
For an XML output format (either compressed or uncompressed), locate the <script id="fetchIncrementalCasCrawlData"> section of the script and change the filename extension of the placeholder file from "placeholder.bin" to "placeholder.xml", so that the code looks like this:
if (! fileUtil.dirContainsFiles(incrDestDir, Forge.getHostId())) {
    placeholder = incrDestDir + "/placeholder.xml";
For uncompressed formats (either binary or XML), remove or comment out the following statements, which run a shell script to compress the placeholder file:
shell.init("zip_incremental_cas_crawl_placeholder",
    Forge.getHostId(), zipCmd, Forge.getWorkingDir());
shell.run();
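A minimal sketch of the commented-out statements in context follows. It assumes that the script body is wrapped in a bean-shell-script element with a CDATA section, as is typical for scripts in deployment template configuration files; verify this against your own copy of the file. The "..." comment lines stand for the existing statements of the script, which are left unchanged and are not reproduced here.

```xml
<script id="fetchIncrementalCasCrawlData">
  <bean-shell-script>
    <![CDATA[
    // ... earlier statements of the script, unchanged ...

    // Disabled for uncompressed output: the placeholder file is not zipped.
    // shell.init("zip_incremental_cas_crawl_placeholder",
    //     Forge.getHostId(), zipCmd, Forge.getWorkingDir());
    // shell.run();

    // ... remaining statements of the script, unchanged ...
    ]]>
  </bean-shell-script>
</script>
```

Commenting the statements out rather than deleting them makes it easier to restore compression later if the crawl output format changes.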