Oracle Secure Enterprise Search Java API Reference
10g Release 1 (10.1.6) Beta

B19032-01


oracle.search.sdk.crawler
Interface CrawlingThreadService


public interface CrawlingThreadService

CrawlingThreadService is an interface used by a crawler plugin to perform crawl related tasks. It has execution context specific to the crawling thread that invokes the plugin crawl() method


Method Summary
 void markStatusNotChanged(DocumentMetadata meta)
          mark a url entry as not requiring any changes or updates.
 void submitForProcessing(DocumentContainer target)
          submit the document for processing.

 

Method Detail

submitForProcessing

public void submitForProcessing(DocumentContainer target)
                         throws ProcessingException
submit the document for processing. It will be indexed if its status code is DocumentContainer.STATUS_OK_FOR_INDEX. After the processing is done this document will be automatically remove from the queue.
Parameters:
target - the document container containing the content and metadata.
Throws:
ProcessingException

markStatusNotChanged

public void markStatusNotChanged(DocumentMetadata meta)
                          throws ProcessingException
mark a url entry as not requiring any changes or updates. This will simply remove the entry from the URL Queue and will not re-index or perform any additional operations on this url entry. This should be used when re-crawling a content and when there is no change * to a particular URL.
Parameters:
meta - the metadata object corresponding to the url entry
Throws:
ProcessingException

Oracle Secure Enterprise Search Java API Reference
10g Release 1 (10.1.6) Beta

B19032-01


Copyright © 2006, Oracle. All rights reserved.