
Board Meeting - General Guidelines


Steve: It was a very productive meeting with you, Mr. Timothy, and a great knowledge-sharing session. Before we ramp up, do you have any general guidelines that you would like to share with us?

Timothy: Yes, it was a great opportunity for me too, to share my knowledge about the XML Framework. I do have some general guidelines for the component in the target repository that generates the XML feeds:

1. For directory feeds, the number of documents per directory should be fewer than 10000. There are two cases to consider:
i. Feed Files: Set the number of items per feed file so that the total number of feed files in the feed directory stays under 10000.
ii. Content Files: If the feed files specify content through attachment links and the targets of these links are stored in the filesystem, distribute the targets across multiple directories so that the total number of files per directory stays under 10000.


2. When feeds are generated in real time over HTTP, ensure that the component generating the feeds takes the time-out of feed requests into account. The feed served as the response to each request should be made available within this time-out interval. Otherwise, the request from SES will time out. The request will be retried as many times as specified when setting up the source in SES. The crawl will abort if all of these attempts fail.
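
The two guidelines above can be illustrated with a short Python sketch. It is not part of SES or the XML Framework; every name in it (min_items_per_feed_file, shard_dir_for_index, content_path, request_feed_with_retries) is hypothetical. It assumes content files are numbered sequentially, and it treats the retry setting as a total number of attempts, which may differ from how SES counts retries.

```python
from pathlib import PurePosixPath

MAX_FILES_PER_DIR = 9999  # keeps every directory under the 10000 limit


def min_items_per_feed_file(total_items, max_files=MAX_FILES_PER_DIR):
    """Smallest items-per-file value that keeps the feed file count at or below max_files."""
    return max(1, (total_items + max_files - 1) // max_files)  # ceiling division


def shard_dir_for_index(index, max_per_dir=MAX_FILES_PER_DIR):
    """Name of the subdirectory for the index-th content file (0-based)."""
    return f"{index // max_per_dir:05d}"


def content_path(root, index, filename):
    """Full path of a content file, spread across numbered subdirectories."""
    return str(PurePosixPath(root) / shard_dir_for_index(index) / filename)


def request_feed_with_retries(fetch_feed, max_attempts):
    """Model of the crawl behavior described above: try again after each
    time-out and abort once all attempts have failed.

    fetch_feed is a hypothetical callable that returns the feed or raises
    TimeoutError when the feed is not ready within the time-out interval.
    """
    for _ in range(max_attempts):
        try:
            return fetch_feed()
        except TimeoutError:
            continue  # this attempt timed out; try again if attempts remain
    raise RuntimeError("crawl aborted: all feed requests timed out")
```

For example, one million items need at least 101 items per feed file, and the content file with index 9999 is the first one placed in the second subdirectory, 00001.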

Steve: Thank you so much for your time and guidance, Timothy. We will apply what we learned today to our Genie application and use SES to crawl our feeds.

Timothy: You are welcome. Please contact me if you need any other information or help.

Steve, Andrew, Philip, and Timothy to each other: Have a good day. Bye!!