Good article.
I was also wondering about the last column. You seem to imply that if
the most frequent information is at the end of the document, then it
won't be taken into consideration by the indexer. Wouldn't it make sense
for the indexer to make its decisions after reading the entire document
(which seems to be the case if more than one file is given)?