Tomasic, A. and Garcia-Molina, H. and Shoens, K. (1994) Incremental Updates of Inverted Lists for Text Document Retrieval. Technical Report. Stanford InfoLab. (Publication Note: ACM International Conference on Management of Data, May 24-27, 1994, Minneapolis, Minnesota (SIGMOD 1994))
With the proliferation of the orld's "information highways" a renewed interest in effcient document indexing techniques has come about. In this paper, the problem of incremental updates of inverted lists is addressed using a new dual-structure index data structure. The index dynamically separates long and short inverted lists and optimizes the retrieval, update, and storage of each type of list. To study the behavior of the index, a space of engineering tradeoffs which range from optimizing update time to optimizing query performance is described. We quantitatively explore this space by using actual data and hardware in combination with a simulation of an information retrieval system. We then describe the best algorithm for a variety of criteria.
|Item Type:||Techreport (Technical Report)|
|Related URLs:||Project Homepage||http://infolab.stanford.edu/|
|Deposited By:||Import Account|
|Deposited On:||25 Feb 2000 16:00|
|Last Modified:||05 Feb 2009 15:49|
Repository Staff Only: item control page