Shoens, K. and Tomasic, A. and Garcia-Molina, H. (1994) Synthetic Workload Performance Analysis of Incremental Updates. Technical Report. Stanford. (Publication Note: 17th annual international ACM SIGIR conference on Research and development in information retrieval, Dublin, Ireland July 03 - 06, 1994 (SIGIR 1994) )
Declining disk and CPU costs have kindled a renewed interest in effcient document indexing techniques. In this paper, the problem of incremental updates of inverted lists is addressed using a dual-structure index data structure that dynamically separates long and short inverted lists and optimizes the retrieval, update, and storage of each type of list. The behavior of this index is studied with the use of a synthetically-generated document collection and a simulation model of the algorithm. The index structure is shown to support rapid insertion of documents, fast queries, and to scale well to large document collections and many disks.
|Item Type:||Techreport (Technical Report)|
|Related URLs:||Project Homepage||http://infolab.stanford.edu/|
|Deposited By:||Import Account|
|Deposited On:||25 Feb 2000 16:00|
|Last Modified:||05 Feb 2009 15:47|
Repository Staff Only: item control page