Stanford InfoLab Publication Server

WebBase: A repository of web pages

Hirai, Jun and Raghavan, Sriram and Garcia-Molina, Hector and Paepcke, Hector (2000) WebBase: A repository of web pages. In: Ninth International World Web Conference (WWW 2000), May 15-19, 2000, Amsterdam.




In this paper, we study the problem of constructing and maintaining a large shared repository of web pages. We discuss the unique characteristics of such a repository, propose an architecture, and identify its functional modules. We focus on the storage manager module, and illustrate how traditional techniques for storage and indexing can be tailored to meet the requirements of a web repository. To evaluate design alternatives, we also present experimental results from a prototype repository called WebBase, that is currently being developed at Stanford University. Keywords : Repository, WebBase, Architecture, Storage management

Item Type:Conference or Workshop Item (Paper)
Additional Information:Previous number = SIDL-WP-1999-0124
Subjects:Computer Science > Digital Libraries
Projects:Digital Libraries
Related URLs:Project Homepage
ID Code:473
Deposited By:Import Account
Deposited On:30 Oct 2001 16:00
Last Modified:27 Dec 2008 14:38

Download statistics

Repository Staff Only: item control page