Stanford InfoLab Publication Server

Multicasting a Web Repository [extended version]

Lam, Wang and Garcia-Molina, Hector (2001) Multicasting a Web Repository [extended version]. Technical Report. Stanford InfoLab.

BibTeXDublinCoreEndNoteHTML

[img]
Preview
PDF
331Kb

Abstract

Web crawlers generate significant loads on Web servers, and are difficult to operate. Instead of running crawlers at many "client" sites, we propose a central crawler and Web repository that then multicasts appropriate subsets of the central repository to clients. Loads at Web servers are reduced because a single crawler visits the servers, as opposed to all the client crawlers. In this paper we model and evaluate such a central Web multicast facility. We develop multicast algorithms for the facility, comparing them with ones for "broadcasts disks." We also evaluate performance as several factors, such as object granularity and client batching, are varied.

Item Type:Techreport (Technical Report)
Additional Information:Previous number = SIDL-WP-2001-0151
Subjects:Computer Science > Databases and the Web
Projects:Digital Libraries
Related URLs:Project Homepagehttp://www-diglib.stanford.edu/diglib/pub/
ID Code:521
Deposited By:Import Account
Deposited On:29 Nov 2001 16:00
Last Modified:27 Dec 2008 10:23

Download statistics

Repository Staff Only: item control page