Lam, Wang and Garcia-Molina, Hector (2001) Multicasting a Web Repository [extended version]. Technical Report. Stanford InfoLab.
Web crawlers generate significant loads on Web servers and are difficult to operate. Instead of running crawlers at many "client" sites, we propose a central crawler and Web repository that multicasts appropriate subsets of the central repository to clients. Loads at Web servers are reduced because a single crawler visits the servers, as opposed to all the client crawlers. In this paper we model and evaluate such a central Web multicast facility. We develop multicast algorithms for the facility, comparing them with ones for "broadcast disks." We also evaluate performance as several factors, such as object granularity and client batching, are varied.
|Item Type:||Techreport (Technical Report)|
|Additional Information:||Previous number = SIDL-WP-2001-0151|
|Subjects:||Computer Science > Databases and the Web|
|Related URLs:||Project Homepage: http://www-diglib.stanford.edu/diglib/pub/|
|Deposited By:||Import Account|
|Deposited On:||29 Nov 2001 16:00|
|Last Modified:||27 Dec 2008 10:23|