Cho, J. and Garcia-Molina, H. (2000) Estimating Frequency of Change. Technical Report. Stanford.
BibTeX | DublinCore | EndNote | HTML |
| PDF 235Kb |
Abstract
Many online data sources are updated autonomously and independently. In this paper, we make the case for estimating the change frequency of the data, to improve web crawlers, web caches and to help data mining. We first identify various scenarios, where different applications have different requirements on the accuracy of the estimated frequency. Then we develop several "frequency estimators" for the identified scenarios. In developing the estimators, we analytically show how precise/effective the estimators are, and we show that the estimators that we propose can improve precision significantly.
Item Type: | Techreport (Technical Report) | |
---|---|---|
Uncontrolled Keywords: | Web change, estimation, change frequency, web crawler | |
Subjects: | Computer Science > Databases and the Web Computer Science > Digital Libraries | |
Projects: | Digital Libraries | |
Related URLs: | Project Homepage | http://www-diglib.stanford.edu/diglib/pub/ |
ID Code: | 471 | |
Deposited By: | Import Account | |
Deposited On: | 25 Feb 2000 16:00 | |
Last Modified: | 27 Dec 2008 11:52 |
Download statistics
Repository Staff Only: item control page