Stanford InfoLab Publication Server

Synchronizing a database to Improve Freshness

Cho, J. and Garcia-Molina, H. (1999) Synchronizing a database to Improve Freshness. Technical Report. Stanford InfoLab. (Publication Note: ACM International Conference on Management of Data (SIGMOD 2000) Dallas, Texas, May 14-19, 2000)




In this paper we study how to refresh a local copy of an autonomous data source to maintain the copy up-to-date. As the size of the data grows, it becomes more difficult to maintain the copy "fresh," making it crucial to synchronize the copy effectively. We define two freshness metrics, change models of the underlying data, and synchronization policies. We analytically study how effective the various policies are. We also experimentally verify our analysis, based on data collected from 270 web sites for more than 4 months, and we show that our new policy improves the "freshness" very significantly compared to current policies in use.

Item Type:Techreport (Technical Report)
Uncontrolled Keywords:Web, database, warehousing, synchronization, refresh
Subjects:Computer Science > Data Warehousing
Projects:Digital Libraries
Related URLs:Project Homepage
ID Code:396
Deposited By:Import Account
Deposited On:25 Feb 2000 16:00
Last Modified:27 Dec 2008 21:07

Download statistics

Repository Staff Only: item control page