Stanford InfoLab Publication Server

Approximate DataGuides

Goldman, R. and Widom, J. (1999) Approximate DataGuides. Technical Report. Stanford InfoLab.




DataGuides are concise and accurate summaries of semistructured databases, enabling schema exploration and improving query processing. Unfortunately, DataGuides can be very expensive to compute, especially for large, cyclic databases. For many DataGuide uses, an "approximate" summary of the database's structure can be beneficial yet much cheaper to compute. We summarize several uses of DataGuides and define Approximate DataGuides (ADGs), which relax certain aspects of the DataGuide definition. An ADG allows some inaccuracy yet retains properties that make it useful in numerous situations. The core of the paper presents two general approaches for building ADGs, describing algorithms and experimental results.
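To make the cost concern concrete, the following is a minimal sketch of how an exact DataGuide can be built over a small labeled graph, in the style of a subset construction (as in NFA-to-DFA conversion). It is not one of the ADG algorithms from the report. The adjacency-list representation, the function name `build_dataguide`, and the sample database `db` are all assumptions made for illustration. Each DataGuide node corresponds to a set of database objects, which is why the summary can grow large on cyclic or irregular data.

```python
from collections import deque

def build_dataguide(edges, root):
    """Build an exact DataGuide by subset construction (illustrative sketch).

    edges: dict mapping an object id to a list of (label, child id) pairs.
    Returns (guide, start), where guide maps each target set (a frozenset of
    object ids) to a dict {label: target set}.
    """
    start = frozenset([root])
    guide = {}
    queue = deque([start])
    while queue:
        targets = queue.popleft()
        if targets in guide:
            continue  # already expanded; this also terminates cycles
        # Group the children of all objects in this target set by label.
        by_label = {}
        for obj in targets:
            for label, child in edges.get(obj, []):
                by_label.setdefault(label, set()).add(child)
        guide[targets] = {lbl: frozenset(kids) for lbl, kids in by_label.items()}
        for nxt in guide[targets].values():
            if nxt not in guide:
                queue.append(nxt)
    return guide, start

# Hypothetical OEM-like database with one cycle (object 5 points back to 2).
db = {
    1: [("restaurant", 2), ("restaurant", 3)],
    2: [("name", 4), ("owner", 5)],
    3: [("name", 6)],
    5: [("owns", 2)],
}

guide, start = build_dataguide(db, 1)
```

Every label path of the database appears exactly once in `guide`, because each node has at most one outgoing edge per label. The price is that the number of distinct target sets can, in the worst case, be exponential in the number of database objects, which is the expense that motivates an approximate summary.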

Item Type: Techreport (Technical Report)
Uncontrolled Keywords: DataGuides, semistructured data, Lore, OEM
Subjects: Computer Science > Semistructured Data
Related URLs: Project Homepage
ID Code: 412
Deposited By: Import Account
Deposited On: 25 Feb 2000 16:00
Last Modified: 28 Dec 2008 09:11
