Stanford InfoLab Publication Server

Representative Objects: Concise Representations of Semistructured, Hierarchical Data

Nestorov, S. and Ullman, J. and Wiener, J. and Chawathe, S. (1997) Representative Objects: Concise Representations of Semistructured, Hierarchical Data. In: 13th International Conference on Data Engineering (ICDE 1997), April 7-11, 1997, Birmingham, UK.




In this paper we introduce the representative object, which uncovers the inherent schema(s) in semistructured, hierarchical data sources and provides a concise description of the structure of the data. Semistructured data, unlike data stored in typical relational or object-oriented databases, does not have a fixed schema that is known in advance and stored separately from the data. With the rapid growth of the World Wide Web, semistructured hierarchical data sources are becoming widely available to the casual user. The lack of external schema information currently makes browsing and querying these data sources inefficient at best, and impossible at worst. We show how representative objects make schema discovery efficient and facilitate the generation of meaningful queries over the data.

Item Type: Conference or Workshop Item (Paper)
Uncontrolled Keywords: semistructured data, schema discovery
Subjects: Computer Science > Semistructured Data
Projects: Information Integration
Related URLs: Project Homepage
ID Code: 269
Deposited By: Import Account
Deposited On: 25 Feb 2000 16:00
Last Modified: 04 Jan 2009 12:04
