Stanford InfoLab Publication Server

Join Queries with External Text Sources: Execution and Optimization Techniques

Chaudhuri, S. and Dayal, U. and Yan, T. (1995) Join Queries with External Text Sources: Execution and Optimization Techniques. In: ACM International Conference on Management of Data (SIGMOD 1995), May 22-25, 1995, San Jose, California.




Text is a pervasive information type, and many applications require querying over text sources in addition to structured data. This paper studies the problem of query processing in a system that loosely integrates extensible database system and a text retrieval system. We focus on a class of conjunctive queries that include joins between text and structured data, in addition to selections over these two types of data. We adapt techniques from distributed query processing and introduce a novel class of join methods based on probing that is especially useful for joins with text systems, and we present a cost model for the various alternative query processing methods. Experimental results confirm the utility of these methods. The space of query plans is extended due to the additional techniques, and we describe an optimization algorithm for searching this extended space. The techniques we describe in this paper are applicable to other types of external data managers loosely integrated with a database system.

Item Type:Conference or Workshop Item (Paper)
Subjects:Computer Science
Related URLs:Project Homepage
ID Code:130
Deposited By:Import Account
Deposited On:25 Feb 2000 16:00
Last Modified:14 Jan 2009 14:38

Download statistics

Repository Staff Only: item control page