Stanford InfoLab Publication Server

Integrating a Structured-Text Retrieval System with an Object-Oriented Database System

Yan, T. (1994) Integrating a Structured-Text Retrieval System with an Object-Oriented Database System. Technical Report. Stanford InfoLab. (Publication Note: 20th International Conference on Very Large Data Bases (VLDB 1994), September 12-15, 1994, Santiago de Chile, Chile)




We describe the integration of a structured-text retrieval system (TextMachine) into an object-oriented database system (Op[...]). Our approach is a lightweight one, using the external function capability of the database system to encapsulate the text retrieval system as an external information source. Yet we are able to provide a tight integration in the query language and processing: the user can access the text retrieval system using a standard database query language. The efficient and effective retrieval of structured text performed by the text retrieval system is seamlessly combined with the rich modeling and general-purpose querying capabilities of the database system, resulting in an integrated system with querying power beyond that of either underlying system. The integrated system also provides uniform access to textual data in the text retrieval system and structured data in the database system, thereby achieving information fusion. We discuss the design and implementation of our prototype system, and address issues such as the proper framework for external integration, the modeling of complex categorization and structure hierarchies of documents (under automatic document schema imp[...]), and techniques to reduce the performance overhead of accessing an external source.
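The external-function idea in the abstract can be illustrated with a minimal sketch. It is not the report's implementation: the class ToyTextEngine, the function names, and the sample records are all hypothetical, and result caching is assumed here as one plausible way to reduce the overhead of calling an external source; the abstract does not say which techniques the prototype uses.

```python
from dataclasses import dataclass
from functools import lru_cache


class ToyTextEngine:
    """Hypothetical stand-in for an external text retrieval system."""

    def __init__(self, documents):
        self.documents = dict(documents)  # doc_id -> text
        self.calls = 0                    # round trips to the "external" engine

    def search(self, term):
        """Return the ids of documents containing `term` as a whole word."""
        self.calls += 1
        term = term.lower()
        return tuple(sorted(
            doc_id for doc_id, text in self.documents.items()
            if term in text.lower().split()
        ))


@dataclass(frozen=True)
class Report:
    """Structured record held on the database side (invented sample schema)."""
    doc_id: int
    author: str
    year: int


def make_external_function(engine):
    """Wrap the engine as a function that a query can call like any predicate."""
    @lru_cache(maxsize=128)  # assumption: cache results to avoid repeated calls
    def contains(term):
        return frozenset(engine.search(term))
    return contains


def query(reports, contains, term, min_year):
    """Combine a structured predicate (year) with a text predicate (term)."""
    hits = contains(term)
    return [r.author for r in reports if r.year >= min_year and r.doc_id in hits]


engine = ToyTextEngine({
    1: "structured text retrieval with region indexes",
    2: "object oriented database query processing",
    3: "text retrieval inside a database system",
})
reports = [Report(1, "Lee", 1992), Report(2, "Kim", 1994), Report(3, "Ng", 1994)]
contains = make_external_function(engine)

result = query(reports, contains, "retrieval", 1993)
repeat = query(reports, contains, "retrieval", 1993)  # served from the cache
```

In this sketch a single query mixes a condition on structured data with a condition answered by the text engine, and the second identical query does not contact the engine again.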

Item Type: Techreport (Technical Report)
Subjects: Computer Science
Related URLs: Project Homepage
ID Code: 72
Deposited By: Import Account
Deposited On: 25 Feb 2000 16:00
Last Modified: 14 Jan 2009 16:23
