Gravano, Luis and Garcia-Molina, Hector and Tomasic, Anthony (1994) The effectiveness of GlOSS for the text-database discovery problem. In: ACM International Conference on Management of Data (SIGMOD 1994), May 24-27, 1994, Minneapolis, Minnesota.
The popularity of on-line document databases has led to a new problem: finding which text databases (out or many candidate choices) are the most relevant to a user. Identifying the relevant databases for a given query is the text database discovery problem. The first part of this paper presents a practical solution based on estimating the result size of a query and a database. The method is termed GLOSS-Glossary of Servers Server. The second part of this paper evaluates the effectiveness of GLOSS based on a trace of real user queries. In addition, we analyze the storage cost of our approach.
|Item Type:||Conference or Workshop Item (Paper)|
|Subjects:||Computer Science > Databases and the Web|
|Related URLs:||Project Homepage||http://www-diglib.stanford.edu/diglib/pub/|
|Deposited By:||Import Account|
|Deposited On:||19 Nov 2001 16:00|
|Last Modified:||05 Feb 2009 15:29|
Repository Staff Only: item control page