Gravano, L. and Garcia-Molina, H. and Tomasic, A. (1994) Precision and Recall of GlOSS Estimators for Database Discovery. Technical Report. Stanford University. (Publication Note: Part of paper appeared in Third International Conference on Parallel and Distributed Information Systems, Austin, Texas, September 28-30, 1994 (PDIS 1994))
Abstract
The availability of large numbers of network information sources has led to a new problem: finding which text databases (out of perhaps thousands of choices) are the most relevant to a query. We call this the text-database discovery problem. Our solution to this problem, GlOSS (Glossary-Of-Servers Server), keeps statistics on the available databases to decide which ones are potentially useful for a given query. In this paper we present different query-result size estimators for GlOSS and we evaluate them with metrics based on the precision and recall concepts of text-document information-retrieval theory. Our generalization of these metrics uses different notions of the set of relevant databases to define different query semantics.
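As a rough illustration of the set-based precision and recall idea mentioned in the abstract, the sketch below scores a set of databases chosen for a query against a set of databases taken to be relevant. It is not the report's own definition: the function name `precision_recall`, the database names, and the conventions for empty sets are all assumptions made for this example. The relevant set is passed in by the caller, so different notions of relevance can be substituted without changing the metric code.

```python
def precision_recall(chosen, relevant):
    """Set-based precision and recall for a database-selection decision.

    chosen   -- databases an estimator selected for a query
    relevant -- databases considered relevant under some chosen notion
    """
    chosen, relevant = set(chosen), set(relevant)
    hits = chosen & relevant
    # Returning 1.0 when a set is empty is an assumption of this sketch.
    precision = len(hits) / len(chosen) if chosen else 1.0
    recall = len(hits) / len(relevant) if relevant else 1.0
    return precision, recall


# Hypothetical data: the database names and both sets are made up.
chosen = {"db1", "db2", "db3"}
relevant = {"db2", "db3", "db4", "db5"}
p, r = precision_recall(chosen, relevant)
print(f"precision={p:.3f} recall={r:.3f}")
```

Here two of the three chosen databases are relevant, and two of the four relevant databases were chosen.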
| Item Type: | Techreport (Technical Report) | |
|---|---|---|
| Subjects: | Computer Science | |
| Projects: | Miscellaneous | |
| Related URLs: | Project Homepage | http://infolab.stanford.edu/ |
| ID Code: | 62 | |
| Deposited By: | Import Account | |
| Deposited On: | 25 Feb 2000 16:00 | |
| Last Modified: | 05 Feb 2009 15:24 | |

