Stanford InfoLab Publication Server

Trio: A System for Integrated Management of Data, Accuracy, and Lineage

Widom, Jennifer (2004) Trio: A System for Integrated Management of Data, Accuracy, and Lineage. Technical Report. Stanford InfoLab.




Trio is a new database system that manages not only data, but also the accuracy and lineage of the data. Approximate (uncertain, probabilistic, incomplete, fuzzy, and imprecise!) databases have been proposed in the past, and the lineage problem has also been studied. The goals of the Trio project are to distill previous work into a simple and usable model, design a query language as an understandable extension to SQL, and, most importantly, build a working system: one that augments conventional data management with both accuracy and lineage as an integral part of the data. This paper provides numerous motivating applications for Trio and lays out preliminary plans for the data model, query language, and prototype system.

Item Type: Techreport (Technical Report)
Uncontrolled Keywords: data, accuracy, approximation, uncertainty, lineage
Related URLs: Project Homepage
ID Code: 658
Deposited By: Import Account
Deposited On: 05 Aug 2004 17:00
Last Modified: 23 Dec 2008 09:52

