Ikeda, Robert and Widom, Jennifer (2009) Data Lineage: A Survey. Technical Report. Stanford InfoLab.
Lineage, or provenance, in its most general form describes where data came from, how it was derived, and how it was updated over time. Information management systems today exploit lineage in tasks ranging from data verification in curated databases to confidence computation in probabilistic databases. Here, we formalize and categorize lineage, discuss a set of selected papers, and then identify open problems in lineage research.
|Item Type:||Techreport (Technical Report)|
|Deposited By:||Robert Ikeda|
|Deposited On:||13 Apr 2009 14:35|
|Last Modified:||04 Aug 2009 17:07|
Repository Staff Only: item control page