Stanford InfoLab Publication Server

Data Lineage: A Survey

Ikeda, Robert and Widom, Jennifer (2009) Data Lineage: A Survey. Technical Report. Stanford InfoLab.

Lineage, or provenance, in its most general form describes where data came from, how it was derived, and how it was updated over time. Information management systems today exploit lineage in tasks ranging from data verification in curated databases to confidence computation in probabilistic databases. Here, we formalize and categorize lineage, discuss a set of selected papers, and then identify open problems in lineage research.

Item Type: Techreport (Technical Report)
ID Code: 918
Deposited By: Robert Ikeda
Deposited On: 13 Apr 2009 14:35
Last Modified: 04 Aug 2009 17:07
