Stanford InfoLab Publication Server

Uncertainty in Data Integration

Das Sarma, Anish and Dong, Xin and Halevy, Alon (2008) Uncertainty in Data Integration. Technical Report. Stanford InfoLab. (Publication Note: In "Managing and Mining Uncertain Data" Ed. Charu Aggarwal, Springer, 2008)




Data integration has been an important area of research for several years. In this chapter, we argue that supporting modern data integration applications requires systems to handle uncertainty at every step of integration. We provide a formal framework for data integration systems with uncertainty. We define probabilistic schema mappings and probabilistic mediated schemas, show how they can be constructed automatically for a set of data sources, and provide techniques for query answering. The foundations laid out in this chapter enable bootstrapping a pay-as-you-go integration system completely automatically.

Item Type:Techreport (Technical Report)
Additional Information:This chapter will appear in the following book: "Managing and Mining Uncertain Data" Ed. Charu Aggarwal, Springer.
Uncontrolled Keywords:data integration, uncertainty, pay-as-you-go, mediated schema, schema mapping
ID Code:845
Deposited By:Import Account
Deposited On:21 Jul 2008 17:00
Last Modified:10 Dec 2008 15:48

Download statistics

Repository Staff Only: item control page