Stanford InfoLab Publication Server

Incorporating Uncertainty in Data Management and Integration

Agrawal, Parag (2012) Incorporating Uncertainty in Data Management and Integration. PhD thesis, Stanford University.




Modern-day applications like information extraction on the web, data integration, entity resolution, scientific data management, and sensor data management are all required to cope with uncertainty in data. Motivated by this observation, recent years have witnessed a surge of research in the field of uncertain databases. The basic goal of this research is to abstract the common challenges and develop principled, general, and efficient techniques for dealing with uncertainty in the context of data management systems. This thesis makes advances in the field of uncertain data management by presenting efficient techniques for managing and integrating uncertain data. Specifically, the contributions may be classified under three areas: (1) Generalizing: We generalize uncertain databases to incorporate continuous probability distributions and incomplete information; (2) Integration: We establish foundations for integration of uncertain data sources; (3) Efficiency: We develop efficient algorithms for joins and indexing over uncertain data.

Item Type:Thesis (PhD)
ID Code:1053
Deposited By:Parag Agrawal
Deposited On:30 Aug 2012 14:42
Last Modified:30 Aug 2012 14:42

Download statistics

Repository Staff Only: item control page