Stanford InfoLab Publication Server

A Model for Data Leakage Detection

Papadimitriou, Panagiotis and Garcia-Molina, Hector (2009) A Model for Data Leakage Detection. In: 25th International Conference on Data Engineering, March 29 - April 4, 2009, Shanghai, China.


We study the following problem: A data distributor has given sensitive data to a set of supposedly trusted agents (third parties). Some of the data is leaked and found in an unauthorized place (e.g., on the web or somebody's laptop). The distributor must assess the likelihood that the leaked data came from one or more agents, as opposed to having been independently gathered by other means. We propose data allocation strategies (across the agents) that improve the probability of identifying leakages. These methods do not rely on alterations of the released data (e.g., watermarks). In some cases we can also inject "realistic but fake" data records to further improve our chances of detecting leakage and identifying the guilty party.
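
As a rough illustration of the kind of assessment described above, the following sketch estimates how likely each agent is to have leaked data, given the set of leaked objects and what each agent received. The abstract does not state a model, so everything here is an assumption made for illustration: each leaked object is taken to have been obtained independently by other means with probability `p_guess`, and otherwise to have been leaked by one of the agents holding it, each equally likely. The names `guilt_probability`, `allocations`, and `p_guess` are hypothetical.

```python
def guilt_probability(leaked, allocations, agent, p_guess=0.5):
    """Estimate the probability that `agent` leaked at least one object.

    leaked      -- set of objects found in the unauthorized place
    allocations -- dict mapping agent name -> set of objects given to that agent
    p_guess     -- assumed probability that an object was gathered
                   independently, without any agent leaking it
    """
    not_guilty = 1.0
    # Only leaked objects that this agent actually received count against it.
    for obj in leaked & allocations[agent]:
        # Number of agents who received this object and could have leaked it.
        holders = sum(1 for objs in allocations.values() if obj in objs)
        # Chance that this particular object did NOT come from `agent`.
        not_guilty *= 1.0 - (1.0 - p_guess) / holders
    return 1.0 - not_guilty


# Hypothetical allocation: object 2 is shared, objects 1 and 3 are unique.
allocations = {"A": {1, 2}, "B": {2, 3}, "C": {4}}
leaked = {1, 2}
scores = {name: guilt_probability(leaked, allocations, name) for name in allocations}
```

Under these assumptions, agent A, who alone received leaked object 1, scores higher than agent B, who only shares object 2 with A, and agent C scores zero. This is consistent with the abstract's point that how data is allocated across agents affects how well a leaker can be identified: the less the agents' sets overlap, the more sharply such an estimate separates them.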

Item Type: Conference or Workshop Item (Paper)
Uncontrolled Keywords: leakage, detection
Projects: PORTIA (DB-Privacy)
ID Code: 886
Deposited By: Panagiotis Papadimitriou
Deposited On: 24 Nov 2008 08:26
Last Modified: 24 Nov 2008 08:26
