Stanford InfoLab Publication Server

Identifying Users in Social Networks with Limited Information

Vesdapunt, Norases and Garcia-Molina, Hector (2014) Identifying Users in Social Networks with Limited Information. Technical Report. Stanford InfoLab.

BibTeXDublinCoreEndNoteHTML
WarningThere is a more recent version of this item available.

[img]
Preview
PDF - Updated Version
5Mb

Abstract

We study the problem of Entity Resolution (ER) with limited information. ER is the problem of identifying and merging records that represent the same real-world entity. In this paper, we focus on the resolution of a single node g from one social graph (Google+ in our case) against a second social graph (Twitter in our case). We want to find the best match for g in Twitter, by dynamically probing the Twitter graph (using a public API), limited by the number of API calls that social systems allow. We propose two strategies that are designed for limited information and can be adapted to different limits. We evaluate our strategies against a naive one on a real dataset and show that our strategies can provide improved accuracy with significantly fewer API calls.

Item Type:Techreport (Technical Report)
ID Code:1116
Deposited By:Norases Vesdapunt
Deposited On:04 Dec 2014 17:51
Last Modified:03 Apr 2015 00:56

Download statistics

Repository Staff Only: item control page