Stanford InfoLab Publication Server

SimRank: A Measure of Structural-Context Similarity

Jeh, Glen and Widom, Jennifer (2001) SimRank: A Measure of Structural-Context Similarity. Technical Report. Stanford.




The problem of finding "similar" objects arises in many applications, and many domain-specific techniques have been developed, e.g., matching text across documents or computing overlap among item-sets. We propose a complementary approach, applicable in any domain with object-to-object relationships, that measures similarity of the structural context in which objects occur, based on their relationships with other objects. Effectively, we compute a measure that says "two objects are similar if they are related to similar objects." For a given domain, our general technique can be combined with other domain-specific similarity measures. The formalization and computation of our similarity measure, called "SimRank", is similar in spirit to previous recursive algorithms (such as PageRank) for computing importance of Web pages, although ours is more complex and expensive since we must consider object-pairs instead of single objects. We suggest techniques for efficient computation, and we provide experimental results on two application domains showing the computational feasibility and effectiveness of our approach.
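
As a rough illustration of the recursive idea "two objects are similar if they are related to similar objects," the following is a minimal sketch of a naive iterative computation, not the report's optimized method. It assumes the commonly cited formulation: an object has similarity 1 with itself, a pair has similarity 0 if either object has no incoming relationships, and otherwise the score is a decay constant times the average similarity of the two objects' in-neighbors. The function name `simrank`, the parameters `decay` and `iterations`, the value 0.8, and the toy graph are all assumptions chosen for illustration.

```python
from itertools import product


def simrank(in_neighbors, decay=0.8, iterations=10):
    """Naive iterative SimRank sketch over a directed graph.

    in_neighbors maps every node to the list of nodes that point at it.
    decay and iterations are illustrative defaults, not values from the abstract.
    """
    nodes = list(in_neighbors)
    # Base case: each object is maximally similar to itself, 0 otherwise.
    sim = {(a, b): 1.0 if a == b else 0.0 for a, b in product(nodes, repeat=2)}

    for _ in range(iterations):
        new_sim = {}
        # Scores are kept per object-pair, which is why the cost grows
        # quadratically in the number of objects.
        for a, b in product(nodes, repeat=2):
            if a == b:
                new_sim[(a, b)] = 1.0
                continue
            ia, ib = in_neighbors[a], in_neighbors[b]
            if not ia or not ib:
                # No incoming relationships: no structural evidence of similarity.
                new_sim[(a, b)] = 0.0
                continue
            # Average similarity over all pairs of in-neighbors, scaled by decay.
            total = sum(sim[(x, y)] for x in ia for y in ib)
            new_sim[(a, b)] = decay * total / (len(ia) * len(ib))
        sim = new_sim
    return sim


# Hypothetical toy graph: "hub" points at both "a" and "b".
toy_graph = {"hub": [], "a": ["hub"], "b": ["hub"]}
scores = simrank(toy_graph)
print(scores[("a", "b")])
```

In this toy graph, "a" and "b" share their only in-neighbor, so their score is driven entirely by the similarity of "hub" with itself, scaled by the decay constant. The pairwise dictionary also makes visible why the abstract calls the computation more expensive than single-object measures such as PageRank.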

Item Type: Techreport (Technical Report)
Uncontrolled Keywords: SimRank, similarity, structural-context
Subjects: Computer Science
Related URLs: Project Homepage
ID Code: 508
Deposited By: Import Account
Deposited On: 09 Oct 2001 17:00
Last Modified: 27 Dec 2008 10:04
