Stanford InfoLab Publication Server

Link Spam Detection Based on Mass Estimation

Gyongyi, Zoltan and Berkhin, Pavel and Garcia-Molina, Hector and Pedersen, Jan (2005) Link Spam Detection Based on Mass Estimation. Technical Report. Stanford.

WarningThere is a more recent version of this item available.



Link spamming intends to mislead search engines and trigger an artificially high link-based ranking of specific target web pages. This paper introduces the concept of spam mass, a measure of the impact of link spamming on a page's ranking. We discuss how to estimate spam mass and how the estimates can help identifying pages that benefit significantly from link spamming. In our experiments on the host-level Yahoo! web graph we use spam mass estimates to successfully identify tens of thousands of instances of heavy-weight link spamming.

Item Type:Techreport (Technical Report)
Uncontrolled Keywords:web search; link spam detection
Subjects:Computer Science > Databases and the Web
Related URLs:Project Homepage
ID Code:697
Deposited By:Import Account
Deposited On:02 Nov 2005 16:00
Last Modified:22 Dec 2008 18:00

Available Versions of this Item

Download statistics

Repository Staff Only: item control page