Gyongyi, Zoltan and Berkhin, Pavel and Garcia-Molina, Hector and Pedersen, Jan (2005) Link Spam Detection Based on Mass Estimation. Technical Report. Stanford.
Link spamming intends to mislead search engines and trigger an artificially high link-based ranking of specific target web pages. This paper introduces the concept of spam mass, a measure of the impact of link spamming on a page's ranking. We discuss how to estimate spam mass and how the estimates can help identifying pages that benefit significantly from link spamming. In our experiments on the host-level Yahoo! web graph we use spam mass estimates to successfully identify tens of thousands of instances of heavy-weight link spamming.
|Item Type:||Techreport (Technical Report)|
|Uncontrolled Keywords:||web search; link spam detection|
|Subjects:||Computer Science > Databases and the Web|
|Related URLs:||Project Homepage||http://infolab.stanford.edu/|
|Deposited By:||Import Account|
|Deposited On:||02 Nov 2005 16:00|
|Last Modified:||22 Dec 2008 18:00|
Available Versions of this Item
- Link Spam Detection Based on Mass Estimation. (deposited 02 Nov 2005 16:00) [Currently Displayed]
Repository Staff Only: item control page