Gyongyi, Zoltan and Berkhin, Pavel and Garcia-Molina, Hector and Pedersen, Jan (2006) Link Spam Detection Based on Mass Estimation. In: 32nd International Conference on Very Large Data Bases (VLDB 2006), September 12-15, 2006, Seoul, Korea.
This is the latest version of this item.
Link spamming intends to mislead search engines and trigger an artificially high link-based ranking of specific target web pages. This paper introduces the concept of spam mass, a measure of the impact of link spamming on a page's ranking. We discuss how to estimate spam mass and how the estimates can help identifying pages that benefit significantly from link spamming. In our experiments on the host-level Yahoo! web graph we use spam mass estimates to successfully identify tens of thousands of instances of heavy-weight link spamming.
|Item Type:||Conference or Workshop Item (Paper)|
|Uncontrolled Keywords:||web search; link spam|
|Subjects:||Computer Science > Databases and the Web|
|Related URLs:||Project Homepage||http://infolab.stanford.edu/|
|Deposited By:||Import Account|
|Deposited On:||06 Nov 2006 16:00|
|Last Modified:||18 Dec 2008 14:44|
Available Versions of this Item
- Link Spam Detection Based on Mass Estimation. (deposited 02 Nov 2005 16:00)
- Link Spam Detection Based on Mass Estimation. (deposited 06 Nov 2006 16:00) [Currently Displayed]
Repository Staff Only: item control page