Gyongyi, Zoltan and Berkhin, Pavel and Garcia-Molina, Hector and Pedersen, Jan (2006) Link Spam Detection Based on Mass Estimation. In: 32nd International Conference on Very Large Data Bases (VLDB 2006), September 12-15, 2006, Seoul, Korea.
BibTeX | DublinCore | EndNote | HTML |
This is the latest version of this item.
| PDF 331Kb |
Abstract
Link spamming intends to mislead search engines and trigger an artificially high link-based ranking of specific target web pages. This paper introduces the concept of spam mass, a measure of the impact of link spamming on a page's ranking. We discuss how to estimate spam mass and how the estimates can help identifying pages that benefit significantly from link spamming. In our experiments on the host-level Yahoo! web graph we use spam mass estimates to successfully identify tens of thousands of instances of heavy-weight link spamming.
Item Type: | Conference or Workshop Item (Paper) | |
---|---|---|
Uncontrolled Keywords: | web search; link spam | |
Subjects: | Computer Science > Databases and the Web | |
Projects: | Miscellaneous | |
Related URLs: | Project Homepage | http://infolab.stanford.edu/ |
ID Code: | 854 | |
Deposited By: | Import Account | |
Deposited On: | 06 Nov 2006 16:00 | |
Last Modified: | 18 Dec 2008 14:44 |
Available Versions of this Item
- Link Spam Detection Based on Mass Estimation. (deposited 02 Nov 2005 16:00)
- Link Spam Detection Based on Mass Estimation. (deposited 06 Nov 2006 16:00) [Currently Displayed]
Download statistics
Repository Staff Only: item control page