Stanford InfoLab Publication Server

Topic-Sensitive PageRank: A Context-Sensitive Ranking Algorithm for Web Search

Haveliwala, Taher H. (2003) Topic-Sensitive PageRank: A Context-Sensitive Ranking Algorithm for Web Search. Technical Report. Stanford InfoLab. (Publication Note: IEEE Transactions on Knowledge and Data Engineering, 2003. Extended version of the WWW2002 paper on Topic-Sensitive PageRank.)

BibTeXDublinCoreEndNoteHTML

This is the latest version of this item.

[img]
Preview
PDF
266Kb

Abstract

The original PageRank algorithm for improving the ranking of search-query results computes a single vector, using the link structure of the Web, to capture the relative ``importance'' of Web pages, independent of any particular search query. To yield more accurate search results, we propose computing a {\em set} of PageRank vectors, biased using a set of representative topics, to capture more accurately the notion of importance with respect to a particular topic. For ordinary keyword search queries, we compute the topic-sensitive PageRank scores for pages satisfying the query using the topic of the query keywords. For searches done in context (e.g., when the search query is performed by highlighting words in a Web page), we compute the topic-sensitive PageRank scores using the topic of the context in which the query appeared. By using linear combinations of these (precomputed) biased PageRank vectors to generate context-specific importance scores for pages at query time, we show that we can generate more accurate rankings than with a single, generic PageRank vector.

Item Type:Techreport (Technical Report)
Additional Information:Extended version of the WWW2002 paper on Topic-Sensitive PageRank.
Uncontrolled Keywords:PageRank, link analysis, web search
Subjects:Computer Science
Projects:Miscellaneous
Related URLs:Project Homepage, Project Homepagehttp://infolab.stanford.edu/, http://infolab.stanford.edu/
ID Code:750
Deposited By:Import Account
Deposited On:16 May 2003 17:00
Last Modified:24 Dec 2008 10:09

Available Versions of this Item

Download statistics

Repository Staff Only: item control page