Stanford InfoLab Publication Server

Searching the Web

Arasu, Arvind and Cho, Junghoo and Garcia-Molina, Hector and Paepcke, Andreas and Raghavan, Sriram (2000) Searching the Web. Technical Report. Stanford.




We offer an overview of current Web search engine design. After introducing a generic search engine architecture, we examine each engine component in turn. We cover crawling, local Web page storage, indexing, and the use of link analysis for boosting search performance. The most common design and implementation techniques for each of these components are presented. We draw for this presentation from the literature, and from our own experimental search engine testbed. Emphasis is on introducing the fundamental concepts, and the results of several performance analyses we conducted to compare different designs.

Item Type:Techreport (Technical Report)
Uncontrolled Keywords:Search engine, crawling, indexing, link analysis, PageRank, HITS, hubs, authorities, information retrieval
Subjects:Computer Science > Databases and the Web
Digital Libraries
Related URLs:Project Homepage
ID Code:457
Deposited By:Import Account
Deposited On:14 Dec 2000 16:00
Last Modified:27 Dec 2008 11:22

Download statistics

Repository Staff Only: item control page