Stanford InfoLab Publication Server

Clustering for Searching Near-Replicas of Images on the World-Wide Web

Chang, Edward and Li, Chen and Wang, James (1999) Clustering for Searching Near-Replicas of Images on the World-Wide Web. Technical Report. Stanford InfoLab.




Internet piracy has been one of the major concerns for Web publishing. In this study we present RIME, a prototype system for detecting unauthorized image copying on the World-Wide Web. To speed up copy detection, RIME uses a new clustering/hashing approach that first clusters similar images on adjacent disk cylinders and then builds indexes to access those clusters. Searching for the replicas of an image often takes just one IO to look up the location of the cluster containing similar objects and one sequential file IO to read in that cluster. Our experimental results show that RIME detects image copies both more efficiently and more effectively than traditional content-based image retrieval systems that use tree-like structures to index images. In addition, RIME copes well with image format conversion, resampling, requantization, and geometric transformations.
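
The two-step lookup described in the abstract (one index probe to locate a cluster, then one sequential read of that cluster) can be illustrated with a small in-memory sketch. This is not RIME's actual algorithm, which the abstract does not specify. It assumes images are already reduced to numeric feature vectors and swaps in a simple grid quantization as the hash. The names `cell_key`, `ClusterIndex`, `cell_size`, and `threshold` are hypothetical.

```python
import math
from collections import defaultdict


def cell_key(features, cell_size=0.25):
    """Quantize a feature vector to a grid cell; nearby vectors tend to share a key.

    Assumption: a stand-in for the report's clustering/hashing step.
    """
    return tuple(int(x // cell_size) for x in features)


def distance(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


class ClusterIndex:
    """Toy index: each dict entry plays the role of one on-disk cluster."""

    def __init__(self, cell_size=0.25):
        self.cell_size = cell_size
        self.clusters = defaultdict(list)  # key -> [(image_id, features), ...]

    def add(self, image_id, features):
        key = cell_key(features, self.cell_size)
        self.clusters[key].append((image_id, features))

    def near_replicas(self, features, threshold=0.1):
        # Step 1: a single index probe finds the cluster (one IO in the abstract).
        key = cell_key(features, self.cell_size)
        # Step 2: scan only that cluster (one sequential read in the abstract).
        bucket = self.clusters.get(key, [])
        return [iid for iid, f in bucket if distance(features, f) <= threshold]


index = ClusterIndex()
index.add("a", [0.10, 0.10])
index.add("b", [0.12, 0.11])  # slightly altered copy of "a"
index.add("c", [0.90, 0.90])  # unrelated image
matches = index.near_replicas([0.11, 0.10])
```

The query inspects only one cluster, so its cost does not grow with the number of unrelated images. A grid hash like this one can miss near-replicas that fall just across a cell boundary, which is a limitation of the sketch rather than a statement about RIME.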

Item Type: Techreport (Technical Report)
Additional Information: Previous number = SIDL-WP-1999-0104
Subjects: Computer Science > Digital Libraries
Projects: Digital Libraries
Related URLs: Project Homepage
ID Code: 416
Deposited By: Import Account
Deposited On: 30 Oct 2001 16:00
Last Modified: 27 Dec 2008 21:03
