Stanford InfoLab Publication Server

Searching Near-Replicas of Images via Clustering

Chang, E. and Li, C. and Wang, J. and Mork, P. and Wiederhold, G. (1999) Searching Near-Replicas of Images via Clustering. In: SPIE Multimedia Storage and Archiving Systems IV, September 20, 1999, Boston, MA.




Searching Near-Replicas of Images via Clustering Edward Chang, Chen Li, James Wang, Peter Mork and Gio Wiederhold Department of Computer Science, Stanford University echang,chenli,wangz,pmork, Abstract Internet piracy has been one of the major concerns for Web publishing. In study we present a system, RIME, that we have prototyped for detecting unauthorized image copying on the World-Wide Web. To speed up the copy detection, RIME uses a new clustering/hashing approach that first clusters similar images on adjacent disk cylinders and then builds indexes to access the clusters made in this way. Searching for the replicas of an image often takes just one IO to look up the location of the cluster containing similar objects and one sequential file IO to read in this cluster. Our experimental results show that RIME can detect image copies both more efficiently and effectively than the traditional content-based image retrieval systems that use tree-like structures to index images. In addition, RIME copes well with image format conversion, resampling, requantization and geometric transformations. Keywords: clustering, copy detection, multidimensional indexes, similarity search.

Item Type:Conference or Workshop Item (Paper)
Uncontrolled Keywords:copy detection, wavelets, multidimensional indexes
Subjects:Computer Science > Databases and the Web
Projects:Image Database
Digital Libraries
Related URLs:Project Homepage, Project Homepage,
ID Code:391
Deposited By:Import Account
Deposited On:25 Feb 2000 16:00
Last Modified:27 Dec 2008 21:02

Download statistics

Repository Staff Only: item control page