Stanford InfoLab Publication Server

Tagging Human Knowledge

Heymann, Paul and Paepcke, Andreas and Garcia-Molina, Hector (2010) Tagging Human Knowledge. In: Third ACM International Conference on Web Search and Data Mining (WSDM2010), February 3-6, 2010, New York City, NY, USA.


This is the latest version of this item.

PDF (Preprint Version, Last Updated November 18, 2009) - Accepted Version


A fundamental premise of tagging systems is that regular users can organize large collections for browsing and other tasks using uncontrolled vocabularies. Until now, that premise has remained relatively unexamined. Using library data, we test the tagging approach to organizing a collection. We find that tagging systems have three major large scale organizational features: consistency, quality, and completeness. In addition to testing these features, we present results suggesting that users produce tags similar to the topics designed by experts, that paid tagging can effectively supplement tags in a tagging system, and that information integration may be possible across tagging systems.

Item Type:Conference or Workshop Item (Paper)
Uncontrolled Keywords:Collaborative tagging systems, tagging, social cataloging, libraries, information organization, classifications, controlled vocabulary, metadata, synonymy, entropy, information integration, Mechanical Turk, paid taggers.
ID Code:956
Deposited By:Paul Heymann
Deposited On:13 Jan 2010 15:23
Last Modified:13 Jan 2010 15:23

Available Versions of this Item

Download statistics

Repository Staff Only: item control page