A Caltech Library Service


Gomes, Ryan and Welinder, Peter and Krause, Andreas and Perona, Pietro (2011) Crowdclustering. Computation & Neural Systems Technical Report, CNS-TR. California Institute of Technology , Pasadena, CA. (Submitted)

WarningThere is a more recent version of this item available.

PDF - Submitted Version
See Usage Policy.


Use this Persistent URL to link to this item:


Is it possible to crowdsource categorization? Amongst the challenges: (a) each worker has only a partial view of the data, (b) different workers may have different clustering criteria and may produce different numbers of categories, (c) the underlying category structure may be hierarchical. We propose a Bayesian model of how workers may approach clustering and show how one may infer clusters / categories, as well as worker parameters, using this model. Our experiments, carried out on large collections of images, suggest that Bayesian crowdclustering works well and may be superior to single-expert annotations.

Item Type:Report or Paper (Technical Report)
Additional Information:This work was supported by ONR MURI grant N00014-06-1-0734 and NSF grant IIS-0953413.
Group:Computation & Neural Systems Technical Reports
Funding AgencyGrant Number
Office of Naval Research (ONR) Multidisciplinary University Research Initiative (MURI) N00014-06-1-0734
Record Number:CaltechAUTHORS:20110628-202526159
Persistent URL:
Usage Policy:No commercial reproduction, distribution, display or performance rights in this work are provided.
ID Code:24244
Deposited By: Ryan Gomes
Deposited On:29 Jun 2011 21:29
Last Modified:26 Dec 2012 13:21

Available Versions of this Item

Repository Staff Only: item control page