CaltechAUTHORS
  A Caltech Library Service

Multiclass Recognition and Part Localization with Humans in the Loop

Wah, Catherine and Branson, Steve and Perona, Pietro and Belongie, Serge (2011) Multiclass Recognition and Part Localization with Humans in the Loop. In: 2011 International Conference on Computer Vision. IEEE , Piscataway, NJ, pp. 2524-2531. ISBN 978-1-4577-1102-2. https://resolver.caltech.edu/CaltechAUTHORS:20180615-160643981

Full text is not posted in this repository. Consult Related URLs below.

Use this Persistent URL to link to this item: https://resolver.caltech.edu/CaltechAUTHORS:20180615-160643981

Abstract

We propose a visual recognition system that is designed for fine-grained visual categorization. The system is composed of a machine and a human user. The user, who is unable to carry out the recognition task by himself, is interactively asked to provide two heterogeneous forms of information: clicking on object parts and answering binary questions. The machine intelligently selects the most informative question to pose to the user in order to identify the object's class as quickly as possible. By leveraging computer vision and analyzing the user responses, the overall amount of human effort required, measured in seconds, is minimized. We demonstrate promising results on a challenging dataset of uncropped images, achieving a significant average reduction in human effort over previous methods.


Item Type:Book Section
Related URLs:
URLURL TypeDescription
https://doi.org/10.1109/ICCV.2011.6126539DOIArticle
ORCID:
AuthorORCID
Perona, Pietro0000-0002-7583-5809
Additional Information:© 2011 IEEE. The authors thank Boris Babenko, Ryan Farrell, Kristen Grauman, and Peter Welinder for helpful discussions and feedback, as well as Jitendra Malik for suggesting time-to-decision as a relevant performance metric. Funding for this work was provided by the NSF GRFP for CW under Grant DGE 0707423, NSF Grant AGS-0941760, ONR MURI Grant N00014–08-1–0638, ONR MURI Grant N00014–06-1–0734, and ONR MURI Grant 1015 G NA127.
Funders:
Funding AgencyGrant Number
NSF Graduate Research FellowshipDGE-0707423
NSFAGS-0941760
Office of Naval Research (ONR)N00014-08-1-0638
Office of Naval Research (ONR)N00014-06-1-0734
Office of Naval Research (ONR)1015 G NA127
Subject Keywords:computer vision, image recognition
Record Number:CaltechAUTHORS:20180615-160643981
Persistent URL:https://resolver.caltech.edu/CaltechAUTHORS:20180615-160643981
Official Citation:C. Wah, S. Branson, P. Perona and S. Belongie, "Multiclass recognition and part localization with humans in the loop," 2011 International Conference on Computer Vision, Barcelona, 2011, pp. 2524-2531. doi: 10.1109/ICCV.2011.6126539
Usage Policy:No commercial reproduction, distribution, display or performance rights in this work are provided.
ID Code:87173
Collection:CaltechAUTHORS
Deposited By: George Porter
Deposited On:18 Jun 2018 16:36
Last Modified:03 Oct 2019 19:53

Repository Staff Only: item control page