Published December 7, 1999 | Version Published
Journal Article Open

Image recognition: Visual grouping, recognition, and learning

Abstract

Vision extracts useful information from images. Reconstructing the three-dimensional structure of our environment and recognizing the objects that populate it are among the most important functions of our visual system. Computer vision researchers study the computational principles of vision and aim at designing algorithms that reproduce these functions. Vision is difficult: the same scene may give rise to very different images depending on illumination and viewpoint. Typically, an astronomical number of hypotheses exist that in principle have to be analyzed to infer a correct scene description. Moreover, image information might be extracted at different levels of spatial and logical resolution dependent on the image processing task. Knowledge of the world allows the visual system to limit the amount of ambiguity and to greatly simplify visual computations. We discuss how simple properties of the world are captured by the Gestalt rules of grouping, how the visual system may learn and organize models of objects for recognition, and how one may control the complexity of the description that the visual system computes.

Additional Information

© 1999 by the National Academy of Sciences. This paper is a summary of a session presented at the fifth annual German-American Frontiers of Science symposium, held June 10–13, 1999, at the Alexander von Humboldt Foundation in Potsdam, Germany.

Attached Files

Published - BUHpnas99.pdf

Files

BUHpnas99.pdf

Files (130.6 kB)

Name Size Download all
md5:eedc6609c3e633dd831f2a2a82ba2921
130.6 kB Preview Download

Additional details

Identifiers

PMCID
PMC33948
Eprint ID
1491
Resolver ID
CaltechAUTHORS:BUHpnas99

Dates

Created
2006-01-23
Created from EPrint's datestamp field
Updated
2021-11-08
Created from EPrint's last_modified field