University of Lincoln
Browse

The visual active memory perspective on integrated recognition systems

Version 2 2024-03-12, 21:19
Version 1 2023-10-19, 20:58
journal contribution
posted on 2024-03-12, 21:19 authored by Christian Bauckhage, Sven Wachsmuth, Marc HanheideMarc Hanheide, S. Wrede, Gerhard Sagerer, G. Heidemann, H. Ritter

Object recognition is the ability of a system to relate visual stimuli to its knowledge of the world. Although humans perform this task effortlessly and without thinking about it, a general algorithmic solution has not yet been found. Recently, a shift from devising isolated recognition techniques towards integrated systems could be observed [Y. Aloimonos, Active vision revisited, in: Y. Aloimonos (Ed.), Active Perception, Lawrence Efibaum, 1993, pp. 1–18; H. Christensen, Cognitive (vision) systems, ERCIM News (April, 2003). 17–18]. The visual active memory (VAM) perspective refines this system view towards an interactive computational framework for recognition systems in human everyday environments. VAM is in line with the recently emerged Cognitive Vision paradigm [H. Christensen, Cognitive (vision) systems, ERCIM News (April, 2003). 17–18] which is concerned with vision systems that evaluate, gather and integrate contextual knowledge for visual analysis. It consists of active processes that generate knowledge by means of a tight cooperation of perception, reasoning, learning and prior models. In addition, VAM emphasizes the dynamic representation of gathered knowledge. The memory is assumed to be structured in a hierarchy of successive memory systems that mediate the modularly defined processing components of the recognition system. Recognition and learning take place in the stress field of objects, actions, activities, scene context, and user interaction. In this paper, we exemplify the VAM perspective by means of existing demonstrator systems. Assuming three different perspectives (biological foundation, system engineering, and computer vision), we will show that the VAM concept is central to the cognitive capabilities of the system and that it leads to a more general object recognition framework.

History

School affiliated with

  • School of Computer Science (Research Outputs)

Publication Title

Image and Vision Computing

Volume

26

Issue

1

Pages/Article Number

5-14

Publisher

elsevier

ISSN

0262-8856

Date Submitted

2012-10-26

Date Accepted

2012-10-26

Date of First Publication

2012-10-26

Date of Final Publication

2012-10-26

ePrints ID

6710

Usage metrics

    University of Lincoln (Research Outputs)

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC