An architecture for vision and action

Authors:
R. James Firby;Roger E. Kahn;Peter N. Prokopowicz;Michael J. Swain
Affiliations:
Department of Computer Science, University of Chicago, Chicago, IL;Department of Computer Science, University of Chicago, Chicago, IL;Department of Computer Science, University of Chicago, Chicago, IL;Department of Computer Science, University of Chicago, Chicago, IL
Venue:
IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 1
Year:
1995

Citing 7
Cited 20

Dynamic monocular machine vision

Machine Vision and Applications
Obstacle Avoidance Using Flow Field Divergence

IEEE Transactions on Pattern Analysis and Machine Intelligence
Color indexing

International Journal of Computer Vision
Specialization of perceptual processes

Specialization of perceptual processes
A computational and evolutionary perspective on the role of representation in vision

CVGIP: Image Understanding
What I have learned

CVGIP: Image Understanding
Toward general vision

CVGIP: Image Understanding

Integrating range and object data for robot navigation

AGENTS '97 Proceedings of the first international conference on Autonomous agents
Cooperating with people: the intelligent classroom

AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Template-based recognition of pose and motion gestures on a mobile robot

AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Integrating POMDP and reinforcement learning for a two layer simulated robot architecture

Proceedings of the third annual conference on Autonomous Agents
Improving human computer interaction in a classroom environment using computer vision

Proceedings of the 5th international conference on Intelligent user interfaces
The intelligent classroom: providing competent assistance

Proceedings of the fifth international conference on Autonomous agents
Plan-based interfaces: keeping track of user tasks and acting to cooperate

Proceedings of the 7th international conference on Intelligent user interfaces
AV-Shell, an Environment for Autonomous Robotic Applications Using Active Vision

Autonomous Robots
A Gesture Based Interface for Human-Robot Interaction

Autonomous Robots
Multimodal event parsing for intelligent user interfaces

Proceedings of the 8th international conference on Intelligent user interfaces
Intelligencer

IEEE Intelligent Systems
Use of Cognitive Robotics Logic in a Double Helix Architecture for Autonomous Systems

Revised Papers from the International Seminar on Advances in Plan-Based Control of Robotic Agents,
The ThreadMill architecture for stream-oriented human communication analysis applications

Proceedings of the 6th international conference on Multimodal interfaces
Modeling embodied visual behaviors

ACM Transactions on Applied Perception (TAP)
Concurrent reactive plans: anticipating and forestalling execution failures

Concurrent reactive plans: anticipating and forestalling execution failures
Plan-based control of robotic agents: improving the capabilities of autonomous robots

Plan-based control of robotic agents: improving the capabilities of autonomous robots
Navigation for everyday life

AAAI'96 Proceedings of the thirteenth national conference on Artificial intelligence - Volume 2
GARGOYLE: an environment for real-time, context-sensitive active vision

AAAI'96 Proceedings of the thirteenth national conference on Artificial intelligence - Volume 2
Context-Dependent task computing in pervasive environment

UCS'06 Proceedings of the Third international conference on Ubiquitous Computing Systems
Modeling the brain's operating system

BVAI'05 Proceedings of the First international conference on Brain, Vision, and Artificial Intelligence

Quantified Score

Hi-index	0.01

Visualization

Abstract

Vision systems that have successfully supported nontrivial tasks have invariably taken advantage of constraints derived from the task and environment to increase reliability and lower the complexity of perception. We propose that it is possible to build a general purpose vision system, that is, one that can support a wide variety of tasks, and take advantage of such constraints. The central idea within our proposed architecture is the reactive skill. Skills are concurrent control routines assembled at run time using instructions from a symbolic execution system. Visual modules are used as resources in the construction of these skills. Skills control the agent as continuous feedback loops but are constructed using discrete, symbolic instructions. The key to general-purpose vision is the ability to parametrize the primitive elements of the vision system and to compose visual and control routines in a variety of ways. We demonstrate the architecture in the context of an implemented example task of a robot collecting trash off a floor and depositing it in a garbage can.