Clustering in image space for place recognition and visual annotations for human-robot interaction

  • Authors:
  • A. M. Martinez; J. Vitria

  • Affiliations:
  • Robot Vision Lab., Purdue Univ., West Lafayette, IN; -

  • Venue:
  • IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
  • Year:
  • 2001

Abstract

The classical way of approaching the vision-guided navigation problem for autonomous robots is to use three-dimensional (3-D) geometrical descriptions of the scene, in what are known as model-based approaches. However, these approaches do not facilitate the user's task, because they require the user to supply geometrically precise models of the 3-D environment. In this paper, we propose the use of “annotations” posted on some type of blackboard or “descriptive” map to facilitate this user-robot interaction. We show that, with this technique, user commands can be as simple as “go to label 5.” To build such a mechanism, new approaches to vision-guided mobile robot navigation are needed. We show that this can be achieved by using mixture models within an appearance-based paradigm. Mixture models are more useful in practice than other pattern recognition methods, such as principal component analysis (PCA) or Fisher discriminant analysis (FDA), also known as linear discriminant analysis (LDA), because they can represent nonlinear subspaces. However, because mixture models are usually learned with the expectation-maximization (EM) algorithm, a gradient-ascent technique, the system cannot always converge to the desired solution; it can become trapped in local maxima of the likelihood. To resolve this, we use a genetic version of the EM algorithm. Finally, we demonstrate the capabilities of this approach on a navigation task that uses the “annotations” described above.
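
The computational idea the abstract points to is fitting a mixture model to image-space (appearance) data with EM, while using genetic operators over a population of candidate parameterizations to escape local maxima of the likelihood. The paper's exact genetic operators are not given in the abstract, so the following is only a minimal Python sketch of one plausible genetic-EM scheme for a Gaussian mixture: the truncation selection, mean-jitter mutation, population size, and the names em_step and genetic_em are illustrative assumptions, not the authors' method.

    import numpy as np

    def em_step(X, pi, mu, var):
        """One EM iteration for a mixture of spherical Gaussians.

        X: (n, d) data; pi: (k,) weights; mu: (k, d) means; var: (k,) variances.
        Returns updated parameters and the data log-likelihood.
        """
        n, d = X.shape
        k = len(pi)
        # E-step: per-component log densities, kept in log-space for stability.
        log_p = np.stack([
            np.log(pi[j])
            - 0.5 * d * np.log(2.0 * np.pi * var[j])
            - 0.5 * np.sum((X - mu[j]) ** 2, axis=1) / var[j]
            for j in range(k)
        ], axis=1)                                   # shape (n, k)
        log_norm = np.logaddexp.reduce(log_p, axis=1, keepdims=True)
        r = np.exp(log_p - log_norm)                 # responsibilities; rows sum to 1
        # M-step: re-estimate weights, means, and variances from responsibilities.
        nk = r.sum(axis=0) + 1e-12
        pi = nk / n
        mu = (r.T @ X) / nk[:, None]
        var = np.array([
            (r[:, j] * np.sum((X - mu[j]) ** 2, axis=1)).sum() / (d * nk[j])
            for j in range(k)
        ]) + 1e-6                                    # small floor avoids collapsed components
        return pi, mu, var, log_norm.sum()

    def genetic_em(X, k=3, pop_size=8, generations=25, em_iters=3, seed=0):
        """Genetic EM sketch: evolve a population of mixture parameterizations,
        scoring each individual by log-likelihood after a few EM refinement steps."""
        rng = np.random.default_rng(seed)
        n, d = X.shape
        # Random initial individuals: means drawn from the data itself.
        population = []
        for _ in range(pop_size):
            mu = X[rng.choice(n, size=k, replace=False)].copy()
            population.append((np.full(k, 1.0 / k), mu, np.full(k, X.var())))
        best = None
        for _ in range(generations):
            scored = []
            for pi, mu, var in population:
                for _ in range(em_iters):            # local refinement by EM
                    pi, mu, var, ll = em_step(X, pi, mu, var)
                scored.append((ll, pi, mu, var))
            scored.sort(key=lambda s: s[0], reverse=True)
            if best is None or scored[0][0] > best[0]:
                best = scored[0]
            elite = scored[: pop_size // 2]          # truncation selection
            population = [(pi, mu, var) for _, pi, mu, var in elite]
            for _, pi, mu, var in elite:             # mutation: jitter the means
                population.append((pi.copy(),
                                   mu + rng.normal(0.0, 0.1 * X.std(), mu.shape),
                                   var.copy()))
        return best                                  # (log-likelihood, pi, mu, var)

A toy run on two well-separated 2-D clusters shows the intended behavior: the population converges to a fit whose means sit near the true cluster centers even when some individuals start badly.

    rng = np.random.default_rng(3)
    X = np.vstack([rng.normal(0.0, 1.0, (200, 2)),
                   rng.normal(5.0, 1.0, (200, 2))])
    ll, pi, mu, var = genetic_em(X, k=2)
    print("log-likelihood:", ll)
    print("recovered means:\n", mu)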