Simultaneous spotting of signs and fingerspellings based on hierarchical conditional random fields and boostmap embeddings

Authors:
Hee-Deok Yang;Seong-Whan Lee
Affiliations:
School of Computer Engineering, Chosun University, Seosuk-dong, Dong-ku, Gwangju 501-759, Republic of Korea;Department of Computer Science and Engineering, Korea University, Anam-dong, Seongbuk-ku, Seoul 136-713, Republic of Korea and Department of Brain and Cognitive Engineering, Korea University, Anam ...
Venue:
Pattern Recognition
Year:
2010

Citing 22
Cited 3

Real-Time American Sign Language Recognition Using Desk and Wearable Computer Based Video

IEEE Transactions on Pattern Analysis and Machine Intelligence
An HMM-Based Threshold Model Approach for Gesture Recognition

IEEE Transactions on Pattern Analysis and Machine Intelligence
Improved Boosting Algorithms Using Confidence-rated Predictions

Machine Learning - The Eleventh Annual Conference on computational Learning Theory
A framework for recognizing the simultaneous aspects of American sign language

Computer Vision and Image Understanding - Modeling people toward vision-based underatanding of a person's shape, appearance, and movement
Shape Matching and Object Recognition Using Shape Contexts

IEEE Transactions on Pattern Analysis and Machine Intelligence
Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data

ICML '01 Proceedings of the Eighteenth International Conference on Machine Learning
Properties of Embedding Methods for Similarity Searching in Metric Spaces

IEEE Transactions on Pattern Analysis and Machine Intelligence
Object Recognition from Local Scale-Invariant Features

ICCV '99 Proceedings of the International Conference on Computer Vision-Volume 2 - Volume 2
Robust Real-Time Face Detection

International Journal of Computer Vision
Automatic Sign Language Analysis: A Survey and the Future beyond Lexical Meaning

IEEE Transactions on Pattern Analysis and Machine Intelligence
Efficient Shape Matching Using Shape Contexts

IEEE Transactions on Pattern Analysis and Machine Intelligence
Robust Online Change-point Detection in Video Sequences

CVPRW '06 Proceedings of the 2006 Conference on Computer Vision and Pattern Recognition Workshop
Detecting Coarticulation in Sign Language using Conditional Random Fields

ICPR '06 Proceedings of the 18th International Conference on Pattern Recognition - Volume 02
Learning embeddings for indexing, retrieval, and classification, with applications to object and shape recognition in image databases

Learning embeddings for indexing, retrieval, and classification, with applications to object and shape recognition in image databases
Learning-based dynamic coupling of discrete and continuous trackers

Computer Vision and Image Understanding - Special issue on modeling people: Vision-based understanding of a person's shape, appearance, movement, and behaviour
Exact indexing of dynamic time warping

VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
BoostMap: An Embedding Method for Efficient Nearest Neighbor Retrieval

IEEE Transactions on Pattern Analysis and Machine Intelligence
Sign Language Recognition by Combining Statistical DTW and Independent Classification

IEEE Transactions on Pattern Analysis and Machine Intelligence
Sign Language Spotting with a Threshold Model Based on Conditional Random Fields

IEEE Transactions on Pattern Analysis and Machine Intelligence
A Unified Framework for Gesture Recognition and Spatiotemporal Gesture Segmentation

IEEE Transactions on Pattern Analysis and Machine Intelligence
Modelling and recognition of the linguistic components in American Sign Language

Image and Vision Computing
Parametric correspondence and chamfer matching: two new techniques for image matching

IJCAI'77 Proceedings of the 5th international joint conference on Artificial intelligence - Volume 2

Turkish fingerspelling recognition system using Generalized Hough Transform, interest regions, and local descriptors

Pattern Recognition Letters
Robust sign language recognition by combining manual and non-manual features based on conditional random field and support vector machine

Pattern Recognition Letters
Towards subject independent continuous sign language recognition: A segment and merge approach

Pattern Recognition

Quantified Score

Hi-index	0.01

Visualization

Abstract

A sign language consists of two types of action; signs and fingerspellings. Signs are dynamic gestures discriminated by continuous hand motions and hand configurations, while fingerspellings are a combination of continuous hand configurations. Sign language spotting is the task of detection and recognition of signs and fingerspellings in a signed utterance. The internal structures of signs and fingerspellings differ significantly. Therefore, it is difficult to spot signs and fingerspellings simultaneously. In this paper, a novel method for spotting signs and fingerspellings is proposed. It can distinguish signs, fingerspellings and non-sign patterns, and is robust to the various sizes, scales and rotations of the signer's hand. This is achieved through a hierarchical framework consisting of three steps: (1) Candidate segments of signs and fingerspellings are discriminated using a two-layer conditional random field (CRF). (2) Hand shapes of segmented signs and fingerspellings are verified using BoostMap embeddings. (3) The motions of fingerspellings are verified in order to distinguish those which have similar hand shapes and different hand motions. Experiments demonstrate that the proposed method can spot signs and fingerspellings from utterance data at rates of 83% and 78%, respectively.