A generic platform for addressing the multimodal challenge
CHI '95 Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Assessing face and speech consistency for monologue detection in video
Proceedings of the tenth ACM international conference on Multimedia
“Put-that-there”: Voice and gesture at the graphics interface
SIGGRAPH '80 Proceedings of the 7th annual conference on Computer graphics and interactive techniques
Automatic image annotation and retrieval using cross-media relevance models
Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
The Journal of Machine Learning Research
Mutual disambiguation of 3D multimodal interaction in augmented and virtual reality
Proceedings of the 5th international conference on Multimodal interfaces
Optimal multimodal fusion for multimedia data analysis
Proceedings of the 12th annual ACM international conference on Multimedia
Skin Segmentation Using Color Pixel Classification: Analysis and Comparison
IEEE Transactions on Pattern Analysis and Machine Intelligence
Image retrieval: Ideas, influences, and trends of the new age
ACM Computing Surveys (CSUR)
Search strategies in multimodal image retrieval
Proceedings of the second international symposium on Information interaction in context
A collaborative Bayesian image retrieval framework
ICASSP '09 Proceedings of the 2009 IEEE International Conference on Acoustics, Speech and Signal Processing
Score normalization in multimodal biometric systems
Pattern Recognition
Journal of Signal Processing Systems
AVBPA'03 Proceedings of the 4th international conference on Audio- and video-based biometric person authentication
Recognizing Human Emotional State From Audiovisual Signals*
IEEE Transactions on Multimedia
A unified framework for image retrieval using keyword and visual features
IEEE Transactions on Image Processing
Event detection in field sports video using audio-visual features and a support vector Machine
IEEE Transactions on Circuits and Systems for Video Technology
Hi-index | 0.00 |
This paper outlines several multimedia systems that utilize a multimodal approach. These systems include audiovisual based emotion recognition, image and video retrieval, and face and head tracking. Data collected from diverse sources/sensors are employed to improve the accuracy of correctly detecting, classifying, identifying, and tracking of a desired object or target. It is shown that the integration of multimodality data will be more efficient and potentially more accurate than if the data was acquired from a single source. A number of cutting-edge applications for multimodal systems will be discussed. An advanced assistance robot using the multimodal systems will be presented.