Definition
Multimodal interfaces process two or more combined user input modes, such as speech, pen, touch, manual gestures, and gaze, in a coordinated manner with multimedia system output.
They are a new class of emerging systems that aim to recognize naturally occurring forms of human language and behavior, with the incorporation of one or more recognition-based technologies (e.g., speech, pen, vision). Multimodal interfaces represent a paradigm shift away from conventional graphical user interfaces (Fig. 1). They are being developed largely because they offer a relatively expressive, transparent, efficient, robust, and highly mobile form of human–computer interaction. They represent users’ preferred interaction style, and they support users’ ability to flexibly combine modalities or to switch from one input mode to another that may be better suited to a particular task or setting.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
S.L. Oviatt, P.R. Cohen, L. Wu, J. Vergo, L. Duncan, B. Suhm, J. Bers, T. Holzman, T. Winograd, J. Landay, J. Larson, and D. Ferro, “Designing the User Interface for Multimodal Speech and Gesture Applications: State-of-the-Art Systems and Research Directions,” J. Carroll (Ed.), “Human Computer Interaction,” Vol. 15, No. 4, 2000, pp. 263–322 (also in Human-Computer Interaction in the New Millennium, Reading, MA.: Addison-Wesley, 2001).
G. Potamianos, C. Neti, J. Luettin, and I. Matthews, “Audio-Visual Automatic Speech Recognition: An Overview,” G. Bailly, E. Vatikiotis-Bateson, and P. Perrier (Eds.), “Issues in Visual and Audio-Visual Speech Processing,” MIT, Cambridge, MA, 2004.
S.L. Oviatt, “Multimodal Interfaces,” J. Jacko and A. Sears (Eds.), “Handbook of Human–Computer Interaction,” Lawrence Erlbaum, Mahwah, New Jersey, 2003, Chapter 14, pp. 286–304.
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag
About this entry
Cite this entry
(2008). Multimodal Interfaces. In: Furht, B. (eds) Encyclopedia of Multimedia. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-78414-4_159
Download citation
DOI: https://doi.org/10.1007/978-0-387-78414-4_159
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-74724-8
Online ISBN: 978-0-387-78414-4
eBook Packages: Computer ScienceReference Module Computer Science and Engineering