User interface technologies has been studied in various disciplines for decades. Considering that modern CE products are usually supplied with both microphones and cameras, the challenge to employ both audio and visual information in interactive multimedia has recently received much attention in both academia and industry. Yet interactive multimedia is still an under-explored field. Many challenges exist when moving to multimodal interaction, for example: how to annotate and search huge amounts of data acquired by using multiple sensors, especially in the unconstrained end-user environments; how to effectively extract and select representative multimedia features for human behavior recognition; and how to select the fusion strategy of multimodal data for a given application. To address these challenges, existing approaches must be adapted or new solutions suitable for multimedia interaction must be found.
This book brings together high-quality and up-to-date research advances in the areas of multimedia interaction, user interfaces, and applications of consumer electronics.