Perception in Multimodal Dialogue Systems [electronic resource] :4th IEEE Tutorial and Research Workshop on Perception and Interactive Technologies for Speech-Based Systems, PIT 2008, Kloster Irsee, Germany, June 16-18, 2008. Proceedings /
Contributor(s): André, Elisabeth [editor.] | Dybkjær, Laila [editor.] | Minker, Wolfgang [editor.] | Neumann, Heiko [editor.] | Pieraccini, Roberto [editor.] | Weber, Michael [editor.] | SpringerLink (Online service).Material type: BookSeries: Lecture Notes in Computer Science: 5078Publisher: Berlin, Heidelberg : Springer Berlin Heidelberg, 2008.Description: X, 311 p. online resource.Content type: text Media type: computer Carrier type: online resourceISBN: 9783540693697.Subject(s): Computer science | User interfaces (Computer systems) | Artificial intelligence | Computational linguistics | Image processing | Application software | Computers and civilization | Computer Science | Computer Applications | User Interfaces and Human Computer Interaction | Language Translation and Linguistics | Artificial Intelligence (incl. Robotics) | Computers and Society | Image Processing and Computer VisionOnline resources: Click here to access online
Invited Keynote -- Whence and Whither: The Automatic Recognition of Emotions in Speech (Invited Keynote) -- Multimodal and Spoken Dialogue Systems -- A Generic Spoken Dialogue Manager Applied to an Interactive 2D Game -- Adaptive Dialogue Management in the NIMITEK Prototype System -- Adaptive Search Results Personalized by a Fuzzy Recommendation Approach -- Factors Influencing Modality Choice in Multimodal Applications -- Codebook Design for Speech Guided Car Infotainment Systems -- Evaluating Text Normalization for Speech-Based Media Selection -- Classification of Spoken Utterances and Sound -- A Two Phases Statistical Approach for Dialog Management -- Detecting Problematic Dialogs with Automated Agents -- Call Classification with Hundreds of Classes and Hundred Thousands of Training Utterances ... ... and No Target Domain Data -- Hard vs. Fuzzy Clustering for Speech Utterance Categorization -- Static and Dynamic Modelling for the Recognition of Non-verbal Vocalisations in Conversational Speech -- Recognition of Eye Gaze, Head Pose, Mimics and Lip Movements -- Writing with Your Eye: A Dwell Time Free Writing System Adapted to the Nature of Human Eye Gaze -- Unsupervised Learning of Head Pose through Spike-Timing Dependent Plasticity -- Spoken Word Recognition from Side of Face Using Infrared Lip Movement Sensor -- Neurobiologically Inspired, Multimodal Intention Recognition for Technical Communication Systems (NIMITEK) -- Speech Recognition -- Deploying DSR Technology on Today’s Mobile Phones: A Feasibility Study -- Real-Time Recognition of Isolated Vowels -- Improving Robustness in Jacobian Adaptation for Noisy Speech Recognition -- Comparing Linear Feature Space Transformations for Correlated Features -- Vocal Emotion Recognition and Annotation -- EmoVoice — A Framework for Online Recognition of Emotions from Voice -- Real-Time Emotion Recognition Using Echo State Networks -- Emotion Classification of Audio Signals Using Ensemble of Support Vector Machines -- On the Influence of Phonetic Content Variation for Acoustic Emotion Recognition -- On the Use of Kappa Coefficients to Measure the Reliability of the Annotation of Non-acted Emotions -- Annotation of Emotion in Dialogue: The Emotion in Cooperation Project -- Human-Like Social Dialogue -- Potential Benefits of Human-Like Dialogue Behaviour in the Call Routing Domain -- Human-Likeness in Utterance Generation: Effects of Variability -- Designing Socially Aware Conversational Agents -- A Prototype for Future Spoken Dialog Systems Using an Embodied Conversational Agent -- Innovative Interfaces in MonAMI: The Reminder -- Evaluation Methods -- Evaluation Methods for Multimodal Systems: A Comparison of Standardized Usability Questionnaires -- Subjective Evaluation Method for Speech-Based Uni- and Multimodal Applications -- Weighting the Coefficients in PARADISE Models to Increase Their Generalizability -- EXPROS: A Toolkit for Exploratory Experimentation with Prosody in Customized Diphone Voices -- Automatic Evaluation Tool for Multimodal Dialogue Systems -- Towards a Perception-Based Evaluation Model for Spoken Dialogue Systems.
This book constitutes the refereed proceedings of the 4th IEEE Tutorial and Research Workshop on Perception and Interactive Technologies for Speech-Based Systems, PIT 2008, held in Kloster Irsee, Germany, in June 2008. The 37 revised full papers presented together with 1 invited keynote lecture were carefully selected from numerous submissions for inclusion in the book. The papers are organized in topical sections on multimodal and spoken dialogue systems, classification of dialogue acts and sound, recognition of eye gaze, head poses, mimics and speech as well as combinations of modalities, vocal emotion recognition, human-like and social dialogue systems, and evaluation methods for multimodal dialogue systems.