Cognitively Inspired Audiovisual Speech Filtering

Towards an Intelligent, Fuzzy Based, Multimodal, Two-Stage Speech Enhancement System

Amir Hussain, Andrew Abel

PDF
approx. 53.49

Springer International Publishing

Natural Sciences, Medicine, Computer Science, Technology / Application Software

Description

This book summarizes the cognitively inspired basis of multimodal speech enhancement, covering the relationship between the audio and visual modalities in speech as well as recent research into audiovisual speech correlation. It also discusses a number of audiovisual speech filtering approaches that exploit this relationship. A novel multimodal speech enhancement system that uses both visual and audio information to filter speech is presented, and the book explores extending this system with fuzzy logic to demonstrate an initial implementation of an autonomous, adaptive, and context-aware multimodal system. Finally, the work discusses the challenges of testing such a system, the limitations of many current audiovisual speech corpora, and a suitable approach to developing a corpus designed to test this novel, cognitively inspired speech filtering system.


Keywords

Multimodal processing, Hearing and listening devices, Audiovisual, Speech filtering, Fuzzy Logic