On noise masking for automatic missing data speech recognition: A survey and discussion [An article from: Computer Speech & Language]

Name: On noise masking for automatic missing data speech recognition: A survey and discussion [An article from: Computer Speech & Language]
Author: C. Cerisara, S. Demange, J.P. Haton
ISBN: 978B000PDU1C1

Author C. Cerisara, S. Demange, J.P. Haton

Publisher Elsevier

Shop on Amazon — pick your country

🇺🇸 USA 🇨🇦 Canada 🇬🇧 UK 🇩🇪 Germany 🇫🇷 France 🇮🇳 India

7.95 USD

Buy New on Amazon 🇺🇸

Available for download now

Book Details

Author(s) C. Cerisara, S. Demange, J.P. Haton

Publisher Elsevier

ISBN / ASIN B000PDU1CE

ISBN-13 978B000PDU1C1

Availability Available for download now

Marketplace United States 🇺🇸

Description

This digital document is a journal article from Computer Speech & Language, published by Elsevier in 2007. The article is delivered in HTML format and is available in your Amazon.com Media Library immediately after purchase. You can view it with any web browser.

Description:
Automatic speech recognition (ASR) has reached very high levels of performance in controlled situations. However, the performance degrades significantly when environmental noise occurs during the recognition process. Nowadays, the major challenge is to reach a good robustness to adverse conditions, so that automatic speech recognizers can be used in real situations. Missing data theory is a very attractive and promising approach. Unlike other denoising methods, missing data recognition does not match the whole data with the acoustic models, but instead considers part of the signal as missing, i.e. corrupted by noise. While speech recognition with missing data can be handled efficiently by methods such as data imputation or marginalization, accurately identifying missing parts (also called masks) remains a very challenging task. This paper reviews the main approaches that have been proposed to address this problem. The objective of this study is to identify the mask estimation methods that have been proposed so far, and to open this domain up to other related research, which could be adapted to overcome this difficult challenge. In order to restrict the range of methods, only the techniques using a single microphone are considered.