Recent advances in the multi-stream HMM/ANN hybrid approach to noise robust ASR [An article from: Computer Speech & Language] Buy on Amazon

https://www.ebooknetworking.net/books_detail-B000RR2LD0.html

Recent advances in the multi-stream HMM/ANN hybrid approach to noise robust ASR [An article from: Computer Speech & Language]

PublisherElsevier

Book Details

PublisherElsevier
ISBN / ASINB000RR2LD0
ISBN-13978B000RR2LD6
MarketplaceUnited Kingdom  🇬🇧

Description

This digital document is a journal article from Computer Speech & Language, published by Elsevier in 2005. The article is delivered in HTML format and is available in your Amazon.com Media Library immediately after purchase. You can view it with any web browser.

Description:
In this article we review several successful extensions to the standard hidden-Markov-model/artificial neural network (HMM/ANN) hybrid, which have recently made important contributions to the field of noise robust automatic speech recognition. The first extension to the standard hybrid was the ''multi-band hybrid'', in which a separate ANN is trained on each frequency sub-band, followed by some form of weighted combination of ANN state posterior probability outputs prior to decoding. However, due to the inaccurate assumption of sub-band independence, this system usually gives degraded performance, except in the case of narrow-band noise. All of the systems which we review overcome this independence assumption and give improved performance in noise, while also improving or not significantly degrading performance with clean speech. The ''all-combinations multi-band'' hybrid trains a separate ANN for each sub-band combination. This, however, typically requires a large number of ANNs. The ''all-combinations multi-stream'' hybrid trains an ANN expert for every combination of just a small number of complementary data streams. Multiple ANN posteriors combination using maximum a-posteriori (MAP) weighting gives rise to the further successful strategy of hypothesis level combination by MAP selection. An alternative strategy for exploiting the classification capacity of ANNs is the ''tandem hybrid'' approach in which one or more ANN classifiers are trained with multi-condition data to generate discriminative and noise robust features for input to a standard ASR system. The ''multi-stream tandem hybrid'' trains an ANN for a number of complementary feature streams, permitting multi-stream data fusion. The ''narrow-band tandem hybrid'' trains an ANN for a number of particularly narrow frequency sub-bands. This gives improved robustness to noises not seen during training. Of the systems presented, all of the multi-stream systems provide generic models for multi-modal data fusion. Test results for each system are presented and discussed.
Donate to EbookNetworking
Prev
Next