Applied Mel-Frequency Discrete Wavelet Coefficients and Parallel Model Compensation for Noise-Robust Speech Recognition

dc.contributor.author Tüfekçi, Zekeriya
dc.contributor.author Gowdy, John N.
dc.contributor.author Gürbüz, Sabri
dc.contributor.author Patterson, Eric
dc.coverage.doi 10.1016/j.specom.2006.06.006
dc.date.accessioned 2021-01-24T18:28:22Z
dc.date.available 2021-01-24T18:28:22Z
dc.date.issued 2006
dc.description.abstract Interfering noise severely degrades the performance of a speech recognition system. The Parallel Model Compensation (PMC) technique is one of the most efficient techniques for dealing with such noise. Another approach is to use features local in the frequency domain, such as Mel-Frequency Discrete Wavelet Coefficients (MFDWCs). In this paper, we investigate the use of PMC and MFDWC features to take advantage of both noise compensation and local features (MFDWCs) to decrease the effect of noise on recognition performance. We also introduce a practical weighting technique based on the noise level of each coefficient. We evaluate the performance of several wavelet-schemes using the NOISEX-92 database for various noise types and noise levels. Finally, we compare the performance of these versus Mel-Frequency Cepstral Coefficients (MFCCs), both using PMC. Experimental results show significant performance improvements for MFDWCs versus MFCCs, particularly after compensating the HMMs using the PMC technique. The best feature vector among the six MFDWCs we tried gave 13.72 and 5.29 points performance improvement, on the average, over MFCCs for -6 and 0 dB SNR, respectively. This corresponds to 39.9% and 62.8% error reductions, respectively. Weighting the partial score of each coefficient based on the noise level further improves the performance. The average error rates for the best MFDWCs dropped from 19.57% to 16.71% and from 3.14% to 2.14% for -6 dB and 0 dB noise levels, respectively, using the weighting scheme. These improvements correspond to 14.6% and 31.8% error reductions for -6 dB and 0 dB noise levels, respectively. (c) 2006 Elsevier B.V. All rights reserved. en_US
dc.identifier.doi 10.1016/j.specom.2006.06.006
dc.identifier.issn 0167-6393
dc.identifier.issn 1872-7182
dc.identifier.scopus 2-s2.0-33748459554
dc.identifier.uri https://doi.org/10.1016/j.specom.2006.06.006
dc.identifier.uri https://hdl.handle.net/11147/9739
dc.language.iso en en_US
dc.publisher Elsevier en_US
dc.relation.ispartof Speech Communication en_US
dc.rights info:eu-repo/semantics/openAccess en_US
dc.subject Noise robust ASR en_US
dc.subject Wavelets en_US
dc.subject Local feature en_US
dc.subject Feature weighting en_US
dc.title Applied Mel-Frequency Discrete Wavelet Coefficients and Parallel Model Compensation for Noise-Robust Speech Recognition en_US
dc.type Article en_US
dspace.entity.type Publication
gdc.author.institutional Tüfekçi, Zekeriya
gdc.bip.impulseclass C5
gdc.bip.influenceclass C4
gdc.bip.popularityclass C4
gdc.coar.access open access
gdc.coar.type text::journal::journal article
gdc.collaboration.industrial false
gdc.description.department İzmir Institute of Technology. Electrical and Electronics Engineering en_US
gdc.description.endpage 1307 en_US
gdc.description.issue 10 en_US
gdc.description.publicationcategory Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı en_US
gdc.description.scopusquality Q2
gdc.description.startpage 1294 en_US
gdc.description.volume 48 en_US
gdc.description.wosquality Q1
gdc.identifier.openalex W2100274538
gdc.identifier.wos WOS:000241586500006
gdc.index.type WoS
gdc.index.type Scopus
gdc.oaire.diamondjournal false
gdc.oaire.downloads 0
gdc.oaire.impulse 2.0
gdc.oaire.influence 4.7667736E-9
gdc.oaire.isgreen true
gdc.oaire.keywords Noise robust ASR
gdc.oaire.keywords Feature weighting
gdc.oaire.keywords Wavelet
gdc.oaire.keywords Local feature
gdc.oaire.popularity 6.6290267E-9
gdc.oaire.publicfunded false
gdc.oaire.sciencefields 03 medical and health sciences
gdc.oaire.sciencefields 0202 electrical engineering, electronic engineering, information engineering
gdc.oaire.sciencefields 02 engineering and technology
gdc.oaire.sciencefields 0305 other medical science
gdc.oaire.views 2
gdc.openalex.collaboration International
gdc.openalex.fwci 0.9589557
gdc.openalex.normalizedpercentile 0.74
gdc.openalex.toppercent TOP 10%
gdc.opencitations.count 26
gdc.plumx.crossrefcites 26
gdc.plumx.mendeley 20
gdc.plumx.scopuscites 31
gdc.scopus.citedcount 31
gdc.wos.citedcount 18
relation.isOrgUnitOfPublication.latestForDiscovery 9af2b05f-28ac-4003-8abe-a4dfe192da5e

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Name:
1-s2.0-S016763930600077X-main.pdf
Size:
289.5 KB
Format:
Adobe Portable Document Format