A Comparative Study of Glottal Source Estimation Techniques
Loading...
Files
Date
2012
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
Elsevier Ltd.
Open Access Color
BRONZE
Green Open Access
Yes
OpenAIRE Downloads
OpenAIRE Views
Publicly Funded
No
Abstract
Abstract: Source-tract decomposition (or glottal flow estimation) is one of the basic problems of speech processing. For this, several techniques have been proposed in the literature. However, studies comparing different approaches are almost nonexistent. Besides, experiments have been systematically performed either on synthetic speech or on sustained vowels. In this study we compare three of the main representative state-of-the-art methods of glottal flow estimation: closed-phase inverse filtering, iterative and adaptive inverse filtering, and mixed-phase decomposition. These techniques are first submitted to an objective assessment test on synthetic speech signals. Their sensitivity to various factors affecting the estimation quality, as well as their robustness to noise are studied. In a second experiment, their ability to label voice quality (tensed, modal, soft) is studied on a large corpus of real connected speech. It is shown that changes of voice quality are reflected by significant modifications in glottal feature distributions. Techniques based on the mixed-phase decomposition and on a closed-phase inverse filtering process turn out to give the best results on both clean synthetic and real speech signals. On the other hand, iterative and adaptive inverse filtering is recommended in noisy environments for its high robustness. © 2011 Elsevier Ltd. All rights reserved.
Description
Keywords
Glottal flow estimation, Inverse filtering, Mixed-phase decomposition, Voice quality, Source-tract separation, FOS: Computer and information sciences, Source-tract Separation, Sound (cs.SD), Computer Science - Computation and Language, Voice Quality, Glottal flow estimation, Inverse filtering, Mixed-phase decomposition, Computer Science - Sound, Source-tract separation, Audio and Speech Processing (eess.AS), FOS: Electrical engineering, electronic engineering, information engineering, Glottal Flow Estimation, Inverse Filtering, Voice quality, [SCCO.LING] Cognitive science/Linguistics, Mixed-Phase Decomposition, Computation and Language (cs.CL), Electrical Engineering and Systems Science - Audio and Speech Processing
Fields of Science
02 engineering and technology, 03 medical and health sciences, 0202 electrical engineering, electronic engineering, information engineering, 0305 other medical science
Citation
Drugman, T., Bozkurt, B., and Dutoit, T. (2012). A comparative study of glottal source estimation techniques. Computer Speech and Language, 26(1), 20-34. doi:10.1016/j.csl.2011.03.003
WoS Q
Q2
Scopus Q
Q1

OpenCitations Citation Count
80
Source
Computer Speech and Language
Volume
26
Issue
1
Start Page
20
End Page
34
Collections
PlumX Metrics
Citations
CrossRef : 48
Scopus : 101
Captures
Mendeley Readers : 63
SCOPUS™ Citations
101
checked on Apr 27, 2026
Web of Science™ Citations
86
checked on Apr 27, 2026
Page Views
699
checked on Apr 27, 2026
Downloads
485
checked on Apr 27, 2026
Google Scholar™


