A Comparative Study of Glottal Source Estimation Techniques

Loading...

Date

2012

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Elsevier Ltd.

Open Access Color

BRONZE

Green Open Access

Yes

OpenAIRE Downloads

OpenAIRE Views

Publicly Funded

No
Impulse
Top 10%
Influence
Top 10%
Popularity
Top 10%

relationships.isProjectOf

relationships.isJournalIssueOf

Abstract

Abstract: Source-tract decomposition (or glottal flow estimation) is one of the basic problems of speech processing. For this, several techniques have been proposed in the literature. However, studies comparing different approaches are almost nonexistent. Besides, experiments have been systematically performed either on synthetic speech or on sustained vowels. In this study we compare three of the main representative state-of-the-art methods of glottal flow estimation: closed-phase inverse filtering, iterative and adaptive inverse filtering, and mixed-phase decomposition. These techniques are first submitted to an objective assessment test on synthetic speech signals. Their sensitivity to various factors affecting the estimation quality, as well as their robustness to noise are studied. In a second experiment, their ability to label voice quality (tensed, modal, soft) is studied on a large corpus of real connected speech. It is shown that changes of voice quality are reflected by significant modifications in glottal feature distributions. Techniques based on the mixed-phase decomposition and on a closed-phase inverse filtering process turn out to give the best results on both clean synthetic and real speech signals. On the other hand, iterative and adaptive inverse filtering is recommended in noisy environments for its high robustness. © 2011 Elsevier Ltd. All rights reserved.

Description

Keywords

Glottal flow estimation, Inverse filtering, Mixed-phase decomposition, Voice quality, Source-tract separation, FOS: Computer and information sciences, Source-tract Separation, Sound (cs.SD), Computer Science - Computation and Language, Voice Quality, Glottal flow estimation, Inverse filtering, Mixed-phase decomposition, Computer Science - Sound, Source-tract separation, Audio and Speech Processing (eess.AS), FOS: Electrical engineering, electronic engineering, information engineering, Glottal Flow Estimation, Inverse Filtering, Voice quality, [SCCO.LING] Cognitive science/Linguistics, Mixed-Phase Decomposition, Computation and Language (cs.CL), Electrical Engineering and Systems Science - Audio and Speech Processing

Fields of Science

02 engineering and technology, 03 medical and health sciences, 0202 electrical engineering, electronic engineering, information engineering, 0305 other medical science

Citation

Drugman, T., Bozkurt, B., and Dutoit, T. (2012). A comparative study of glottal source estimation techniques. Computer Speech and Language, 26(1), 20-34. doi:10.1016/j.csl.2011.03.003

WoS Q

Q2

Scopus Q

Q1
OpenCitations Logo
OpenCitations Citation Count
80

Source

Computer Speech and Language

Volume

26

Issue

1

Start Page

20

End Page

34
PlumX Metrics
Citations

CrossRef : 48

Scopus : 101

Captures

Mendeley Readers : 63

SCOPUS™ Citations

101

checked on Apr 27, 2026

Web of Science™ Citations

86

checked on Apr 27, 2026

Page Views

699

checked on Apr 27, 2026

Downloads

485

checked on Apr 27, 2026

Google Scholar Logo
Google Scholar™
OpenAlex Logo
OpenAlex FWCI
5.49777541

Sustainable Development Goals

SDG data is not available