A Comparative Study of Glottal Source Estimation Techniques

dc.contributor.author Drugman, Thomas
dc.contributor.author Bozkurt, Barış
dc.contributor.author Dutoit, Thierry
dc.coverage.doi 10.1016/j.csl.2011.03.003
dc.date.accessioned 2017-02-06T09:13:40Z
dc.date.available 2017-02-06T09:13:40Z
dc.date.issued 2012
dc.description.abstract Abstract: Source-tract decomposition (or glottal flow estimation) is one of the basic problems of speech processing. For this, several techniques have been proposed in the literature. However, studies comparing different approaches are almost nonexistent. Besides, experiments have been systematically performed either on synthetic speech or on sustained vowels. In this study we compare three of the main representative state-of-the-art methods of glottal flow estimation: closed-phase inverse filtering, iterative and adaptive inverse filtering, and mixed-phase decomposition. These techniques are first submitted to an objective assessment test on synthetic speech signals. Their sensitivity to various factors affecting the estimation quality, as well as their robustness to noise are studied. In a second experiment, their ability to label voice quality (tensed, modal, soft) is studied on a large corpus of real connected speech. It is shown that changes of voice quality are reflected by significant modifications in glottal feature distributions. Techniques based on the mixed-phase decomposition and on a closed-phase inverse filtering process turn out to give the best results on both clean synthetic and real speech signals. On the other hand, iterative and adaptive inverse filtering is recommended in noisy environments for its high robustness. © 2011 Elsevier Ltd. All rights reserved. en_US
dc.description.sponsorship Belgian Fonds National de la Recherche Scientifique (FNRS) en_US
dc.identifier.citation Drugman, T., Bozkurt, B., and Dutoit, T. (2012). A comparative study of glottal source estimation techniques. Computer Speech and Language, 26(1), 20-34. doi:10.1016/j.csl.2011.03.003 en_US
dc.identifier.doi 10.1016/j.csl.2011.03.003
dc.identifier.doi 10.1016/j.csl.2011.03.003 en_US
dc.identifier.issn 0885-2308
dc.identifier.scopus 2-s2.0-80955173659
dc.identifier.uri http://doi.org/10.1016/j.csl.2011.03.003
dc.identifier.uri https://hdl.handle.net/11147/4792
dc.language.iso en en_US
dc.publisher Elsevier Ltd. en_US
dc.relation.ispartof Computer Speech and Language en_US
dc.rights info:eu-repo/semantics/openAccess en_US
dc.subject Glottal flow estimation en_US
dc.subject Inverse filtering en_US
dc.subject Mixed-phase decomposition en_US
dc.subject Voice quality en_US
dc.subject Source-tract separation en_US
dc.title A Comparative Study of Glottal Source Estimation Techniques en_US
dc.type Article en_US
dspace.entity.type Publication
gdc.author.institutional Bozkurt, Barış
gdc.author.yokid 115225
gdc.bip.impulseclass C4
gdc.bip.influenceclass C4
gdc.bip.popularityclass C4
gdc.coar.access open access
gdc.coar.type text::journal::journal article
gdc.collaboration.industrial false
gdc.description.department İzmir Institute of Technology. Electrical and Electronics Engineering en_US
gdc.description.endpage 34 en_US
gdc.description.issue 1 en_US
gdc.description.publicationcategory Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı en_US
gdc.description.scopusquality Q1
gdc.description.startpage 20 en_US
gdc.description.volume 26 en_US
gdc.description.wosquality Q2
gdc.identifier.openalex W1999319498
gdc.identifier.wos WOS:000295810600002
gdc.index.type WoS
gdc.index.type Scopus
gdc.oaire.accesstype BRONZE
gdc.oaire.diamondjournal false
gdc.oaire.impulse 31.0
gdc.oaire.influence 9.411902E-9
gdc.oaire.isgreen true
gdc.oaire.keywords FOS: Computer and information sciences
gdc.oaire.keywords Source-tract Separation
gdc.oaire.keywords Sound (cs.SD)
gdc.oaire.keywords Computer Science - Computation and Language
gdc.oaire.keywords Voice Quality
gdc.oaire.keywords Glottal flow estimation
gdc.oaire.keywords Inverse filtering
gdc.oaire.keywords Mixed-phase decomposition
gdc.oaire.keywords Computer Science - Sound
gdc.oaire.keywords Source-tract separation
gdc.oaire.keywords Audio and Speech Processing (eess.AS)
gdc.oaire.keywords FOS: Electrical engineering, electronic engineering, information engineering
gdc.oaire.keywords Glottal Flow Estimation
gdc.oaire.keywords Inverse Filtering
gdc.oaire.keywords Voice quality
gdc.oaire.keywords [SCCO.LING] Cognitive science/Linguistics
gdc.oaire.keywords Mixed-Phase Decomposition
gdc.oaire.keywords Computation and Language (cs.CL)
gdc.oaire.keywords Electrical Engineering and Systems Science - Audio and Speech Processing
gdc.oaire.popularity 2.0692525E-8
gdc.oaire.publicfunded false
gdc.oaire.sciencefields 02 engineering and technology
gdc.oaire.sciencefields 03 medical and health sciences
gdc.oaire.sciencefields 0202 electrical engineering, electronic engineering, information engineering
gdc.oaire.sciencefields 0305 other medical science
gdc.openalex.collaboration International
gdc.openalex.fwci 5.49777541
gdc.openalex.normalizedpercentile 0.95
gdc.openalex.toppercent TOP 10%
gdc.opencitations.count 80
gdc.plumx.crossrefcites 48
gdc.plumx.mendeley 63
gdc.plumx.scopuscites 101
gdc.scopus.citedcount 101
gdc.wos.citedcount 86
local.message.claim 2022-06-06T11:05:51.667+0300 *
local.message.claim |rp01762 *
local.message.claim |submit_approve *
local.message.claim |dc_contributor_author *
local.message.claim |None *
relation.isAuthorOfPublication.latestForDiscovery 5330d4dd-0015-42cd-8ff8-58d948df8241
relation.isOrgUnitOfPublication.latestForDiscovery 9af2b05f-28ac-4018-8abe-a4dfe192da5e

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Name:
4792.pdf
Size:
1.26 MB
Format:
Adobe Portable Document Format
Description:
Makale

License bundle

Now showing 1 - 1 of 1
Loading...
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: