Causal-Anticausal Decomposition of Speech Using Complex Cepstrum for Glottal Source Estimation
| dc.contributor.author | Drugman, Thomas | |
| dc.contributor.author | Bozkurt, Barış | |
| dc.contributor.author | Dutoit, Thierry | |
| dc.coverage.doi | 10.1016/j.specom.2011.02.004 | |
| dc.date.accessioned | 2017-03-15T08:50:28Z | |
| dc.date.available | 2017-03-15T08:50:28Z | |
| dc.date.issued | 2011 | |
| dc.description.abstract | The complex cepstrum is known in the literature for linearly separating causal and anticausal components. Relying on advances achieved by the Zeros of the Z-Transform (ZZT) technique, we investigate here the possibility of using the complex cepstrum for glottal flow estimation on a large-scale database. Via a systematic study of the windowing effects on the deconvolution quality, we show that the complex cepstrum causal-anticausal decomposition can be effectively used for glottal flow estimation when specific windowing criteria are met. It is also shown that this complex cepstral decomposition gives glottal estimates similar to those obtained with the ZZT method. However, because the complex cepstrum uses FFT operations instead of requiring the factoring of high-degree polynomials, the method is much faster. Finally, in our tests on a large corpus of real expressive speech, we show that the proposed method has the potential to be used for voice quality analysis. | en_US |
| dc.description.sponsorship | Fonds National de la Recherche Scientifique (FNRS); Scientific and Technological Research Council of Turkey | en_US |
| dc.identifier.citation | Drugman, T., Bozkurt, B., and Dutoit, T. (2011). Causal-anticausal decomposition of speech using complex cepstrum for glottal source estimation. Speech Communication, 53(6), 855-866. doi:10.1016/j.specom.2011.02.004 | en_US |
| dc.identifier.doi | 10.1016/j.specom.2011.02.004 | en_US |
| dc.identifier.issn | 0167-6393 | |
| dc.identifier.issn | 1872-7182 | |
| dc.identifier.scopus | 2-s2.0-79955528226 | |
| dc.identifier.uri | https://doi.org/10.1016/j.specom.2011.02.004 | |
| dc.identifier.uri | https://hdl.handle.net/11147/5059 | |
| dc.language.iso | en | en_US |
| dc.publisher | Elsevier Ltd. | en_US |
| dc.relation.ispartof | Speech Communication | en_US |
| dc.rights | info:eu-repo/semantics/openAccess | en_US |
| dc.subject | Complex cepstrum | en_US |
| dc.subject | Glottal source estimation | en_US |
| dc.subject | Homomorphic analysis | en_US |
| dc.subject | Source-tract separation | en_US |
| dc.subject | Speech processing | en_US |
| dc.title | Causal-Anticausal Decomposition of Speech Using Complex Cepstrum for Glottal Source Estimation | en_US |
| dc.type | Article | en_US |
| dspace.entity.type | Publication | |
| gdc.author.institutional | Bozkurt, Barış | |
| gdc.bip.impulseclass | C4 | |
| gdc.bip.influenceclass | C4 | |
| gdc.bip.popularityclass | C4 | |
| gdc.coar.access | open access | |
| gdc.coar.type | text::journal::journal article | |
| gdc.collaboration.industrial | false | |
| gdc.description.department | İzmir Institute of Technology. Electrical and Electronics Engineering | en_US |
| gdc.description.endpage | 866 | en_US |
| gdc.description.issue | 6 | en_US |
| gdc.description.publicationcategory | Article - International Refereed Journal - Institutional Faculty Member | en_US |
| gdc.description.scopusquality | Q2 | |
| gdc.description.startpage | 855 | en_US |
| gdc.description.volume | 53 | en_US |
| gdc.description.wosquality | Q1 | |
| gdc.identifier.openalex | W2025920047 | |
| gdc.identifier.wos | WOS:000290829500006 | |
| gdc.index.type | WoS | |
| gdc.index.type | Scopus | |
| gdc.oaire.accesstype | BRONZE | |
| gdc.oaire.diamondjournal | false | |
| gdc.oaire.impulse | 21.0 | |
| gdc.oaire.influence | 7.1847266E-9 | |
| gdc.oaire.isgreen | true | |
| gdc.oaire.keywords | FOS: Computer and information sciences | |
| gdc.oaire.keywords | Sound (cs.SD) | |
| gdc.oaire.keywords | Computer Science - Computation and Language | |
| gdc.oaire.keywords | Source-Tract Separation | |
| gdc.oaire.keywords | Glottal source estimation | |
| gdc.oaire.keywords | Complex Cepstrum | |
| gdc.oaire.keywords | Computer Science - Sound | |
| gdc.oaire.keywords | Source-tract separation | |
| gdc.oaire.keywords | Speech processing | |
| gdc.oaire.keywords | Audio and Speech Processing (eess.AS) | |
| gdc.oaire.keywords | Complex cepstrum | |
| gdc.oaire.keywords | Glottal Source Estimation | |
| gdc.oaire.keywords | FOS: Electrical engineering, electronic engineering, information engineering | |
| gdc.oaire.keywords | Homomorphic analysis | |
| gdc.oaire.keywords | [SCCO.LING] Cognitive science/Linguistics | |
| gdc.oaire.keywords | Computation and Language (cs.CL) | |
| gdc.oaire.keywords | Homomorphic Analysis | |
| gdc.oaire.keywords | Electrical Engineering and Systems Science - Audio and Speech Processing | |
| gdc.oaire.popularity | 6.191941E-9 | |
| gdc.oaire.publicfunded | false | |
| gdc.oaire.sciencefields | 02 engineering and technology | |
| gdc.oaire.sciencefields | 03 medical and health sciences | |
| gdc.oaire.sciencefields | 0202 electrical engineering, electronic engineering, information engineering | |
| gdc.oaire.sciencefields | 0305 other medical science | |
| gdc.openalex.collaboration | International | |
| gdc.openalex.fwci | 7.76637077 | |
| gdc.openalex.normalizedpercentile | 0.98 | |
| gdc.openalex.toppercent | TOP 10% | |
| gdc.opencitations.count | 44 | |
| gdc.plumx.crossrefcites | 39 | |
| gdc.plumx.mendeley | 43 | |
| gdc.plumx.scopuscites | 59 | |
| gdc.scopus.citedcount | 59 | |
| gdc.wos.citedcount | 43 | |
| relation.isAuthorOfPublication.latestForDiscovery | 5330d4dd-0015-42cd-8ff8-58d948df8241 | |
| relation.isOrgUnitOfPublication.latestForDiscovery | 9af2b05f-28ac-4018-8abe-a4dfe192da5e |
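
The abstract describes separating causal and anticausal components with the complex cepstrum, using FFT operations rather than polynomial factoring. The following is a minimal NumPy sketch of that general idea, not the authors' implementation: the function names, the FFT size, and the toy test signal are assumptions made for illustration, and the windowing criteria mentioned in the abstract are not reproduced here. Negative quefrencies are treated as the anticausal part and non-negative quefrencies as the causal part.

```python
import numpy as np


def complex_cepstrum(frame, nfft=1024):
    """Complex cepstrum of a real frame with the linear phase term removed.

    Assumes the spectrum is positive at DC and has no zeros on the unit circle;
    polarity handling and windowing are deliberately left out of this sketch.
    """
    spectrum = np.fft.fft(frame, nfft)
    log_mag = np.log(np.abs(spectrum))
    phase = np.unwrap(np.angle(spectrum))
    # For a real signal the unwrapped phase at the Nyquist bin is an integer
    # multiple of pi; that integer is the pure delay, removed as a linear ramp.
    half = nfft // 2
    delay = int(np.round(phase[half] / np.pi))
    phase = phase - np.pi * delay * np.arange(nfft) / half
    return np.real(np.fft.ifft(log_mag + 1j * phase))


def split_causal_anticausal(frame, nfft=1024):
    """Split the complex cepstrum into causal and anticausal halves."""
    ccep = complex_cepstrum(frame, nfft)
    half = nfft // 2
    causal = np.zeros(nfft)
    anticausal = np.zeros(nfft)
    causal[:half] = ccep[:half]      # quefrencies n >= 0 (includes the gain term)
    anticausal[half:] = ccep[half:]  # negative quefrencies, stored at the end
    return causal, anticausal


def cepstrum_to_signal(ccep):
    """Map a (partial) complex cepstrum back to the time domain."""
    return np.real(np.fft.ifft(np.exp(np.fft.fft(ccep))))


# Toy mixed-phase signal (an assumption for illustration, not speech):
# a causal factor with a zero at 0.5 and an anticausal factor with a zero at 1/0.8.
toy = np.convolve([1.0, -0.5], [-0.8, 1.0])
causal_cep, anticausal_cep = split_causal_anticausal(toy)
causal_part = cepstrum_to_signal(causal_cep)
anticausal_part = cepstrum_to_signal(anticausal_cep)
```

In this toy case the causal part should come back as the sequence with the zero inside the unit circle, and the anticausal part as a time-reversed sequence whose non-zero samples wrap around to the end of the buffer. On real speech the result depends strongly on how the frame is windowed, which is the subject of the paper's study.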
