Electrical - Electronic Engineering / Elektrik - Elektronik Mühendisliği
Permanent URI for this collectionhttps://hdl.handle.net/11147/11
Browse
4 results
Search Results
Article Citation - WoS: 4Ramcess 2.x Framework-Expressive Voice Analysis for Realtime and Accurate Synthesis of Singing(Springer Verlag, 2008) d'Alessandro, Nicolas; Babacan, Onur; Bozkurt, Barış; Dubuisson, Thomas; Holzapfel, Andre; Kessous, Loic; Vlieghe, MaximeIn this paper we present the work that has been achieved in the context of the second version of the RAMCESS singing synthesis framework. The main improvement of this study is the integration of new algorithms for expressive voice analysis, especially the separation of the glottal source and the vocal tract. Realtime synthesis modules have also been refined. These elements have been integrated in an existing digital instrument: the HANDSKETCH 1.X, a bimanual controller. Moreover this digital instrument is compared to existing systems.Article Citation - WoS: 43Citation - Scopus: 59Causal-Anticausal Decomposition of Speech Using Complex Cepstrum for Glottal Source Estimation(Elsevier Ltd., 2011) Drugman, Thomas; Bozkurt, Barış; Dutoit, ThierryComplex cepstrum is known in the literature for linearly separating causal and anticausal components. Relying on advances achieved by the Zeros of the Z-Transform (ZZT) technique, we here investigate the possibility of using complex cepstrum for glottal flow estimation on a large-scale database. Via a systematic study of the windowing effects on the deconvolution quality, we show that the complex cepstrum causal-anticausal decomposition can be effectively used for glottal flow estimation when specific windowing criteria are met. It is also shown that this complex cepstral decomposition gives similar glottal estimates as obtained with the ZZT method. However, as complex cepstrum uses FFT operations instead of requiring the factoring of high-degree polynomials, the method benefits from a much higher speed. Finally in our tests on a large corpus of real expressive speech, we show that the proposed method has the potential to be used for voice quality analysis.Conference Object Citation - WoS: 1Citation - Scopus: 2Glottal Source Estimation Using an Automatic Chirp Decomposition(Springer, 2010) Drugman, Thomas; Bozkurt, Barış; Dutoit, ThierryIn a previous work, we showed that the glottal source can be estimated from speech signals by computing the Zeros of the Z-Transform (ZZT). Decomposition was achieved by separating the roots inside (causal contribution) and outside (anticausal contribution) the unit circle. In order to guarantee a correct deconvolution, time alignment on the Glottal Closure Instants (GCIs) was shown to be essential. This paper extends the formalism of ZZT by evaluating the Z-transform on a contour possibly different from the unit circle. A method is proposed for determining automatically this contour by inspecting the root distribution. The derived Zeros of the Chirp Z-Transform (ZCZT)-based technique turns out to be much more robust to GCI location errors. © 2010 Springer-Verlag.Conference Object Citation - WoS: 3Citation - Scopus: 6Phase-Based Methods for Voice Source Analysis(Springer Verlag, 2007) D’Alessandro, Christophe; Bozkurt, Barış; Doval, Boris; Dutoit, Thierry; Henrich, Nathalie; Tuan, Vu Ngoc; Sturmel, NicolasVoice source analysis is an important but difficult issue for speech processing. In this talk, three aspects of voice source analysis recently developed at LIMSI (Orsay, France) and FPMs (Mons, Belgium) are discussed. In a first part, time domain and spectral domain modelling of glottal flow signals are presented. It is shown that the glottal flow can be modelled as an anticausal filter (maximum phase) before the glottal closing, and as a causal filter (minimum phase) after the glottal closing. In a second part, taking advantage of this phase structure, causal and anticausal components of the speech signal are separated according to the location in the Z-plane of the zeros of the Z-Transform (ZZT) of the windowed signal. This method is useful for voice source parameters analysis and source-tract deconvolution. Results of a comparative evaluation of the ZZT and linear prediction for source/tract separation are reported. In a third part, glottal closing instant detection using the phase of the wavelet transform is discussed. A method based on the lines of maximum phase in the time-scale plane is proposed. This method is compared to EGG for robust glottal closing instant analysis.
