Electrical - Electronic Engineering / Elektrik - Elektronik Mühendisliği

Permanent URI for this collectionhttps://hdl.handle.net/11147/11

Browse

Search Results

Now showing 1 - 8 of 8
  • Conference Object
    Citation - Scopus: 6
    Music Information Retrieval for Turkish Music: Problems, Solutions and Tools
    (Institute of Electrical and Electronics Engineers Inc., 2009) Bozkurt, Barış; Gedik, Ali Cenk; Karaosmanoğlu, M. Kemal
    Bu çalışma bilgi erişimi uygulamaları açısından Türk müziğinin Batı müziği ile farklılıklarını tartışmaya açmaktadır. Türk müziği bilgi erişimi için frekans histogramı kullanımını önermekte ve otomatik karar sesi tespiti, makam sınıflandırma, ses sistemi analizi, kuram – icra uyuşma düzeyinin ölçülmesi gibi uygulamalar için geliştirilmiş bir dizi aracı içeren Makam Aracı (Makam Toolbox) 1.0’ın ve beraberinde büyük bir parametrik veritabanının tanıtımını yapmaktadır.
  • Article
    Citation - WoS: 67
    Citation - Scopus: 78
    Chirp Group Delay Analysis of Speech Signals
    (Elsevier, 2007) Bozkurt, Barış; Couvreur, Laurent; Dutoit, Thierry
    This study proposes new group delay estimation techniques that can be used for analyzing resonance patterns of short-term discrete-time signals and more specifically speech signals. Phase processing or equivalently group delay processing of speech signals are known to be difficult due to large spikes in the phase/group delay functions that mask the formant structure. In this study, we first analyze in detail the z-transform zero patterns of short-term speech signals in the z-plane and discuss the sources of spikes on group delay functions, namely the zeros closely located to the unit circle. We show that windowing largely influences these patterns, therefore short-term phase processing. Through a systematic study, we then show that reliable phase/group delay estimation for speech signals can be achieved by appropriate windowing and group delay functions can reveal formant information as well as some of the characteristics of the glottal flow component in speech signals. However, such phase estimation is highly sensitive to noise and robust extraction of group delay based parameters remains difficult in real acoustic conditions even with appropriate windowing. As an alternative, we propose processing of chirp group delay functions, i.e. group delay functions computed on a circle other than the unit circle in z-plane, which can be guaranteed to be spike-free. We finally present one application in feature extraction for automatic speech recognition (ASR). We show that chirp group delay representations are potentially useful for improving ASR performance. (c) 2007 Elsevier B.V. All rights reserved.
  • Article
    Citation - WoS: 4
    Ramcess 2.x Framework-Expressive Voice Analysis for Realtime and Accurate Synthesis of Singing
    (Springer Verlag, 2008) d'Alessandro, Nicolas; Babacan, Onur; Bozkurt, Barış; Dubuisson, Thomas; Holzapfel, Andre; Kessous, Loic; Vlieghe, Maxime
    In this paper we present the work that has been achieved in the context of the second version of the RAMCESS singing synthesis framework. The main improvement of this study is the integration of new algorithms for expressive voice analysis, especially the separation of the glottal source and the vocal tract. Realtime synthesis modules have also been refined. These elements have been integrated in an existing digital instrument: the HANDSKETCH 1.X, a bimanual controller. Moreover this digital instrument is compared to existing systems.
  • Article
    Citation - WoS: 5
    Citation - Scopus: 8
    Evaluation of the Makam Scale Theory of Arel for Music Information Retrieval on Traditional Turkish Art Music
    (Routledge, 2009) Gedik,A.C.; Bozkurt,B.
    Current music information retrieval (MIR) methods are specifically tailored to the needs of western music. Therefore, it is not straightforward to apply these methods to non-western musics such as traditional Turkish art music (TTAM). Western music theory plays a crucial role in MIR studies. The divergence, however, between theory and practice in traditional Turkish art music (TTAM) results in a lack of a reliable theory of TTAM on which MIR techniques can be based. This is particularly true for theories regarding pitch scales and interval structures in TTAM. In this paper, we evaluate the most influential (yet disputable) theory of TTAM, Arel theory, by means of a makam classification task, to understand whether it can provide a basis for MIR studies on TTAM in a similar way western music theory provides a basis for MIR studies on western music. It is shown that Arel theory is overall successful when applied for modality finding in TTAM and that it can be improved if small modifications are introduced following pitch values obtained from musical practice. © 2009, Copyright Taylor & Francis Group, LLC.
  • Article
    Citation - WoS: 13
    Citation - Scopus: 20
    Weighing Diverse Theoretical Models on Turkish Maqam Music Against Pitch Measurements: a Comparison of Peaks Automatically Derived From Frequency Histograms With Proposed Scale Tones
    (Taylor and Francis Ltd., 2009) Bozkurt, Barış; Yarman, Ozan; Karaosmanoğlu, M. Kemal; Akkoç, Can
    Since the early 20th century, various theories have been advanced in order to mathematically explain and notate modes of Traditional Turkish music known as maqams. In this article, maqam scales according to various theoretical models based on different tunings are compared with pitch measurements obtained from select recordings of master Turkish performers in order to study their level of match with analysed data. Chosen recordings are subjected to a fully computerized sequence of signal processing algorithms for the automatic determination of the set of relative pitches for each maqam scale: f0 estimation, histogram computation, tonic detection + histogram alignment, and peak picking. For nine well-recognized maqams, automatically derived relative pitches are compared with scale tones defined by theoretical models using quantitative distance measures. We analyse and interpret histogram peaks based on these measures to find the theoretical models most conforming with all the recordings, and hence, with the quotidian performance trends influenced by them.
  • Conference Object
    Citation - WoS: 16
    Citation - Scopus: 46
    Complex Cepstrum-Based Decomposition of Speech for Glottal Source Estimation
    (International Speech Communication Association, 2009) Drugman, Thomas; Bozkurt, Barış; Dutoit, Thierry
    Homomorphic analysis is a well-known method for the separation of non-linearly combined signals. More particularly, the use of complex cepstrum for source-tract deconvolution has been discussed in various articles. However there exists no study which proposes a glottal flow estimation methodology based on cepstrum and reports effective results. In this paper, we show that complex cepstrum can be effectively used for glottal flow estimation by separating the causal and anticausal components of a windowed speech signal as done by the Zeros of the Z-Transform (ZZT) decomposition. Based on exactly the same principles presented for ZZT decomposition, windowing should be applied such that the windowed speech signals exhibit mixed-phase characteristics which conform the speech production model that the anticausal component is mainly due to the glottal flow open phase. The advantage of the complex cepstrum-based approach compared to the ZZT decomposition is its much higher speed.
  • Article
    Citation - WoS: 25
    Citation - Scopus: 39
    An Automatic Pitch Analysis Method for Turkish Maqam Music
    (Taylor and Francis Ltd., 2008) Bozkurt, Barış
    Automatic pitch analysis of large audio databases is essential for studies on music information retrieval and developing a pitch scale theory for Turkish maqam music. However no such study is available. In this article, we first determine the main obstacle as the alignment of frequency analysis results from multiple files. We then propose a new method to automatically detect the tonic of a recording, align the data, and estimate overall frequency histograms from large databases. We show that such histograms can be successfully used for pitch scale (tuning) studies on the recordings of Tanburi Cemil Bey, an undisputed master of the genre
  • Conference Object
    Citation - WoS: 3
    Citation - Scopus: 6
    Phase-Based Methods for Voice Source Analysis
    (Springer Verlag, 2007) D’Alessandro, Christophe; Bozkurt, Barış; Doval, Boris; Dutoit, Thierry; Henrich, Nathalie; Tuan, Vu Ngoc; Sturmel, Nicolas
    Voice source analysis is an important but difficult issue for speech processing. In this talk, three aspects of voice source analysis recently developed at LIMSI (Orsay, France) and FPMs (Mons, Belgium) are discussed. In a first part, time domain and spectral domain modelling of glottal flow signals are presented. It is shown that the glottal flow can be modelled as an anticausal filter (maximum phase) before the glottal closing, and as a causal filter (minimum phase) after the glottal closing. In a second part, taking advantage of this phase structure, causal and anticausal components of the speech signal are separated according to the location in the Z-plane of the zeros of the Z-Transform (ZZT) of the windowed signal. This method is useful for voice source parameters analysis and source-tract deconvolution. Results of a comparative evaluation of the ZZT and linear prediction for source/tract separation are reported. In a third part, glottal closing instant detection using the phase of the wavelet transform is discussed. A method based on the lines of maximum phase in the time-scale plane is proposed. This method is compared to EGG for robust glottal closing instant analysis.