Electrical - Electronic Engineering / Elektrik - Elektronik Mühendisliği
Permanent URI for this collectionhttps://hdl.handle.net/11147/11
Browse
2 results
Search Results
Conference Object Citation - WoS: 1Citation - Scopus: 2A Case Study on Logging Visual Activities: Chess Game(Springer Verlag, 2006) Ozan, Şükrü; Gümüştekin, ŞevketAutomatically recognizing and analyzing visual activities in complex environments is a challenging and open-ended problem. In this study this task is performed in a chess game scenario where the rules, actions and the environment are well defined. The purpose here is to detect and observe a FIDE (Fédération International des Ėchecs) compatible chess board, generating a log file of the moves made by human players. A series of basic image processing operations have been applied to perform the desired task. The first step of automatically detecting a chess board is followed by locating the positions of the pieces. After the initial setup is established every move made by a player is automatically detected and verified. Intel® Open Source Computer Vision Library (OpenCV) is used in the current software implementation.Conference Object Citation - WoS: 3Citation - Scopus: 6Phase-Based Methods for Voice Source Analysis(Springer Verlag, 2007) D’Alessandro, Christophe; Bozkurt, Barış; Doval, Boris; Dutoit, Thierry; Henrich, Nathalie; Tuan, Vu Ngoc; Sturmel, NicolasVoice source analysis is an important but difficult issue for speech processing. In this talk, three aspects of voice source analysis recently developed at LIMSI (Orsay, France) and FPMs (Mons, Belgium) are discussed. In a first part, time domain and spectral domain modelling of glottal flow signals are presented. It is shown that the glottal flow can be modelled as an anticausal filter (maximum phase) before the glottal closing, and as a causal filter (minimum phase) after the glottal closing. In a second part, taking advantage of this phase structure, causal and anticausal components of the speech signal are separated according to the location in the Z-plane of the zeros of the Z-Transform (ZZT) of the windowed signal. This method is useful for voice source parameters analysis and source-tract deconvolution. Results of a comparative evaluation of the ZZT and linear prediction for source/tract separation are reported. In a third part, glottal closing instant detection using the phase of the wavelet transform is discussed. A method based on the lines of maximum phase in the time-scale plane is proposed. This method is compared to EGG for robust glottal closing instant analysis.
