Phd Degree / Doktora

Permanent URI for this collectionhttps://hdl.handle.net/11147/2869

Browse

Search Results

Now showing 1 - 2 of 2
  • Doctoral Thesis
    Density Grid Based Stream Clustering Algorithm
    (Izmir Institute of Technology, 2019) Ahmed, Rowanda Daoud; Ayav, Tolga; Ayav, Tolga; Dalkılıç, Gökhan
    Recently as applications produce overwhelming data streams, the need for strategies to analyze and cluster streaming data becomes an urgent and a crucial research area for knowledge discovery. The main objective and the key aim of data stream clustering is to gain insights into incoming data. Recognizing all probable patterns in this boundless data which arrives at varying speeds and structure and evolves over time, is very important in this analysis process. The existing data stream clustering strategies so far, all suffer from different limitations, like the inability to find the arbitrary shaped clusters and handling outliers in addition to requiring some parameter information for data processing. For fast, accurate, efficient and effective handling for all these challenges, we proposed DGStream, a new online-offline grid and density-based stream clustering algorithm. We conducted many experiments and evaluated the performance of DGStream over different simulated databases and for different parameter settings where a wide variety of concept drifts, novelty, evolving data, number and size of clusters and outlier detection are considered. Our algorithm is suitable for applications where the interest lies in the most recent information like stock market, or if the analysis of existing information is required as well as cases where both the old and the recent information are all equally important. The experiments, over the synthetic and real datasets, show that our proposed algorithm outperforms the other algorithms in efficiency.
  • Doctoral Thesis
    Development of Genetic Algorithm Based Classification and Cluster Analysis Methods for Analytical Data
    (Izmir Institute of Technology, 2009) Öztürk, Betül; Özdemir, Durmuş
    In this study genetic algorithm based classification and clustering methods were aimed to develop for the spectral data. The developed methods were completely achieved hybridization of nature inspired algorithm (genetic algorithms, GAs) to other classification or clustering methods. The first method was genetic algorithm based principal component analysis (GAPCAD), and the second was genetic algorithm based discriminant analysis (GADA). Both methods were performed to achieve the best discrimination between the olive oil and vegetable oil samples. The classifications of samples were examined directly from their spectral data obtained from using near infrared spectrometry, Fourier transform infrared (FTIR) spectrometry, and spectrofluorometry. The GA was used to optimize the performance of classification or clustering techniques. on training set in order to maximize the correct classification of acceptable and unacceptable samples or samples of dissimilar properties and to reduce the spectral data by wavelength selection. After GA optimization the classification results of training set were controlled by validation set. Lastly, the success of both algorithms was compared to the results of PCA and SIMCA.