Electrical - Electronic Engineering / Elektrik - Elektronik Mühendisliği
Permanent URI for this collection: https://hdl.handle.net/11147/11
Search Results (2 results)
Article
Citation - WoS: 4; Citation - Scopus: 4
A New Shapley-Based Feature Selection Method in a Clinical Decision Support System for the Identification of Lung Diseases (MDPI, 2023)
Kababulut, Fevzi Yasin; Kuntalp, Damla Gurkan; Düzyel, Okan; Özcan, Nermin; Kuntalp, Mehmet
The aim of this study is to propose a new feature selection method based on the class-based contribution of Shapley values. For this purpose, a clinical decision support system was developed to assist doctors in diagnosing lung diseases from lung sounds. The developed systems, which are based on the Decision Tree Algorithm (DTA), classify recordings into five cases: healthy and four disease states (URTI, COPD, pneumonia, and bronchiolitis). The main reason for using a Decision Tree Classifier instead of other high-performance classifiers such as CNN and RNN is that the class contributions of Shapley values can be seen with this classifier. The systems consist of either a single DTA classifier or five parallel DTA classifiers, each of which is optimized to make a binary classification such as healthy vs. others, COPD vs. others, etc. Feature sets based on Power Spectral Density (PSD), Mel Frequency Cepstral Coefficients (MFCC), and statistical characteristics extracted from lung sound recordings were used in these classifications. The results indicate that using features selected by the class-based contribution of Shapley values, together with an ensemble (parallel) system, improves classification performance compared with using either the raw features alone or Shapley values in the traditional way.

Article
Citation - WoS: 27; Citation - Scopus: 34
A New Method for GAN-Based Data Augmentation for Classes With Distinct Clusters (Pergamon-Elsevier Science Ltd, 2024)
Kuntalp, Mehmet; Düzyel, Okan
Data augmentation is a commonly used approach for addressing limited data availability in machine learning. Various methods are available, including classical and modern techniques. However, when modern data augmentation methods, such as Generative Adversarial Neural Networks (GANs), are applied to class-specific data, the resulting data can exhibit structural discrepancies. This study explores a different use of GANs as a data augmentation method that solves this problem, using the electrocardiogram (ECG) signals in the MIT-BIH arrhythmia dataset as the example. We begin by examining the cluster structure of a specific class using the t-distributed Stochastic Neighbor Embedding (t-SNE) method. Based on this cluster structure, we propose a new method for applying GANs to augment data for that class. We assess the effect of our method in a classification task using a 1-D Convolutional Neural Network (CNN), Support Vector Machine (SVM), One-vs-One (OvO) classifier, K-Nearest Neighbors (KNN), and Random Forest as the classifiers. The results demonstrate that, when a specific class has distinct clusters, our proposed method can lead to better classification performance than the normal use of GANs.
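
The abstract above does not spell out the augmentation procedure, so the following is only a minimal sketch of one plausible reading: split a class into its clusters and fit a separate generator to each cluster, rather than one generator to the whole class. Everything here is an assumption made for illustration. A tiny k-means stands in for inspecting the cluster structure with t-SNE, a per-dimension Gaussian sampler stands in for a trained GAN generator, and the names `kmeans_labels`, `fit_placeholder_generator`, and `augment_per_cluster` are hypothetical.

```python
import random
from statistics import mean, pstdev


def kmeans_labels(points, k, iters=20, seed=0):
    """Tiny k-means; an assumed stand-in for reading clusters off a t-SNE plot."""
    rng = random.Random(seed)
    centers = [list(p) for p in rng.sample(points, k)]
    labels = [0] * len(points)
    for _ in range(iters):
        # Assign every point to its nearest center (squared Euclidean distance).
        labels = [
            min(range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centers[c])))
            for p in points
        ]
        # Move each center to the mean of its members; keep it if the cluster is empty.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centers[c] = [mean(dim) for dim in zip(*members)]
    return labels


def fit_placeholder_generator(samples, rng):
    """Placeholder for a trained GAN generator: a per-dimension Gaussian fit."""
    mus = [mean(dim) for dim in zip(*samples)]
    sigmas = [pstdev(dim) for dim in zip(*samples)]
    return lambda: [rng.gauss(m, s) for m, s in zip(mus, sigmas)]


def augment_per_cluster(class_samples, k, n_per_cluster, seed=0):
    """Fit one generator per cluster of a single class and pool the synthetic samples."""
    rng = random.Random(seed)
    labels = kmeans_labels(class_samples, k, seed=seed)
    synthetic = []
    for c in range(k):
        members = [p for p, lab in zip(class_samples, labels) if lab == c]
        if not members:
            continue  # nothing to learn from an empty cluster
        generate = fit_placeholder_generator(members, rng)
        synthetic.extend(generate() for _ in range(n_per_cluster))
    return synthetic


if __name__ == "__main__":
    # Toy class with two well-separated clusters (made-up data, not ECG beats).
    rng = random.Random(1)
    toy = [[rng.gauss(0, 0.3), rng.gauss(0, 0.3)] for _ in range(30)]
    toy += [[rng.gauss(10, 0.3), rng.gauss(10, 0.3)] for _ in range(30)]
    new_samples = augment_per_cluster(toy, k=2, n_per_cluster=25)
    print(len(new_samples), "synthetic samples generated")
```

In this toy setup, fitting a generator per cluster keeps the synthetic samples close to one of the two modes. A single Gaussian fitted to the whole class would instead place many samples in the empty region between the clusters, which is one way to picture the structural discrepancies the abstract mentions.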
