PubMed İndeksli Yayınlar Koleksiyonu / PubMed Indexed Publications Collection
Permanent URI for this collectionhttps://hdl.handle.net/11147/7645
Browse
9 results
Search Results
Article Citation - Scopus: 34Machine Learning Methods for Microrna Gene Prediction(Humana Press Inc., 2014) Saçar,M.D.; Allmer,J.MicroRNAs (miRNAs) are single-stranded, small, noncoding RNAs of about 22 nucleotides in length, which control gene expression at the posttranscriptional level through translational inhibition, degradation, adenylation, or destabilization of their target mRNAs. Although hundreds of miRNAs have been identified in various species, many more may still remain unknown. Therefore, discovery of new miRNA genes is an important step for understanding miRNA-mediated posttranscriptional regulation mechanisms. It seems that biological approaches to identify miRNA genes might be limited in their ability to detect rare miRNAs and are further limited to the tissues examined and the developmental stage of the organism under examination. These limitations have led to the development of sophisticated computational approaches attempting to identify possible miRNAs in silico. In this chapter, we discuss computational problems in miRNA prediction studies and review some of the many machine learning methods that have been tried to address the issues. © Springer Science+Business Media New York 2014.Article Citation - WoS: 5Citation - Scopus: 6Sequence Identification and in Silico Characterization of Novel Thermophilic Lipases From Geobacillus Species(WILEY, 2023) Sürmeli, Yusuf; Tekedar, Hasan Cihad; Sanli-Mohamed, GulsahMicrobial lipases are utilized in various biotechnological areas, including pharmaceuticals, food, biodiesel, and detergents. In this study, we cloned and sequenced Lip21 and Lip33 genes from Geobacillus sp. GS21 and Geobacillus sp. GS33, then we in silico and experimentally analyzed the encoded lipases. For this purpose, Lip21 and Lip33 were cloned, sequenced, and their amino acid sequences were investigated for determination of biophysicochemical characteristics, evolutionary relationships, and sequence similarities. 3D models were built and computationally affirmed by various bioinformatics tools, and enzyme-ligand interactions were investigated by docking analysis using six ligands. Biophysicochemical property of Lip21 and Lip33 was also determined experimentally and the results demonstrated that they had similar isoelectric point (pI) (6.21) and T-m (75.5(degrees)C) values as T-m was revealed by denatured protein analysis of the circular dichroism spectrum and pI was obtained by isoelectric focusing. Phylogeny analysis indicated that Lip21 and Lip33 were the closest to lipases from Geobacillus sp. SBS-4S and Geobacillus thermoleovorans, respectively. Alignment analysis demonstrated that S144-D348-H389 was catalytic triad residues in Lip21 and Lip33, and enzymes possessed a conserved Gly-X-Ser-X-Gly motif containing catalytic serine. 3D structure analysis indicated that Lip21 and Lip33 highly resembled each other and they were alpha/beta hydrolase-fold enzymes with large lid domains. BAN Delta IT analysis results showed that Lip21 and Lip33 had higher thermal stability, compared to other thermostable Geobacillus lipases. Docking results revealed that Lip21- and Lip33-docked complexes possessed common residues (H112, K115, Q162, E163, and S141) that interacted with the substrates, except paranitrophenyl (pNP)-C10 and pNP-C12, indicating that these residues might have a significant action on medium and short-chain fatty acid esters. Thus, Lip21 and Lip33 can be potential candidates for different industrial applications.Article Citation - WoS: 4Citation - Scopus: 4A New Shapley-Based Feature Selection Method in a Clinical Decision Support System for the Identification of Lung Diseases(MDPI, 2023) Kababulut, Fevzi Yasin; Kuntalp, Damla Gurkan; Düzyel, Okan; Özcan, Nermin; Kuntalp, MehmetThe aim of this study is to propose a new feature selection method based on the class-based contribution of Shapley values. For this purpose, a clinical decision support system was developed to assist doctors in their diagnosis of lung diseases from lung sounds. The developed systems, which are based on the Decision Tree Algorithm (DTA), create a classification for five different cases: healthy and disease (URTI, COPD, Pneumonia, and Bronchiolitis) states. The most important reason for using a Decision Tree Classifier instead of other high-performance classifiers such as CNN and RNN is that the class contributions of Shapley values can be seen with this classifier. The systems developed consist of either a single DTA classifier or five parallel DTA classifiers each of which is optimized to make a binary classification such as healthy vs. others, COPD vs. Others, etc. Feature sets based on Power Spectral Density (PSD), Mel Frequency Cepstral Coefficients (MFCC), and statistical characteristics extracted from lung sound recordings were used in these classifications. The results indicate that employing features selected based on the class-based contribution of Shapley values, along with utilizing an ensemble (parallel) system, leads to improved classification performance compared to performances using either raw features alone or traditional use of Shapley values.Article Citation - WoS: 3Citation - Scopus: 3Automated Labeling of Cancer Textures in Larynx Histopathology Slides Using Quasi-Supervised Learning(Science Printers and Publishers Inc., 2014) Önder, Devrim; Sarıoğlu, Sülen; Karaçalı, BilgeOBJECTIVE: To evaluate the performance of a quasisupervised statistical learning algorithm, operating on datasets having normal and neoplastic tissues, to identify larynx squamous cell carcinomas. Furthermore, cancer texture separability measures against normal tissues are to be developed and compared either for colorectal or larynx tissues. STUDY DESIGN: Light microscopic digital images from histopathological sections were obtained from laryngectomy materials including squamous cell carcinoma and nonneoplastic regions. The texture features were calculated by using co-occurrence matrices and local histograms. The texture features were input to the quasisupervised learning algorithm. RESULTS: Larynx regions containing squamous cell carcinomas were accurately identified, having false and true positive rates up to 21% and 87%, respectively. CONCLUSION: Larynx squamous cell carcinoma versus normal tissue texture separability measures were higher than colorectal adenocarcinoma versus normal textures for the colorectal database. Furthermore, the resultant labeling performances for all larynx datasets are higher than or equal to that of colorectal datasets. The results in larynx datasets, in comparison with the former colorectal study, suggested that quasi-supervised texture classification is to be a helpful method in histopathological image classification and analysis.Article Citation - WoS: 42Citation - Scopus: 47Effects of Malaxation Temperature and Harvest Time on the Chemical Characteristics of Olive Oils(Elsevier Ltd., 2016) Jolayemi, Olusola Samuel; Tokatlı, Figen; Özen, BanuThe aim of the study was to determine the effects of harvest time and malaxation temperature on chemical composition of olive oils produced from economically important olive varieties with a full factorial experimental design. The oils of Ayvalik and Memecik olives were extracted in an industrial two-phase continuous system. The quality parameters, phenolic and fatty acid profiles were determined. Harvest time, olive variety and their interaction were the most significant factors. Malaxation temperature was significant for hydroxytyrosol, tyrosol, p-coumaric acid, pinoresinol and peroxide value. Early and mid-harvest oils had high hydroxytyrosol and tyrosol (maximum 20.7 mg/kg) and pigment concentrations (maximum chlorophyll and carotenoids as 4.6 mg/kg and 2.86 mg/kg, respectively). Late harvest oils were characterized with high peroxide values (9.2-25 meq O2/kg), stearic (2.4-3.1%) and linoleic acids (9.3-10.4%). Multivariate regression analysis showed that oxidative stability was affected positively by hydroxytyrosol, tyrosol and oleic acid and negatively by polyunsaturated fatty acids.Article Citation - WoS: 14Citation - Scopus: 12The Impact of Feature Selection on One and Two-Class Classification Performance for Plant Micrornas(PeerJ Inc., 2016) Khalifa, Waleed; Yousef, Malik; Saçar Demirci, Müşerref Duygu; Allmer, JensMicroRNAs (miRNAs) are short nucleotide sequences that form a typical hairpin structure which is recognized by a complex enzyme machinery. It ultimately leads to the incorporation of 18-24 nt long mature miRNAs into RISC where they act as recognition keys to aid in regulation of target mRNAs. It is involved to determine miRNAs experimentally and, therefore, machine learning is used to complement such endeavors. The success of machine learning mostly depends on proper input data and appropriate features for parameterization of the data. Although, in general, two-class classification (TCC) is used in the field; because negative examples are hard to come by, one-class classification (OCC) has been tried for pre-miRNA detection. Since both positive and negative examples are currently somewhat limited, feature selection can prove to be vital for furthering the field of pre-miRNA detection. In this study, we compare the performance of OCC and TCC using eight feature selection methods and seven different plant species providing positive pre-miRNA examples. Feature selection was very successful for OCC where the best feature selection method achieved an average accuracy of 95.6%, thereby being ~29% better than the worst method which achieved 66.9% accuracy. While the performance is comparable to TCC, which performs up to 3% better than OCC, TCC is much less affected by feature selection and its largest performance gap is ~13% which only occurs for two of the feature selection methodologies. We conclude that feature selection is crucially important for OCC and that it can perform on par with TCC given the proper set of features.Article Citation - Scopus: 19Feature Selection Has a Large Impact on One-Class Classification Accuracy for Micrornas in Plants(Hindawi Publishing Corporation, 2016) Yousef, Malik; Demirci, Müşerref Duygu Saçar; Khalifa, Waleed; Allmer, JensMicroRNAs (miRNAs) are short RNA sequences involved in posttranscriptional gene regulation. Their experimental analysis is complicated and, therefore, needs to be supplemented with computational miRNA detection. Currently computational miRNA detection is mainly performed using machine learning and in particular two-class classification. For machine learning, the miRNAs need to be parametrized and more than 700 features have been described. Positive training examples for machine learning are readily available, but negative data is hard to come by. Therefore, it seems prerogative to use one-class classification instead of two-class classification. Previously, we were able to almost reach two-class classification accuracy using one-class classifiers. In this work, we employ feature selection procedures in conjunction with one-class classification and show that there is up to 36% difference in accuracy among these feature selection methods. The best feature set allowed the training of a one-class classifier which achieved an average accuracy of 95.6% thereby outperforming previous two-class-based plant miRNA detection approaches by about 0.5%. We believe that this can be improved upon in the future by rigorous filtering of the positive training examples and by improving current feature clustering algorithms to better target pre-miRNA feature selection.Book Part Citation - WoS: 299Citation - Scopus: 406Introduction To Machine Learning(Humana Press, 2014) Baştanlar, Yalın; Özuysal, MustafaThe machine learning field, which can be briefly defined as enabling computers make successful predictions using past experiences, has exhibited an impressive development recently with the help of the rapid increase in the storage capacity and processing power of computers. Together with many other disciplines, machine learning methods have been widely employed in bioinformatics. The difficulties and cost of biological analyses have led to the development of sophisticated machine learning approaches for this application area. In this chapter, we first review the fundamental concepts of machine learning such as feature assessment, unsupervised versus supervised learning and types of classification. Then, we point out the main issues of designing machine learning experiments and their performance evaluation. Finally, we introduce some supervised learning methods.Article Citation - WoS: 30Machine Learning Methods for Microrna Gene Prediction(Humana Press, 2014) Saçar, Müşerref Duygu; Allmer, JensMicroRNAs (miRNAs) are single-stranded, small, noncoding RNAs of about 22 nucleotides in length, which control gene expression at the posttranscriptional level through translational inhibition, degradation, adenylation, or destabilization of their target mRNAs. Although hundreds of miRNAs have been identified in various species, many more may still remain unknown. Therefore, discovery of new miRNA genes is an important step for understanding miRNA-mediated posttranscriptional regulation mechanisms. It seems that biological approaches to identify miRNA genes might be limited in their ability to detect rare miRNAs and are further limited to the tissues examined and the developmental stage of the organism under examination. These limitations have led to the development of sophisticated computational approaches attempting to identify possible miRNAs in silico. In this chapter, we discuss computational problems in miRNA prediction studies and review some of the many machine learning methods that have been tried to address the issues.
