Quasi-Supervised Strategies for Compound-Protein Interaction Prediction
Loading...
Date
Authors
Karaçalı, B.
Journal Title
Journal ISSN
Volume Title
Publisher
Open Access Color
BRONZE
Green Open Access
No
OpenAIRE Downloads
OpenAIRE Views
Publicly Funded
No
Abstract
In-silico compound-protein interaction prediction addresses prioritization of drug candidates for experimental biochemical validation because the wet-lab experiments are time-consuming, laborious and costly. Most machine learning methods proposed to that end approach this problem with supervised learning strategies in which known interactions are labeled as positive and the rest are labeled as negative. However, treating all unknown interactions as negative instances may lead to inaccuracies in real practice since some of the unknown interactions are bound to be positive interactions waiting to be identified as such. In this study, we propose to address this problem using the Quasi-Supervised Learning (QSL) algorithm. In this framework, potential interactions are predicted by estimating the overlap between a true positive dataset of compound-protein pairs with known interactions and an unknown dataset of all the remaining compound-protein pairs. The potential interactions are then identified as those in the unknown dataset that overlap with the interacting pairs in the true positive dataset in terms of the associated similarity structure. We also address the class-imbalance problem by modifying the conventional cost function of the QSL algorithm. Experimental results on GPCR and Nuclear Receptor datasets show that the proposed method can identify actual interactions from all possible combinations. © 2021 Wiley-VCH GmbH.
Description
Keywords
Chemoinformatics, Compound Similarity, Compound-Protein Interactions, Drug Discovery, Machine Learning, Cost functions, Drug interactions, Learning algorithms, Learning systems, Supervised learning, Chemoinformatics, Compound similarity, Compound-protein interaction, Drug discovery, In-silico, Interaction prediction, Machine-learning, Protein interaction, Quasi-supervised learning, True positive, Proteins, alpha 2C adrenergic receptor, amcinonide, beta 1 adrenergic receptor, beta 2 adrenergic receptor, betaxolol, bisoprolol, cell nucleus receptor, chenodeoxycholic acid, chlorpromazine, chlorpromazine derivative, chlorpromazine hibenzate, chlorpromazine phenolphthalinate, cholesterol, cicloprolol, clozapine, denopamine, dipivefrine, dopamine 2 receptor, dopamine 3 receptor, dydrogesterone, epinephrine, eplerenone, estrogen receptor alpha, ethinylestradiol, etretinate, fenoldopam mesilate, fluoxymesterone, G protein coupled receptor, histamine H1 receptor, hydrocortisone, isoetarine, isotretinoin, levodopa, loteprednol etabonate, methoxamine, metixene, metoprolol, mifepristone, muscarinic M3 receptor, nandrolone phenpropionate, norethisterone, oxandrolone, oxymetazoline, perphenazine, pregnenolone, progesterone receptor, retinoic acid receptor beta, retinoic acid receptor gamma, retinoid X receptor alpha, retinoid X receptor gamma, ritodrine, salbutamol, salbutamol sulfate, spironolactone, tazarotene, terbutaline sulfate, testosterone, unclassified drug, protein, Article, chemical interaction, compound protein interaction, genome analysis, intermethod comparison, kernel method, Kolmogorov Smirnov test, learning algorithm, machine learning, molecular fingerprinting, pairwise kernel method, prediction, protein interaction, quasi supervised learning algorithm, semi supervised machine learning, algorithm, chemistry, Algorithms, Machine Learning, Proteins, Machine Learning, Proteins, Algorithms
Fields of Science
0301 basic medicine, 0303 health sciences, 03 medical and health sciences
Citation
WoS Q
Scopus Q

OpenCitations Citation Count
3
Source
Volume
41
Issue
4
Start Page
End Page
PlumX Metrics
Citations
CrossRef : 1
Scopus : 4
PubMed : 1
Captures
Mendeley Readers : 12
SCOPUS™ Citations
4
checked on Apr 27, 2026
Page Views
288
checked on Apr 27, 2026
Google Scholar™


