Quasi-Supervised Strategies for Compound-Protein Interaction Prediction

dc.contributor.author Çakı, O.
dc.contributor.author Karaçalı, B.
dc.date.accessioned 2024-01-06T07:22:36Z
dc.date.available 2024-01-06T07:22:36Z
dc.date.issued 2022
dc.description.abstract In-silico compound-protein interaction prediction addresses prioritization of drug candidates for experimental biochemical validation because the wet-lab experiments are time-consuming, laborious and costly. Most machine learning methods proposed to that end approach this problem with supervised learning strategies in which known interactions are labeled as positive and the rest are labeled as negative. However, treating all unknown interactions as negative instances may lead to inaccuracies in real practice since some of the unknown interactions are bound to be positive interactions waiting to be identified as such. In this study, we propose to address this problem using the Quasi-Supervised Learning (QSL) algorithm. In this framework, potential interactions are predicted by estimating the overlap between a true positive dataset of compound-protein pairs with known interactions and an unknown dataset of all the remaining compound-protein pairs. The potential interactions are then identified as those in the unknown dataset that overlap with the interacting pairs in the true positive dataset in terms of the associated similarity structure. We also address the class-imbalance problem by modifying the conventional cost function of the QSL algorithm. Experimental results on GPCR and Nuclear Receptor datasets show that the proposed method can identify actual interactions from all possible combinations. © 2021 Wiley-VCH GmbH. en_US
dc.identifier.doi 10.1002/minf.202100118
dc.identifier.issn 1868-1743
dc.identifier.issn 1868-1751
dc.identifier.scopus 2-s2.0-85119954344
dc.identifier.uri https://doi.org/10.1002/minf.202100118
dc.identifier.uri https://hdl.handle.net/11147/14200
dc.language.iso en en_US
dc.publisher John Wiley and Sons Inc en_US
dc.relation.ispartof Molecular Informatics en_US
dc.rights info:eu-repo/semantics/closedAccess en_US
dc.subject Chemoinformatics en_US
dc.subject Compound Similarity en_US
dc.subject Compound-Protein Interactions en_US
dc.subject Drug Discovery en_US
dc.subject Machine Learning en_US
dc.subject Cost functions en_US
dc.subject Drug interactions en_US
dc.subject Learning algorithms en_US
dc.subject Learning systems en_US
dc.subject Supervised learning en_US
dc.subject Chemoinformatics en_US
dc.subject Compound similarity en_US
dc.subject Compound-protein interaction en_US
dc.subject Drug discovery en_US
dc.subject In-silico en_US
dc.subject Interaction prediction en_US
dc.subject Machine-learning en_US
dc.subject Protein interaction en_US
dc.subject Quasi-supervised learning en_US
dc.subject True positive en_US
dc.subject Proteins en_US
dc.subject alpha 2C adrenergic receptor en_US
dc.subject amcinonide en_US
dc.subject beta 1 adrenergic receptor en_US
dc.subject beta 2 adrenergic receptor en_US
dc.subject betaxolol en_US
dc.subject bisoprolol en_US
dc.subject cell nucleus receptor en_US
dc.subject chenodeoxycholic acid en_US
dc.subject chlorpromazine en_US
dc.subject chlorpromazine derivative en_US
dc.subject chlorpromazine hibenzate en_US
dc.subject chlorpromazine phenolphthalinate en_US
dc.subject cholesterol en_US
dc.subject cicloprolol en_US
dc.subject clozapine en_US
dc.subject denopamine en_US
dc.subject dipivefrine en_US
dc.subject dopamine 2 receptor en_US
dc.subject dopamine 3 receptor en_US
dc.subject dydrogesterone en_US
dc.subject epinephrine en_US
dc.subject eplerenone en_US
dc.subject estrogen receptor alpha en_US
dc.subject ethinylestradiol en_US
dc.subject etretinate en_US
dc.subject fenoldopam mesilate en_US
dc.subject fluoxymesterone en_US
dc.subject G protein coupled receptor en_US
dc.subject histamine H1 receptor en_US
dc.subject hydrocortisone en_US
dc.subject isoetarine en_US
dc.subject isotretinoin en_US
dc.subject levodopa en_US
dc.subject loteprednol etabonate en_US
dc.subject methoxamine en_US
dc.subject metixene en_US
dc.subject metoprolol en_US
dc.subject mifepristone en_US
dc.subject muscarinic M3 receptor en_US
dc.subject nandrolone phenpropionate en_US
dc.subject norethisterone en_US
dc.subject oxandrolone en_US
dc.subject oxymetazoline en_US
dc.subject perphenazine en_US
dc.subject pregnenolone en_US
dc.subject progesterone receptor en_US
dc.subject retinoic acid receptor beta en_US
dc.subject retinoic acid receptor gamma en_US
dc.subject retinoid X receptor alpha en_US
dc.subject retinoid X receptor gamma en_US
dc.subject ritodrine en_US
dc.subject salbutamol en_US
dc.subject salbutamol sulfate en_US
dc.subject spironolactone en_US
dc.subject tazarotene en_US
dc.subject terbutaline sulfate en_US
dc.subject testosterone en_US
dc.subject unclassified drug en_US
dc.subject protein en_US
dc.subject Article en_US
dc.subject chemical interaction en_US
dc.subject compound protein interaction en_US
dc.subject genome analysis en_US
dc.subject intermethod comparison en_US
dc.subject kernel method en_US
dc.subject Kolmogorov Smirnov test en_US
dc.subject learning algorithm en_US
dc.subject machine learning en_US
dc.subject molecular fingerprinting en_US
dc.subject pairwise kernel method en_US
dc.subject prediction en_US
dc.subject protein interaction en_US
dc.subject quasi supervised learning algorithm en_US
dc.subject semi supervised machine learning en_US
dc.subject algorithm en_US
dc.subject chemistry en_US
dc.subject Algorithms en_US
dc.subject Machine Learning en_US
dc.subject Proteins en_US
dc.title Quasi-Supervised Strategies for Compound-Protein Interaction Prediction en_US
dc.type Article en_US
dspace.entity.type Publication
gdc.author.institutional
gdc.author.scopusid 57354233400
gdc.author.scopusid 6603084273
gdc.bip.impulseclass C5
gdc.bip.influenceclass C5
gdc.bip.popularityclass C5
gdc.coar.access metadata only access
gdc.coar.type text::journal::journal article
gdc.collaboration.industrial false
gdc.description.department İzmir Institute of Technology en_US
gdc.description.departmenttemp Çakı, O., Electrical and Electronics Engineering Department, Izmir Institute of Technology, Izmir, Urla, 35430, Turkey; Karaçalı, B., Electrical and Electronics Engineering Department, Izmir Institute of Technology, Izmir, Urla, 35430, Turkey en_US
gdc.description.issue 4 en_US
gdc.description.publicationcategory Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı en_US
gdc.description.scopusquality Q2
gdc.description.volume 41 en_US
gdc.description.wosquality Q1
gdc.identifier.openalex W3216479650
gdc.identifier.pmid 34837345
gdc.index.type Scopus
gdc.index.type PubMed
gdc.oaire.accesstype BRONZE
gdc.oaire.diamondjournal false
gdc.oaire.impulse 2.0
gdc.oaire.influence 2.7798779E-9
gdc.oaire.isgreen false
gdc.oaire.keywords Machine Learning
gdc.oaire.keywords Proteins
gdc.oaire.keywords Algorithms
gdc.oaire.popularity 3.7766426E-9
gdc.oaire.publicfunded false
gdc.oaire.sciencefields 0301 basic medicine
gdc.oaire.sciencefields 0303 health sciences
gdc.oaire.sciencefields 03 medical and health sciences
gdc.openalex.collaboration National
gdc.openalex.fwci 0.32018424
gdc.openalex.normalizedpercentile 0.63
gdc.opencitations.count 3
gdc.plumx.crossrefcites 1
gdc.plumx.facebookshareslikecount 1
gdc.plumx.mendeley 12
gdc.plumx.pubmedcites 1
gdc.plumx.scopuscites 4
gdc.scopus.citedcount 4
relation.isAuthorOfPublication.latestForDiscovery a081f8c3-cd7b-40d5-a9ca-74707d1b4dc7
relation.isOrgUnitOfPublication.latestForDiscovery 9af2b05f-28ac-4018-8abe-a4dfe192da5e

Files