Quasi-Supervised Strategies for Compound-Protein Interaction Prediction [article]

Authors

Karaçalı, Bilge

Abstract

In silico compound-protein interaction prediction helps prioritize drug candidates for experimental biochemical validation, since wet-lab experiments are time-consuming, laborious, and costly. Most machine learning methods proposed to that end treat the problem as supervised learning, in which known interactions are labeled positive and all remaining pairs are labeled negative. However, treating all unknown interactions as negative instances may lead to inaccuracies in practice, since some of the unknown pairs are bound to be true interactions that have not yet been identified. In this study, we propose to address this problem using the Quasi-Supervised Learning (QSL) algorithm. In this framework, potential interactions are predicted by estimating the overlap between a true positive dataset of compound-protein pairs with known interactions and an unknown dataset of all the remaining compound-protein pairs. The potential interactions are then identified as those pairs in the unknown dataset that overlap with the interacting pairs in the true positive dataset in terms of the associated similarity structure. We also address the class-imbalance problem by modifying the conventional cost function of the QSL algorithm. Experimental results on GPCR and Nuclear Receptor datasets show that the proposed method can identify actual interactions from all possible combinations.
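
The overlap idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it is a simplified Monte Carlo approximation in which each compound-protein pair is scored by how often its nearest neighbor, among randomly drawn balanced reference sets, comes from the known-positive dataset. The function name `overlap_scores`, the parameters `n_ref` and `n_rounds`, and the use of Euclidean distance on a feature vector per pair are all assumptions made for illustration; the modified cost function for class imbalance mentioned in the abstract is not reproduced here.

```python
import numpy as np

def overlap_scores(features, is_known_positive, n_ref=5, n_rounds=200, seed=0):
    """Score every pair by how strongly it overlaps with the known-positive set.

    Hypothetical helper: in each round, draw n_ref reference pairs from the
    known-positive set and n_ref from the unknown set, then let every pair
    take the label of its nearest reference. The returned score is the
    fraction of rounds in which that nearest reference was a known positive.
    """
    rng = np.random.default_rng(seed)
    features = np.asarray(features, dtype=float)
    is_known_positive = np.asarray(is_known_positive, dtype=bool)
    pos_idx = np.flatnonzero(is_known_positive)
    unk_idx = np.flatnonzero(~is_known_positive)
    n_ref = min(n_ref, len(pos_idx), len(unk_idx))

    votes = np.zeros(len(features))
    ref_is_pos = np.concatenate([np.ones(n_ref, dtype=bool),
                                 np.zeros(n_ref, dtype=bool)])
    for _ in range(n_rounds):
        ref = np.concatenate([
            rng.choice(pos_idx, size=n_ref, replace=False),
            rng.choice(unk_idx, size=n_ref, replace=False),
        ])
        # Distance from every pair to every reference pair: shape (N, 2 * n_ref).
        diff = features[:, None, :] - features[ref][None, :, :]
        dist = np.linalg.norm(diff, axis=2)
        # A reference pair must not vote for itself.
        dist[ref, np.arange(len(ref))] = np.inf
        votes += ref_is_pos[dist.argmin(axis=1)]
    return votes / n_rounds
```

Under these assumptions, unknown pairs with a high score lie in regions of feature space dominated by known interactions and would be the candidates to prioritize, while unknown pairs with a score near zero lie in regions populated only by other unknown pairs.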

Keywords

Machine learning, Chemoinformatics, Drug discovery, Compound similarity, Proteins, Algorithms

Fields of Science

0301 basic medicine, 0303 health sciences, 03 medical and health sciences

OpenCitations Citation Count
3

Volume

41

Citations

CrossRef: 1
PubMed: 1
Scopus: 4 (checked on Apr 27, 2026)
Web of Science: 3 (checked on Apr 27, 2026)
Mendeley Readers: 12
