The Impact of Feature Selection on One and Two-Class Classification Performance for Plant Micrornas

Loading...

Date

Journal Title

Journal ISSN

Volume Title

Publisher

Open Access Color

GOLD

Green Open Access

Yes

OpenAIRE Downloads

0

OpenAIRE Views

5

Publicly Funded

No
Impulse
Top 10%
Influence
Average
Popularity
Top 10%

relationships.isProjectOf

relationships.isJournalIssueOf

Abstract

MicroRNAs (miRNAs) are short nucleotide sequences that form a typical hairpin structure which is recognized by a complex enzyme machinery. It ultimately leads to the incorporation of 18-24 nt long mature miRNAs into RISC where they act as recognition keys to aid in regulation of target mRNAs. It is involved to determine miRNAs experimentally and, therefore, machine learning is used to complement such endeavors. The success of machine learning mostly depends on proper input data and appropriate features for parameterization of the data. Although, in general, two-class classification (TCC) is used in the field; because negative examples are hard to come by, one-class classification (OCC) has been tried for pre-miRNA detection. Since both positive and negative examples are currently somewhat limited, feature selection can prove to be vital for furthering the field of pre-miRNA detection. In this study, we compare the performance of OCC and TCC using eight feature selection methods and seven different plant species providing positive pre-miRNA examples. Feature selection was very successful for OCC where the best feature selection method achieved an average accuracy of 95.6%, thereby being ~29% better than the worst method which achieved 66.9% accuracy. While the performance is comparable to TCC, which performs up to 3% better than OCC, TCC is much less affected by feature selection and its largest performance gap is ~13% which only occurs for two of the feature selection methodologies. We conclude that feature selection is crucially important for OCC and that it can perform on par with TCC given the proper set of features.

Description

Keywords

Feature selection, Machine learning, MicroRNAs, Plant genetics, Classification, One-class classification, Plant genetics, QH301-705.5, Bioinformatics, Two-class classification, R, MicroRNA, Plant, Classification, MicroRNAs, Machine learning, Feature selection, Medicine, Biology (General)

Fields of Science

0301 basic medicine, 0303 health sciences, 03 medical and health sciences

Citation

Khalifa, W., Yousef, M., Saçar Demirci, M. D., and Allmer, J. (2016). The impact of feature selection on one and two-class classification performance for plant microRNAs. PeerJ, 2016(6). doi:10.7717/peerj.2135

WoS Q

Scopus Q

OpenCitations Logo
OpenCitations Citation Count
12

Source

Volume

2016

Issue

6

Start Page

End Page

PlumX Metrics
Citations

CrossRef : 6

Scopus : 12

PubMed : 5

Captures

Mendeley Readers : 27

SCOPUS™ Citations

12

checked on May 01, 2026

Web of Science™ Citations

14

checked on May 01, 2026

Page Views

746

checked on May 01, 2026

Downloads

347

checked on May 01, 2026

Google Scholar Logo
Google Scholar™
OpenAlex Logo
OpenAlex FWCI
1.10538283

Sustainable Development Goals

SDG data is not available