Delineating the Impact of Machine Learning Elements in Pre-Microrna Detection
Loading...
Files
Date
Authors
Allmer, Jens
Journal Title
Journal ISSN
Volume Title
Publisher
Open Access Color
GOLD
Green Open Access
Yes
OpenAIRE Downloads
1
OpenAIRE Views
5
Publicly Funded
No
Abstract
Gene regulation modulates RNA expression via transcription factors. Posttranscriptional gene regulation in turn influences the amount of protein product through, for example, microRNAs (miRNAs). Experimental establishment of miRNAs and their effects is complicated and even futile when aiming to establish the entirety of miRNA target interactions. Therefore, computational approaches have been proposed. Many such tools rely on machine learning (ML) which involves example selection, feature extraction, model training, algorithm selection, and parameter optimization. Different ML algorithms have been used for model training on various example sets, more than 1,000 features describing pre-miRNAs have been proposed and different training and testing schemes have been used for model establishment. For pre-miRNA detection, negative examples cannot easily be established causing a problem for two class classification algorithms. There is also no consensus on what ML approach works best and, therefore, we set forth and established the impact of the different parts involved in ML on model performance. Furthermore, we established two new negative datasets and analyzed the impact of using them for training and testing. It was our aim to attach an order of importance to the parts involved in ML for pre-miRNA detection, but instead we found that all parts are intricately connected and their contributions cannot be easily untangled leading us to suggest that when attempting ML-based pre-miRNA detection many scenarios need to be explored.
Description
Keywords
Feature selection, MicroRNAs, ML strategy, Negative dataset, ML strategy, QH301-705.5, Bioinformatics, R, MicroRNA, MicroRNAs, Machine learning, Feature selection, Medicine, Ab initio pre-miRNA detection, Biology (General), Negative dataset
Fields of Science
0301 basic medicine, 0206 medical engineering, 02 engineering and technology, 03 medical and health sciences
Citation
Saçar Demirci, M. D., and Allmer, J. (2017). Delineating the impact of machine learning elements in pre-microRNA detection. PeerJ, 2017(3). doi:10.7717/peerj.3131
WoS Q
Scopus Q

OpenCitations Citation Count
14
Source
Volume
2017
Issue
3
Start Page
End Page
PlumX Metrics
Citations
CrossRef : 2
Scopus : 13
PubMed : 7
Captures
Mendeley Readers : 21
SCOPUS™ Citations
13
checked on May 04, 2026
Web of Science™ Citations
14
checked on May 04, 2026
Page Views
851
checked on May 04, 2026
Downloads
378
checked on May 04, 2026
Google Scholar™


