Can Mirbase Provide Positive Data for Machine Learning for the Detection of Mirna Hairpins?
Loading...
Date
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
Open Access Color
GOLD
Green Open Access
Yes
OpenAIRE Downloads
OpenAIRE Views
Publicly Funded
No
Abstract
Experimental detection and validation of miRNAs is a tedious, time-consuming, and expensive process. Computational methods for miRNA gene detection are being developed so that the number of candidates that need experimental validation can be reduced to a manageable amount. Computational methods involve homology-based and ab inito algorithms. Both approaches are dependent on positive and negative training examples. Positive examples are usually derived from miRBase, the main resource for experimentally validated miRNAs. We encountered some problems with miRBase which we would like to report here. Some problems, among others, we encountered are that folds presented in miRBase are not always the fold with the minimum free energy; some entries do not seem to conform to expectations of miRNAs, and some external accession numbers are not valid. In addition, we compared the prediction accuracy for the same negative dataset when the positive data came from miRBase or miRTarBase and found that the latter led to more precise prediction models. We suggest that miRBase should introduce some automated facilities for ensuring data quality to overcome these problems.
Description
Keywords
[No Keyword Available], Base Sequence, Molecular Sequence Data, Data processing, computer science, computer systems, 004, Databases, MicroRNAs, Sequence alignment, Nucleic acid, Artificial Intelligence, Molecular sequence data, Humans, Nucleic Acid Conformation, Databases, Nucleic Acid, Sequence Alignment, TP248.13-248.65, Algorithms, Biotechnology
Fields of Science
0301 basic medicine, 0303 health sciences, 03 medical and health sciences
Citation
WoS Q
Scopus Q

OpenCitations Citation Count
22
Volume
10
Issue
2
Start Page
215
End Page
11
PlumX Metrics
Citations
CrossRef : 13
Scopus : 7
Captures
Mendeley Readers : 7
Google Scholar™


