Can Mirbase Provide Positive Data for Machine Learning for the Detection of Mirna Hairpins?

Loading...

Date

Journal Title

Journal ISSN

Volume Title

Publisher

Open Access Color

GOLD

Green Open Access

Yes

OpenAIRE Downloads

OpenAIRE Views

Publicly Funded

No
Impulse
Top 10%
Influence
Top 10%
Popularity
Top 10%

relationships.isProjectOf

relationships.isJournalIssueOf

Abstract

Experimental detection and validation of miRNAs is a tedious, time-consuming, and expensive process. Computational methods for miRNA gene detection are being developed so that the number of candidates that need experimental validation can be reduced to a manageable amount. Computational methods involve homology-based and ab inito algorithms. Both approaches are dependent on positive and negative training examples. Positive examples are usually derived from miRBase, the main resource for experimentally validated miRNAs. We encountered some problems with miRBase which we would like to report here. Some problems, among others, we encountered are that folds presented in miRBase are not always the fold with the minimum free energy; some entries do not seem to conform to expectations of miRNAs, and some external accession numbers are not valid. In addition, we compared the prediction accuracy for the same negative dataset when the positive data came from miRBase or miRTarBase and found that the latter led to more precise prediction models. We suggest that miRBase should introduce some automated facilities for ensuring data quality to overcome these problems.

Description

Keywords

[No Keyword Available], Base Sequence, Molecular Sequence Data, Data processing, computer science, computer systems, 004, Databases, MicroRNAs, Sequence alignment, Nucleic acid, Artificial Intelligence, Molecular sequence data, Humans, Nucleic Acid Conformation, Databases, Nucleic Acid, Sequence Alignment, TP248.13-248.65, Algorithms, Biotechnology

Fields of Science

0301 basic medicine, 0303 health sciences, 03 medical and health sciences

Citation

WoS Q

Scopus Q

OpenCitations Logo
OpenCitations Citation Count
22

Volume

10

Issue

2

Start Page

215

End Page

11
PlumX Metrics
Citations

CrossRef : 13

Scopus : 7

Captures

Mendeley Readers : 7

Google Scholar Logo
Google Scholar™
OpenAlex Logo
OpenAlex FWCI
1.51059891

Sustainable Development Goals