Can Mirbase Provide Positive Data for Machine Learning for the Detection of Mirna Hairpins?

dc.contributor.author Demirci, Müşerref Duygu Saçar
dc.contributor.author Hamzeiy, Hamid
dc.contributor.author Allmer, Jens
dc.coverage.doi 10.2390/biecoll-jib-2013-215
dc.date.accessioned 2017-04-07T07:40:38Z
dc.date.available 2017-04-07T07:40:38Z
dc.date.issued 2013
dc.description.abstract Experimental detection and validation of miRNAs is a tedious, time-consuming, and expensive process. Computational methods for miRNA gene detection are being developed so that the number of candidates that need experimental validation can be reduced to a manageable amount. Computational methods involve homology-based and ab inito algorithms. Both approaches are dependent on positive and negative training examples. Positive examples are usually derived from miRBase, the main resource for experimentally validated miRNAs. We encountered some problems with miRBase which we would like to report here. Some problems, among others, we encountered are that folds presented in miRBase are not always the fold with the minimum free energy; some entries do not seem to conform to expectations of miRNAs, and some external accession numbers are not valid. In addition, we compared the prediction accuracy for the same negative dataset when the positive data came from miRBase or miRTarBase and found that the latter led to more precise prediction models. We suggest that miRBase should introduce some automated facilities for ensuring data quality to overcome these problems. en_US
dc.identifier.citation Saçar, M. D., Hamzeiy, H., and Allmer, J. (2013). Can MiRBase provide positive data for machine learning for the detection of MiRNA hairpins? Journal of integrative bioinformatics, 10(2). doi:10.2390/biecoll-jib-2013-215 en_US
dc.identifier.doi 10.2390/biecoll-jib-2013-215 en_US
dc.identifier.doi 10.2390/biecoll-jib-2013-215
dc.identifier.issn 1613-4516
dc.identifier.issn 1613-4516
dc.identifier.scopus 2-s2.0-84891760538
dc.identifier.uri http://doi.org/10.2390/biecoll-jib-2013-215
dc.identifier.uri https://hdl.handle.net/11147/5250
dc.language.iso en en_US
dc.publisher Informationsmanagement in der Biotechnologie e.V. (IMBio e.V.) en_US
dc.relation.ispartof Journal of Integrative Bioinformatics en_US
dc.rights info:eu-repo/semantics/openAccess en_US
dc.subject MicroRNAs en_US
dc.subject Sequence alignment en_US
dc.subject Molecular sequence data en_US
dc.subject Nucleic acid en_US
dc.subject Databases en_US
dc.title Can Mirbase Provide Positive Data for Machine Learning for the Detection of Mirna Hairpins? en_US
dc.type Article en_US
dspace.entity.type Publication
gdc.author.institutional Demirci, Müşerref Duygu Saçar
gdc.author.institutional Hamzeiy, Hamid
gdc.author.institutional Allmer, Jens
gdc.author.yokid 114170
gdc.author.yokid 107974
gdc.bip.impulseclass C4
gdc.bip.influenceclass C4
gdc.bip.popularityclass C4
gdc.coar.access open access
gdc.coar.type text::journal::journal article
gdc.collaboration.industrial false
gdc.description.department İzmir Institute of Technology. Molecular Biology and Genetics en_US
gdc.description.endpage 11
gdc.description.issue 2 en_US
gdc.description.publicationcategory Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı en_US
gdc.description.scopusquality Q1
gdc.description.startpage 1
gdc.description.volume 10 en_US
gdc.description.wosquality Q3
gdc.identifier.openalex W2184343450
gdc.identifier.pmid 23525896
gdc.identifier.wos WOS:000219961200001
gdc.index.type WoS
gdc.index.type Scopus
gdc.index.type PubMed
gdc.oaire.accesstype GOLD
gdc.oaire.diamondjournal false
gdc.oaire.impulse 14.0
gdc.oaire.influence 3.77784E-9
gdc.oaire.isgreen true
gdc.oaire.keywords Base Sequence
gdc.oaire.keywords Molecular Sequence Data
gdc.oaire.keywords Data processing, computer science, computer systems
gdc.oaire.keywords 004
gdc.oaire.keywords Databases
gdc.oaire.keywords MicroRNAs
gdc.oaire.keywords Sequence alignment
gdc.oaire.keywords Nucleic acid
gdc.oaire.keywords Artificial Intelligence
gdc.oaire.keywords Molecular sequence data
gdc.oaire.keywords Humans
gdc.oaire.keywords Nucleic Acid Conformation
gdc.oaire.keywords Databases, Nucleic Acid
gdc.oaire.keywords Sequence Alignment
gdc.oaire.keywords TP248.13-248.65
gdc.oaire.keywords Algorithms
gdc.oaire.keywords Biotechnology
gdc.oaire.popularity 7.5862605E-9
gdc.oaire.publicfunded false
gdc.oaire.sciencefields 0301 basic medicine
gdc.oaire.sciencefields 0303 health sciences
gdc.oaire.sciencefields 03 medical and health sciences
gdc.openalex.collaboration National
gdc.openalex.fwci 2.1268343
gdc.openalex.normalizedpercentile 0.89
gdc.opencitations.count 19
gdc.plumx.facebookshareslikecount 61
gdc.plumx.mendeley 38
gdc.plumx.pubmedcites 15
gdc.plumx.scopuscites 21
gdc.scopus.citedcount 21
gdc.wos.citedcount 25
relation.isAuthorOfPublication.latestForDiscovery bf9f97a4-6d62-49cd-a7c8-1bc8463d14d2
relation.isOrgUnitOfPublication.latestForDiscovery 9af2b05f-28ac-4013-8abe-a4dfe192da5e

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Name:
5250.pdf
Size:
785.57 KB
Format:
Adobe Portable Document Format
Description:
Makale

License bundle

Now showing 1 - 1 of 1
Loading...
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: