Can Mirbase Provide Positive Data for Machine Learning for the Detection of Mirna Hairpins?
| dc.contributor.author | Demirci, Müşerref Duygu Saçar | |
| dc.contributor.author | Hamzeiy, Hamid | |
| dc.contributor.author | Allmer, Jens | |
| dc.coverage.doi | 10.2390/biecoll-jib-2013-215 | |
| dc.date.accessioned | 2017-04-07T07:40:38Z | |
| dc.date.available | 2017-04-07T07:40:38Z | |
| dc.date.issued | 2013 | |
| dc.description.abstract | Experimental detection and validation of miRNAs is a tedious, time-consuming, and expensive process. Computational methods for miRNA gene detection are being developed so that the number of candidates that need experimental validation can be reduced to a manageable amount. Computational methods involve homology-based and ab inito algorithms. Both approaches are dependent on positive and negative training examples. Positive examples are usually derived from miRBase, the main resource for experimentally validated miRNAs. We encountered some problems with miRBase which we would like to report here. Some problems, among others, we encountered are that folds presented in miRBase are not always the fold with the minimum free energy; some entries do not seem to conform to expectations of miRNAs, and some external accession numbers are not valid. In addition, we compared the prediction accuracy for the same negative dataset when the positive data came from miRBase or miRTarBase and found that the latter led to more precise prediction models. We suggest that miRBase should introduce some automated facilities for ensuring data quality to overcome these problems. | en_US |
| dc.identifier.citation | Saçar, M. D., Hamzeiy, H., and Allmer, J. (2013). Can MiRBase provide positive data for machine learning for the detection of MiRNA hairpins? Journal of integrative bioinformatics, 10(2). doi:10.2390/biecoll-jib-2013-215 | en_US |
| dc.identifier.doi | 10.2390/biecoll-jib-2013-215 | en_US |
| dc.identifier.doi | 10.2390/biecoll-jib-2013-215 | |
| dc.identifier.issn | 1613-4516 | |
| dc.identifier.issn | 1613-4516 | |
| dc.identifier.scopus | 2-s2.0-84891760538 | |
| dc.identifier.uri | http://doi.org/10.2390/biecoll-jib-2013-215 | |
| dc.identifier.uri | https://hdl.handle.net/11147/5250 | |
| dc.language.iso | en | en_US |
| dc.publisher | Informationsmanagement in der Biotechnologie e.V. (IMBio e.V.) | en_US |
| dc.relation.ispartof | Journal of Integrative Bioinformatics | en_US |
| dc.rights | info:eu-repo/semantics/openAccess | en_US |
| dc.subject | MicroRNAs | en_US |
| dc.subject | Sequence alignment | en_US |
| dc.subject | Molecular sequence data | en_US |
| dc.subject | Nucleic acid | en_US |
| dc.subject | Databases | en_US |
| dc.title | Can Mirbase Provide Positive Data for Machine Learning for the Detection of Mirna Hairpins? | en_US |
| dc.type | Article | en_US |
| dspace.entity.type | Publication | |
| gdc.author.institutional | Demirci, Müşerref Duygu Saçar | |
| gdc.author.institutional | Hamzeiy, Hamid | |
| gdc.author.institutional | Allmer, Jens | |
| gdc.author.yokid | 114170 | |
| gdc.author.yokid | 107974 | |
| gdc.bip.impulseclass | C4 | |
| gdc.bip.influenceclass | C4 | |
| gdc.bip.popularityclass | C4 | |
| gdc.coar.access | open access | |
| gdc.coar.type | text::journal::journal article | |
| gdc.collaboration.industrial | false | |
| gdc.description.department | İzmir Institute of Technology. Molecular Biology and Genetics | en_US |
| gdc.description.endpage | 11 | |
| gdc.description.issue | 2 | en_US |
| gdc.description.publicationcategory | Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı | en_US |
| gdc.description.scopusquality | Q1 | |
| gdc.description.startpage | 1 | |
| gdc.description.volume | 10 | en_US |
| gdc.description.wosquality | Q3 | |
| gdc.identifier.openalex | W2184343450 | |
| gdc.identifier.pmid | 23525896 | |
| gdc.identifier.wos | WOS:000219961200001 | |
| gdc.index.type | WoS | |
| gdc.index.type | Scopus | |
| gdc.index.type | PubMed | |
| gdc.oaire.accesstype | GOLD | |
| gdc.oaire.diamondjournal | false | |
| gdc.oaire.impulse | 14.0 | |
| gdc.oaire.influence | 3.77784E-9 | |
| gdc.oaire.isgreen | true | |
| gdc.oaire.keywords | Base Sequence | |
| gdc.oaire.keywords | Molecular Sequence Data | |
| gdc.oaire.keywords | Data processing, computer science, computer systems | |
| gdc.oaire.keywords | 004 | |
| gdc.oaire.keywords | Databases | |
| gdc.oaire.keywords | MicroRNAs | |
| gdc.oaire.keywords | Sequence alignment | |
| gdc.oaire.keywords | Nucleic acid | |
| gdc.oaire.keywords | Artificial Intelligence | |
| gdc.oaire.keywords | Molecular sequence data | |
| gdc.oaire.keywords | Humans | |
| gdc.oaire.keywords | Nucleic Acid Conformation | |
| gdc.oaire.keywords | Databases, Nucleic Acid | |
| gdc.oaire.keywords | Sequence Alignment | |
| gdc.oaire.keywords | TP248.13-248.65 | |
| gdc.oaire.keywords | Algorithms | |
| gdc.oaire.keywords | Biotechnology | |
| gdc.oaire.popularity | 7.5862605E-9 | |
| gdc.oaire.publicfunded | false | |
| gdc.oaire.sciencefields | 0301 basic medicine | |
| gdc.oaire.sciencefields | 0303 health sciences | |
| gdc.oaire.sciencefields | 03 medical and health sciences | |
| gdc.openalex.collaboration | National | |
| gdc.openalex.fwci | 2.1268343 | |
| gdc.openalex.normalizedpercentile | 0.89 | |
| gdc.opencitations.count | 19 | |
| gdc.plumx.facebookshareslikecount | 61 | |
| gdc.plumx.mendeley | 38 | |
| gdc.plumx.pubmedcites | 15 | |
| gdc.plumx.scopuscites | 21 | |
| gdc.scopus.citedcount | 21 | |
| gdc.wos.citedcount | 25 | |
| relation.isAuthorOfPublication.latestForDiscovery | bf9f97a4-6d62-49cd-a7c8-1bc8463d14d2 | |
| relation.isOrgUnitOfPublication.latestForDiscovery | 9af2b05f-28ac-4013-8abe-a4dfe192da5e |
