Feature Selection Has a Large Impact on One-Class Classification Accuracy for Micrornas in Plants
| dc.contributor.author | Yousef, Malik | |
| dc.contributor.author | Demirci, Müşerref Duygu Saçar | |
| dc.contributor.author | Khalifa, Waleed | |
| dc.contributor.author | Allmer, Jens | |
| dc.coverage.doi | 10.1155/2016/5670851 | |
| dc.date.accessioned | 2017-06-28T07:16:35Z | |
| dc.date.available | 2017-06-28T07:16:35Z | |
| dc.date.issued | 2016 | |
| dc.description.abstract | MicroRNAs (miRNAs) are short RNA sequences involved in posttranscriptional gene regulation. Their experimental analysis is complicated and, therefore, needs to be supplemented with computational miRNA detection. Currently computational miRNA detection is mainly performed using machine learning and in particular two-class classification. For machine learning, the miRNAs need to be parametrized and more than 700 features have been described. Positive training examples for machine learning are readily available, but negative data is hard to come by. Therefore, it seems prerogative to use one-class classification instead of two-class classification. Previously, we were able to almost reach two-class classification accuracy using one-class classifiers. In this work, we employ feature selection procedures in conjunction with one-class classification and show that there is up to 36% difference in accuracy among these feature selection methods. The best feature set allowed the training of a one-class classifier which achieved an average accuracy of 95.6% thereby outperforming previous two-class-based plant miRNA detection approaches by about 0.5%. We believe that this can be improved upon in the future by rigorous filtering of the positive training examples and by improving current feature clustering algorithms to better target pre-miRNA feature selection. | en_US |
| dc.description.sponsorship | The Scientific and Technological Research Council of Turkey (Grant no. 113E326) | en_US |
| dc.identifier.citation | Yousef, M., Saçar Demirci, M. D., Khalifa, W., and Allmer, J. (2016). Feature selection has a large impact on one-class classification accuracy for micrornas in plants. Advances in Bioinformatics, 2016. doi:10.1155/2016/5670851 | en_US |
| dc.identifier.doi | 10.1155/2016/5670851 | en_US |
| dc.identifier.doi | 10.1155/2016/5670851 | |
| dc.identifier.issn | 1687-8027 | |
| dc.identifier.issn | 1687-8035 | |
| dc.identifier.scopus | 2-s2.0-84969820470 | |
| dc.identifier.uri | http://doi.org/10.1155/2016/5670851 | |
| dc.identifier.uri | https://hdl.handle.net/11147/5791 | |
| dc.language.iso | en | en_US |
| dc.publisher | Hindawi Publishing Corporation | en_US |
| dc.relation | info:eu-repo/grantAgreement/TUBITAK/EEEAG/113E326 | en_US |
| dc.relation.ispartof | Advances in Bioinformatics | en_US |
| dc.rights | info:eu-repo/semantics/openAccess | en_US |
| dc.subject | MicroRNAs | en_US |
| dc.subject | miRNA detection | en_US |
| dc.subject | Machine learning | en_US |
| dc.subject | Classification | en_US |
| dc.subject | Plant | en_US |
| dc.title | Feature Selection Has a Large Impact on One-Class Classification Accuracy for Micrornas in Plants | en_US |
| dc.type | Article | en_US |
| dspace.entity.type | Publication | |
| gdc.author.institutional | Demirci, Müşerref Duygu Saçar | |
| gdc.author.institutional | Allmer, Jens | |
| gdc.author.yokid | 114170 | |
| gdc.author.yokid | 107974 | |
| gdc.bip.impulseclass | C4 | |
| gdc.bip.influenceclass | C4 | |
| gdc.bip.popularityclass | C4 | |
| gdc.coar.access | open access | |
| gdc.coar.type | text::journal::journal article | |
| gdc.collaboration.industrial | false | |
| gdc.description.department | İzmir Institute of Technology. Molecular Biology and Genetics | en_US |
| gdc.description.endpage | 6 | |
| gdc.description.publicationcategory | Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı | en_US |
| gdc.description.scopusquality | N/A | |
| gdc.description.startpage | 1 | |
| gdc.description.volume | 2016 | en_US |
| gdc.description.wosquality | N/A | |
| gdc.identifier.openalex | W2341782166 | |
| gdc.identifier.pmid | 27190509 | |
| gdc.index.type | Scopus | |
| gdc.index.type | PubMed | |
| gdc.oaire.accesstype | GOLD | |
| gdc.oaire.diamondjournal | false | |
| gdc.oaire.impulse | 9.0 | |
| gdc.oaire.influence | 3.8366568E-9 | |
| gdc.oaire.isgreen | true | |
| gdc.oaire.keywords | MicroRNAs | |
| gdc.oaire.keywords | miRNA detection | |
| gdc.oaire.keywords | Machine learning | |
| gdc.oaire.keywords | Plant | |
| gdc.oaire.keywords | Classification | |
| gdc.oaire.keywords | Research Article | |
| gdc.oaire.popularity | 8.667668E-9 | |
| gdc.oaire.publicfunded | false | |
| gdc.oaire.sciencefields | 0301 basic medicine | |
| gdc.oaire.sciencefields | 03 medical and health sciences | |
| gdc.oaire.sciencefields | 0206 medical engineering | |
| gdc.oaire.sciencefields | 02 engineering and technology | |
| gdc.openalex.collaboration | International | |
| gdc.openalex.fwci | 1.71948441 | |
| gdc.openalex.normalizedpercentile | 0.84 | |
| gdc.opencitations.count | 17 | |
| gdc.plumx.crossrefcites | 9 | |
| gdc.plumx.facebookshareslikecount | 110 | |
| gdc.plumx.mendeley | 42 | |
| gdc.plumx.pubmedcites | 4 | |
| gdc.plumx.scopuscites | 19 | |
| gdc.scopus.citedcount | 19 | |
| relation.isAuthorOfPublication.latestForDiscovery | bf9f97a4-6d62-49cd-a7c8-1bc8463d14d2 | |
| relation.isOrgUnitOfPublication.latestForDiscovery | 9af2b05f-28ac-4013-8abe-a4dfe192da5e |
