Feature Selection Has a Large Impact on One-Class Classification Accuracy for Micrornas in Plants

dc.contributor.author Yousef, Malik
dc.contributor.author Demirci, Müşerref Duygu Saçar
dc.contributor.author Khalifa, Waleed
dc.contributor.author Allmer, Jens
dc.coverage.doi 10.1155/2016/5670851
dc.date.accessioned 2017-06-28T07:16:35Z
dc.date.available 2017-06-28T07:16:35Z
dc.date.issued 2016
dc.description.abstract MicroRNAs (miRNAs) are short RNA sequences involved in posttranscriptional gene regulation. Their experimental analysis is complicated and, therefore, needs to be supplemented with computational miRNA detection. Currently computational miRNA detection is mainly performed using machine learning and in particular two-class classification. For machine learning, the miRNAs need to be parametrized and more than 700 features have been described. Positive training examples for machine learning are readily available, but negative data is hard to come by. Therefore, it seems prerogative to use one-class classification instead of two-class classification. Previously, we were able to almost reach two-class classification accuracy using one-class classifiers. In this work, we employ feature selection procedures in conjunction with one-class classification and show that there is up to 36% difference in accuracy among these feature selection methods. The best feature set allowed the training of a one-class classifier which achieved an average accuracy of 95.6% thereby outperforming previous two-class-based plant miRNA detection approaches by about 0.5%. We believe that this can be improved upon in the future by rigorous filtering of the positive training examples and by improving current feature clustering algorithms to better target pre-miRNA feature selection. en_US
dc.description.sponsorship The Scientific and Technological Research Council of Turkey (Grant no. 113E326) en_US
dc.identifier.citation Yousef, M., Saçar Demirci, M. D., Khalifa, W., and Allmer, J. (2016). Feature selection has a large impact on one-class classification accuracy for micrornas in plants. Advances in Bioinformatics, 2016. doi:10.1155/2016/5670851 en_US
dc.identifier.doi 10.1155/2016/5670851 en_US
dc.identifier.doi 10.1155/2016/5670851
dc.identifier.issn 1687-8027
dc.identifier.issn 1687-8035
dc.identifier.scopus 2-s2.0-84969820470
dc.identifier.uri http://doi.org/10.1155/2016/5670851
dc.identifier.uri https://hdl.handle.net/11147/5791
dc.language.iso en en_US
dc.publisher Hindawi Publishing Corporation en_US
dc.relation info:eu-repo/grantAgreement/TUBITAK/EEEAG/113E326 en_US
dc.relation.ispartof Advances in Bioinformatics en_US
dc.rights info:eu-repo/semantics/openAccess en_US
dc.subject MicroRNAs en_US
dc.subject miRNA detection en_US
dc.subject Machine learning en_US
dc.subject Classification en_US
dc.subject Plant en_US
dc.title Feature Selection Has a Large Impact on One-Class Classification Accuracy for Micrornas in Plants en_US
dc.type Article en_US
dspace.entity.type Publication
gdc.author.institutional Demirci, Müşerref Duygu Saçar
gdc.author.institutional Allmer, Jens
gdc.author.yokid 114170
gdc.author.yokid 107974
gdc.bip.impulseclass C4
gdc.bip.influenceclass C4
gdc.bip.popularityclass C4
gdc.coar.access open access
gdc.coar.type text::journal::journal article
gdc.collaboration.industrial false
gdc.description.department İzmir Institute of Technology. Molecular Biology and Genetics en_US
gdc.description.endpage 6
gdc.description.publicationcategory Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı en_US
gdc.description.scopusquality N/A
gdc.description.startpage 1
gdc.description.volume 2016 en_US
gdc.description.wosquality N/A
gdc.identifier.openalex W2341782166
gdc.identifier.pmid 27190509
gdc.index.type Scopus
gdc.index.type PubMed
gdc.oaire.accesstype GOLD
gdc.oaire.diamondjournal false
gdc.oaire.impulse 9.0
gdc.oaire.influence 3.8366568E-9
gdc.oaire.isgreen true
gdc.oaire.keywords MicroRNAs
gdc.oaire.keywords miRNA detection
gdc.oaire.keywords Machine learning
gdc.oaire.keywords Plant
gdc.oaire.keywords Classification
gdc.oaire.keywords Research Article
gdc.oaire.popularity 8.667668E-9
gdc.oaire.publicfunded false
gdc.oaire.sciencefields 0301 basic medicine
gdc.oaire.sciencefields 03 medical and health sciences
gdc.oaire.sciencefields 0206 medical engineering
gdc.oaire.sciencefields 02 engineering and technology
gdc.openalex.collaboration International
gdc.openalex.fwci 1.71948441
gdc.openalex.normalizedpercentile 0.84
gdc.opencitations.count 17
gdc.plumx.crossrefcites 9
gdc.plumx.facebookshareslikecount 110
gdc.plumx.mendeley 42
gdc.plumx.pubmedcites 4
gdc.plumx.scopuscites 19
gdc.scopus.citedcount 19
relation.isAuthorOfPublication.latestForDiscovery bf9f97a4-6d62-49cd-a7c8-1bc8463d14d2
relation.isOrgUnitOfPublication.latestForDiscovery 9af2b05f-28ac-4013-8abe-a4dfe192da5e

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Name:
5791.pdf
Size:
1.19 MB
Format:
Adobe Portable Document Format
Description:
Makale

License bundle

Now showing 1 - 1 of 1
Loading...
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: