Delineating the Impact of Machine Learning Elements in Pre-Microrna Detection

dc.contributor.author Saçar Demirci, Müşerref Duygu
dc.contributor.author Allmer, Jens
dc.coverage.doi 10.7717/peerj.3131
dc.date.accessioned 2017-09-20T12:33:55Z
dc.date.available 2017-09-20T12:33:55Z
dc.date.issued 2017
dc.description.abstract Gene regulation modulates RNA expression via transcription factors. Posttranscriptional gene regulation in turn influences the amount of protein product through, for example, microRNAs (miRNAs). Experimental establishment of miRNAs and their effects is complicated and even futile when aiming to establish the entirety of miRNA target interactions. Therefore, computational approaches have been proposed. Many such tools rely on machine learning (ML) which involves example selection, feature extraction, model training, algorithm selection, and parameter optimization. Different ML algorithms have been used for model training on various example sets, more than 1,000 features describing pre-miRNAs have been proposed and different training and testing schemes have been used for model establishment. For pre-miRNA detection, negative examples cannot easily be established causing a problem for two class classification algorithms. There is also no consensus on what ML approach works best and, therefore, we set forth and established the impact of the different parts involved in ML on model performance. Furthermore, we established two new negative datasets and analyzed the impact of using them for training and testing. It was our aim to attach an order of importance to the parts involved in ML for pre-miRNA detection, but instead we found that all parts are intricately connected and their contributions cannot be easily untangled leading us to suggest that when attempting ML-based pre-miRNA detection many scenarios need to be explored. en_US
dc.description.sponsorship Scientific and Technological Research Council of Turkey (113E326) en_US
dc.identifier.citation Saçar Demirci, M. D., and Allmer, J. (2017). Delineating the impact of machine learning elements in pre-microRNA detection. PeerJ, 2017(3). doi:10.7717/peerj.3131 en_US
dc.identifier.doi 10.7717/peerj.3131 en_US
dc.identifier.doi 10.7717/peerj.3131
dc.identifier.issn 2167-8359
dc.identifier.scopus 2-s2.0-85016400996
dc.identifier.uri http://doi.org/10.7717/peerj.3131
dc.identifier.uri https://hdl.handle.net/11147/6284
dc.language.iso en en_US
dc.publisher PeerJ Inc. en_US
dc.relation info:eu-repo/grantAgreement/TUBITAK/EEEAG/113E326 en_US
dc.relation.ispartof PeerJ en_US
dc.rights info:eu-repo/semantics/openAccess en_US
dc.subject Feature selection en_US
dc.subject MicroRNAs en_US
dc.subject ML strategy en_US
dc.subject Negative dataset en_US
dc.title Delineating the Impact of Machine Learning Elements in Pre-Microrna Detection en_US
dc.type Article en_US
dspace.entity.type Publication
gdc.author.institutional Saçar Demirci, Müşerref Duygu
gdc.author.institutional Allmer, Jens
gdc.author.yokid 107974
gdc.bip.impulseclass C4
gdc.bip.influenceclass C5
gdc.bip.popularityclass C4
gdc.coar.access open access
gdc.coar.type text::journal::journal article
gdc.collaboration.industrial false
gdc.description.department İzmir Institute of Technology. Molecular Biology and Genetics en_US
gdc.description.issue 3 en_US
gdc.description.publicationcategory Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı en_US
gdc.description.scopusquality Q3
gdc.description.volume 2017 en_US
gdc.description.wosquality Q2
gdc.identifier.openalex W2602423701
gdc.identifier.pmid 28367373
gdc.identifier.wos WOS:000397973700004
gdc.index.type WoS
gdc.index.type Scopus
gdc.index.type PubMed
gdc.oaire.accesstype GOLD
gdc.oaire.diamondjournal false
gdc.oaire.downloads 1
gdc.oaire.impulse 10.0
gdc.oaire.influence 3.2447707E-9
gdc.oaire.isgreen true
gdc.oaire.keywords ML strategy
gdc.oaire.keywords QH301-705.5
gdc.oaire.keywords Bioinformatics
gdc.oaire.keywords R
gdc.oaire.keywords MicroRNA
gdc.oaire.keywords MicroRNAs
gdc.oaire.keywords Machine learning
gdc.oaire.keywords Feature selection
gdc.oaire.keywords Medicine
gdc.oaire.keywords Ab initio pre-miRNA detection
gdc.oaire.keywords Biology (General)
gdc.oaire.keywords Negative dataset
gdc.oaire.popularity 7.6172935E-9
gdc.oaire.publicfunded false
gdc.oaire.sciencefields 0301 basic medicine
gdc.oaire.sciencefields 0206 medical engineering
gdc.oaire.sciencefields 02 engineering and technology
gdc.oaire.sciencefields 03 medical and health sciences
gdc.oaire.views 5
gdc.openalex.collaboration National
gdc.openalex.fwci 1.21162811
gdc.openalex.normalizedpercentile 0.72
gdc.opencitations.count 14
gdc.plumx.crossrefcites 2
gdc.plumx.mendeley 21
gdc.plumx.pubmedcites 7
gdc.plumx.scopuscites 13
gdc.scopus.citedcount 13
gdc.wos.citedcount 14
relation.isAuthorOfPublication.latestForDiscovery bf9f97a4-6d62-49cd-a7c8-1bc8463d14d2
relation.isOrgUnitOfPublication.latestForDiscovery 9af2b05f-28ac-4013-8abe-a4dfe192da5e

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Name:
6284.pdf
Size:
1.1 MB
Format:
Adobe Portable Document Format
Description:
Makale

License bundle

Now showing 1 - 1 of 1
Loading...
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: