Molecular Biology and Genetics / Moleküler Biyoloji ve Genetik
Permanent URI for this collectionhttps://hdl.handle.net/11147/9
Browse
3 results
Search Results
Article Citation - WoS: 11Citation - Scopus: 14Categorization of Species Based on Their Micrornas Employing Sequence Motifs, Information-Theoretic Sequence Feature Extraction, and K-Mers(Springer Verlag, 2017) Yousef, Malik; Nigatu, Dawit; Levy, Dalit; Allmer, Jens; Henkel, WernerBackground: Diseases like cancer can manifest themselves through changes in protein abundance, and microRNAs (miRNAs) play a key role in the modulation of protein quantity. MicroRNAs are used throughout all kingdoms and have been shown to be exploited by viruses to modulate their host environment. Since the experimental detection of miRNAs is difficult, computational methods have been developed. Many such tools employ machine learning for pre-miRNA detection, and many features for miRNA parameterization have been proposed. To train machine learning models, negative data is of importance yet hard to come by; therefore, we recently started to employ pre-miRNAs from one species as positive data versus another species’ pre-miRNAs as negative examples based on sequence motifs and k-mers. Here, we introduce the additional usage of information-theoretic (IT) features. Results: Pre-miRNAs from one species were used as positive and another species’ pre-miRNAs as negative training data for machine learning. The categorization capability of IT and k-mer features was investigated. Both feature sets and their combinations yielded a very high accuracy, which is as good as the previously suggested sequence motif and k-mer based method. However, for obtaining a high performance, a sufficiently large phylogenetic distance between the species and sufficiently high number of pre-miRNAs in the training set is required. To examine the contribution of the IT and k-mer features, an information gain-based feature ranking was performed. Although the top 3 are IT features, 80% of the top 100 features are k-mers. The comparison of all three individual approaches (motifs, IT, and k-mers) shows that the distinction of species based on their pre-miRNAs k-mers are sufficient. Conclusions: IT sequence feature extraction enables the distinction among species and is less computationally expensive than motif calculations. However, since IT features need larger amounts of data to have enough statistics for producing highly accurate results, future categorization into species can be effectively done using k-mers only. The biological reasoning for this is the existence of a codon bias between species which can, at least, be observed in exonic miRNAs. Future work in this direction will be the ab initio detection of pre-miRNA. In addition, prediction of pre-miRNA from RNA-seq can be done.Conference Object Citation - WoS: 6Citation - Scopus: 8Comparison of Four Ab Initio Microrna Prediction Tools(SciTePress, 2013) Saçar, Müşerref Duygu; Allmer, JensMicroRNAs are small RNA sequences of 18-24 nucleotides in length, which serve as templates to drive post transcriptional gene silencing. The canonical microRNA pathway starts with transcription from DNA and is followed by processing by the Microprocessor complex, yielding a hairpin structure. This is then exported into the cytosol where it is processed by Dicer and next incorporated into the RNA induced silencing complex. All of these biogenesis steps add to the overall specificity of miRNA production and effect. Unfortunately, experimental detection of miRNAs is cumbersome and therefore computational tools are necessary. Homology-based miRNA prediction tools are limited by fast miRNA evolution and by the fact that they are template driven. Ab initio miRNA prediction methods have been proposed but they have not been analyzed competitively so that their relative performance is largely unknown. Here we implement the features proposed in four miRNA ab initio studies and evaluate them on two data sets. Using the features described in Bentwich 2008 leads to the highest accuracy but still does not provide enough confidence into the results to warrant experimental validation of all predictions in a larger genome like the human genome. Copyright © 2013 SCITEPRESS - Science and Technology Publications.Conference Object Citation - Scopus: 17Systematic Computational Analysis of Potential Rnai Regulation in Toxoplasma Gondii(Institute of Electrical and Electronics Engineers Inc., 2010) Çakır, Mehmet Volkan; Allmer, JensRNA interference (RNAi) is the mechanism through which RNA interferes with the production of other RNAs in a sequence specific manner. Micro RNA (miRNA) is a type of RNA which is transcribed as pri-miRNAs and processed to premiRNAs in the nucleus. These pre-miRNAs are then exported from the nucleus and processed in the cytoplasm to double stranded RNA with one strand providing target specificity.. Toxoplasma gondii is a parasitic apicomplexan which causes several diseases. T. gondii is a good candidate for computational efforts with its small and publicly available genome files and extensive information about its gene structure. Although the existence of RNA interference in T. gondii is being debated, establishment of its complete potential RNAi regulatory network may be beneficial for further investigations into the topic. ©2009 IEEE.
