Hierarchical Motif Vectors for Prediction of Functional Sites in Amino Acid Sequences Using Quasi-Supervised Learning
| dc.contributor.author | Karaçalı, Bilge | |
| dc.coverage.doi | 10.1109/TCBB.2012.68 | |
| dc.date.accessioned | 2017-05-25T13:42:19Z | |
| dc.date.available | 2017-05-25T13:42:19Z | |
| dc.date.issued | 2012 | |
| dc.description.abstract | We propose hierarchical motif vectors to represent local amino acid sequence configurations for predicting the functional attributes of amino acid sites on a global scale in a quasi-supervised learning framework. The motif vectors are constructed via wavelet decomposition on the variations of physico-chemical amino acid properties along the sequences. We then formulate a prediction scheme for the functional attributes of amino acid sites in terms of the respective motif vectors using the quasi-supervised learning algorithm that carries out predictions for all sites in consideration using only the experimentally verified sites. We have carried out comparative performance evaluation of the proposed method on the prediction of N-glycosylation of 55,184 sites possessing the consensus N-glycosylation sequon identified over 15,104 human proteins, out of which only 1,939 were experimentally verified N-glycosylation sites. In the experiments, the proposed method achieved better predictive performance than the alternative strategies from the literature. In addition, the predicted N-glycosylation sites showed good agreement with existing potential annotations, while the novel predictions belonged to proteins known to be modified by glycosylation. | en_US |
| dc.description.sponsorship | European Commission PIRG03-GA-2008-230903 | en_US |
| dc.identifier.citation | Karaçalı, B. (2012). Hierarchical motif vectors for prediction of functional sites in amino acid sequences using quasi-supervised learning. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 9(5), 1432-1441. doi:10.1109/TCBB.2012.68 | en_US |
| dc.identifier.doi | 10.1109/TCBB.2012.68 | |
| dc.identifier.doi | 10.1109/TCBB.2012.68 | en_US |
| dc.identifier.issn | 1545-5963 | |
| dc.identifier.scopus | 2-s2.0-84864913588 | |
| dc.identifier.uri | http://doi.org/10.1109/TCBB.2012.68 | |
| dc.identifier.uri | https://hdl.handle.net/11147/5615 | |
| dc.language.iso | en | en_US |
| dc.publisher | Institute of Electrical and Electronics Engineers Inc. | en_US |
| dc.relation.ispartof | IEEE/ACM Transactions on Computational Biology and Bioinformatics | en_US |
| dc.rights | info:eu-repo/semantics/openAccess | en_US |
| dc.subject | Functional attribute prediction | en_US |
| dc.subject | Hierarchical motif vectors | en_US |
| dc.subject | Protein sequence analysis | en_US |
| dc.subject | Forecasting | en_US |
| dc.subject | Learning frameworks | en_US |
| dc.title | Hierarchical Motif Vectors for Prediction of Functional Sites in Amino Acid Sequences Using Quasi-Supervised Learning | en_US |
| dc.type | Article | en_US |
| dspace.entity.type | Publication | |
| gdc.author.institutional | Karaçalı, Bilge | |
| gdc.author.yokid | 11527 | |
| gdc.bip.impulseclass | C5 | |
| gdc.bip.influenceclass | C5 | |
| gdc.bip.popularityclass | C5 | |
| gdc.coar.access | open access | |
| gdc.coar.type | text::journal::journal article | |
| gdc.collaboration.industrial | false | |
| gdc.description.department | İzmir Institute of Technology. Electrical and Electronics Engineering | en_US |
| gdc.description.endpage | 1441 | en_US |
| gdc.description.issue | 5 | en_US |
| gdc.description.publicationcategory | Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı | en_US |
| gdc.description.scopusquality | Q2 | |
| gdc.description.startpage | 1432 | en_US |
| gdc.description.volume | 9 | en_US |
| gdc.description.wosquality | Q1 | |
| gdc.identifier.openalex | W2088515494 | |
| gdc.identifier.pmid | 22585139 | |
| gdc.identifier.wos | WOS:000307299200018 | |
| gdc.index.type | WoS | |
| gdc.index.type | Scopus | |
| gdc.index.type | PubMed | |
| gdc.oaire.accesstype | BRONZE | |
| gdc.oaire.diamondjournal | false | |
| gdc.oaire.impulse | 4.0 | |
| gdc.oaire.influence | 2.99754E-9 | |
| gdc.oaire.isgreen | true | |
| gdc.oaire.keywords | Glycosylation | |
| gdc.oaire.keywords | Protein sequence analysis | |
| gdc.oaire.keywords | Amino Acid Motifs | |
| gdc.oaire.keywords | Computational Biology | |
| gdc.oaire.keywords | Proteins | |
| gdc.oaire.keywords | Learning frameworks | |
| gdc.oaire.keywords | Functional attribute prediction | |
| gdc.oaire.keywords | Sequence Analysis, Protein | |
| gdc.oaire.keywords | Amino Acid Sequence | |
| gdc.oaire.keywords | Amino Acids | |
| gdc.oaire.keywords | Databases, Protein | |
| gdc.oaire.keywords | Hierarchical motif vectors | |
| gdc.oaire.keywords | Algorithms | |
| gdc.oaire.keywords | Forecasting | |
| gdc.oaire.popularity | 7.568539E-10 | |
| gdc.oaire.publicfunded | false | |
| gdc.oaire.sciencefields | 0301 basic medicine | |
| gdc.oaire.sciencefields | 0303 health sciences | |
| gdc.oaire.sciencefields | 03 medical and health sciences | |
| gdc.openalex.collaboration | National | |
| gdc.openalex.fwci | 0.84098967 | |
| gdc.openalex.normalizedpercentile | 0.71 | |
| gdc.opencitations.count | 4 | |
| gdc.plumx.crossrefcites | 4 | |
| gdc.plumx.mendeley | 6 | |
| gdc.plumx.scopuscites | 4 | |
| gdc.scopus.citedcount | 4 | |
| gdc.wos.citedcount | 3 | |
| relation.isAuthorOfPublication.latestForDiscovery | a081f8c3-cd7b-40d5-a9ca-74707d1b4dc7 | |
| relation.isOrgUnitOfPublication.latestForDiscovery | 9af2b05f-28ac-4018-8abe-a4dfe192da5e |
