Hierarchical Motif Vectors for Prediction of Functional Sites in Amino Acid Sequences Using Quasi-Supervised Learning

dc.contributor.author Karaçalı, Bilge
dc.coverage.doi 10.1109/TCBB.2012.68
dc.date.accessioned 2017-05-25T13:42:19Z
dc.date.available 2017-05-25T13:42:19Z
dc.date.issued 2012
dc.description.abstract We propose hierarchical motif vectors to represent local amino acid sequence configurations for predicting the functional attributes of amino acid sites on a global scale in a quasi-supervised learning framework. The motif vectors are constructed via wavelet decomposition on the variations of physico-chemical amino acid properties along the sequences. We then formulate a prediction scheme for the functional attributes of amino acid sites in terms of the respective motif vectors using the quasi-supervised learning algorithm that carries out predictions for all sites in consideration using only the experimentally verified sites. We have carried out comparative performance evaluation of the proposed method on the prediction of N-glycosylation of 55,184 sites possessing the consensus N-glycosylation sequon identified over 15,104 human proteins, out of which only 1,939 were experimentally verified N-glycosylation sites. In the experiments, the proposed method achieved better predictive performance than the alternative strategies from the literature. In addition, the predicted N-glycosylation sites showed good agreement with existing potential annotations, while the novel predictions belonged to proteins known to be modified by glycosylation. en_US
dc.description.sponsorship European Commission PIRG03-GA-2008-230903 en_US
dc.identifier.citation Karaçalı, B. (2012). Hierarchical motif vectors for prediction of functional sites in amino acid sequences using quasi-supervised learning. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 9(5), 1432-1441. doi:10.1109/TCBB.2012.68 en_US
dc.identifier.doi 10.1109/TCBB.2012.68
dc.identifier.doi 10.1109/TCBB.2012.68 en_US
dc.identifier.issn 1545-5963
dc.identifier.scopus 2-s2.0-84864913588
dc.identifier.uri http://doi.org/10.1109/TCBB.2012.68
dc.identifier.uri https://hdl.handle.net/11147/5615
dc.language.iso en en_US
dc.publisher Institute of Electrical and Electronics Engineers Inc. en_US
dc.relation.ispartof IEEE/ACM Transactions on Computational Biology and Bioinformatics en_US
dc.rights info:eu-repo/semantics/openAccess en_US
dc.subject Functional attribute prediction en_US
dc.subject Hierarchical motif vectors en_US
dc.subject Protein sequence analysis en_US
dc.subject Forecasting en_US
dc.subject Learning frameworks en_US
dc.title Hierarchical Motif Vectors for Prediction of Functional Sites in Amino Acid Sequences Using Quasi-Supervised Learning en_US
dc.type Article en_US
dspace.entity.type Publication
gdc.author.institutional Karaçalı, Bilge
gdc.author.yokid 11527
gdc.bip.impulseclass C5
gdc.bip.influenceclass C5
gdc.bip.popularityclass C5
gdc.coar.access open access
gdc.coar.type text::journal::journal article
gdc.collaboration.industrial false
gdc.description.department İzmir Institute of Technology. Electrical and Electronics Engineering en_US
gdc.description.endpage 1441 en_US
gdc.description.issue 5 en_US
gdc.description.publicationcategory Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı en_US
gdc.description.scopusquality Q2
gdc.description.startpage 1432 en_US
gdc.description.volume 9 en_US
gdc.description.wosquality Q1
gdc.identifier.openalex W2088515494
gdc.identifier.pmid 22585139
gdc.identifier.wos WOS:000307299200018
gdc.index.type WoS
gdc.index.type Scopus
gdc.index.type PubMed
gdc.oaire.accesstype BRONZE
gdc.oaire.diamondjournal false
gdc.oaire.impulse 4.0
gdc.oaire.influence 2.99754E-9
gdc.oaire.isgreen true
gdc.oaire.keywords Glycosylation
gdc.oaire.keywords Protein sequence analysis
gdc.oaire.keywords Amino Acid Motifs
gdc.oaire.keywords Computational Biology
gdc.oaire.keywords Proteins
gdc.oaire.keywords Learning frameworks
gdc.oaire.keywords Functional attribute prediction
gdc.oaire.keywords Sequence Analysis, Protein
gdc.oaire.keywords Amino Acid Sequence
gdc.oaire.keywords Amino Acids
gdc.oaire.keywords Databases, Protein
gdc.oaire.keywords Hierarchical motif vectors
gdc.oaire.keywords Algorithms
gdc.oaire.keywords Forecasting
gdc.oaire.popularity 7.568539E-10
gdc.oaire.publicfunded false
gdc.oaire.sciencefields 0301 basic medicine
gdc.oaire.sciencefields 0303 health sciences
gdc.oaire.sciencefields 03 medical and health sciences
gdc.openalex.collaboration National
gdc.openalex.fwci 0.84098967
gdc.openalex.normalizedpercentile 0.71
gdc.opencitations.count 4
gdc.plumx.crossrefcites 4
gdc.plumx.mendeley 6
gdc.plumx.scopuscites 4
gdc.scopus.citedcount 4
gdc.wos.citedcount 3
relation.isAuthorOfPublication.latestForDiscovery a081f8c3-cd7b-40d5-a9ca-74707d1b4dc7
relation.isOrgUnitOfPublication.latestForDiscovery 9af2b05f-28ac-4018-8abe-a4dfe192da5e

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Name:
5615.pdf
Size:
993.4 KB
Format:
Adobe Portable Document Format
Description:
Makale

License bundle

Now showing 1 - 1 of 1
Loading...
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: