2-D Thresholding of the Connectivity Map Following the Multiple Sequence Alignments of Diverse Datasets

dc.contributor.author Doğan, Tunca
dc.contributor.author Karaçalı, Bilge
dc.coverage.doi 10.2316/P.2013.791-092
dc.date.accessioned 2019-10-30T08:35:10Z
dc.date.available 2019-10-30T08:35:10Z
dc.date.issued 2013
dc.description 10th IASTED International Conference on Biomedical Engineering, BioMed 2013; Innsbruck; Austria; 13 February 2013 through 15 February 2013 en_US
dc.description.abstract Multiple sequence alignment (MSA) is a widely used method to uncover the relationships between the biomolecular sequences. One essential prerequisite to apply this procedure is to have a considerable amount of similarity between the test sequences. It's usually not possible to obtain reliable results from the multiple alignments of large and diverse datasets. Here we propose a method to obtain sequence clusters of significant intragroup similarities and make sense out of the multiple alignments containing remote sequences. This is achieved by thresholding the pairwise connectivity map over 2 parameters. The first one is the inferred pairwise evolutionary distances and the second parameter is the number of gapless positions on the pairwise comparisons of the alignment. Threshold curves are generated regarding the statistical parameter values obtained from a shuffled dataset and probability distribution techniques are employed to select an optimum threshold curve that eliminate as much of the unreliable connectivities while keeping the reliable ones. We applied the method on a large and diverse dataset composed of nearly 18000 human proteins and measured the biological relevance of the recovered connectivities. Our precision measure (0.981) was nearly 20% higher than the one for the connectivities left after a classical thresholding procedure displaying a significant improvement. Finally we employed the method for the functional clustering of protein sequences in a gold standard dataset. We have also measured the performance, obtaining a higher F-measure (0.882) compared to a conventional clustering operation (0.827). en_US
dc.identifier.citation Doğan, T, and Karaçalı, B. (2013, February 13-15). 2-D thresholding of the connectivity map following the multiple sequence alignments of diverse datasets. Paper presented at the 10th IASTED International Conference on Biomedical Engineering, BioMed 2013. doi:10.2316/P.2013.791-092 en_US
dc.identifier.doi 10.2316/P.2013.791-092 en_US
dc.identifier.doi 10.2316/P.2013.791-092
dc.identifier.isbn 978-088986953-0
dc.identifier.scopus 2-s2.0-84883861890
dc.identifier.uri http://doi.org/10.2316/P.2013.791-092
dc.identifier.uri https://hdl.handle.net/11147/7319
dc.language.iso en en_US
dc.publisher ACTA Press en_US
dc.relation.ispartof 10th IASTED International Conference on Biomedical Engineering, BioMed 2013 en_US
dc.rights info:eu-repo/semantics/openAccess en_US
dc.subject Biomedical computing en_US
dc.subject Biostatistics en_US
dc.subject Sequence analysis en_US
dc.subject Biomedical engineering en_US
dc.title 2-D Thresholding of the Connectivity Map Following the Multiple Sequence Alignments of Diverse Datasets en_US
dc.type Conference Object en_US
dspace.entity.type Publication
gdc.author.id 0000-0002-7765-6329
gdc.author.id 0000-0002-7765-6329 en_US
gdc.author.institutional Doğan, Tunca
gdc.author.institutional Karaçalı, Bilge
gdc.bip.impulseclass C5
gdc.bip.influenceclass C5
gdc.bip.popularityclass C5
gdc.coar.access open access
gdc.coar.type text::conference output
gdc.collaboration.industrial false
gdc.description.department İzmir Institute of Technology. Electrical and Electronics Engineering en_US
gdc.description.publicationcategory Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı en_US
gdc.description.scopusquality N/A
gdc.description.wosquality N/A
gdc.identifier.openalex W2322813073
gdc.index.type Scopus
gdc.oaire.diamondjournal false
gdc.oaire.impulse 0.0
gdc.oaire.influence 2.635068E-9
gdc.oaire.isgreen true
gdc.oaire.keywords Biomedical computing
gdc.oaire.keywords Sequence analysis
gdc.oaire.keywords Biostatistics
gdc.oaire.keywords Biomedical engineering
gdc.oaire.popularity 6.9093203E-10
gdc.oaire.publicfunded false
gdc.openalex.collaboration International
gdc.openalex.fwci 0.0
gdc.openalex.normalizedpercentile 0.08
gdc.opencitations.count 0
gdc.plumx.mendeley 1
gdc.plumx.scopuscites 0
gdc.scopus.citedcount 0
relation.isAuthorOfPublication.latestForDiscovery a081f8c3-cd7b-40d5-a9ca-74707d1b4dc7
relation.isOrgUnitOfPublication.latestForDiscovery 9af2b05f-28ac-4018-8abe-a4dfe192da5e

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Name:
7319.pdf
Size:
1.01 MB
Format:
Adobe Portable Document Format
Description:
Conference Paper

License bundle

Now showing 1 - 1 of 1
Loading...
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: