Removing Contamination From Genomic Sequences Based on Vector Reference Libraries

dc.contributor.author Bağcı, Caner
dc.contributor.author Allmer, Jens
dc.coverage.doi 10.1109/HIBIT.2012.6209053
dc.date.accessioned 2017-03-14T08:50:47Z
dc.date.available 2017-03-14T08:50:47Z
dc.date.issued 2012
dc.description 7th International Symposium on Health Informatics and Bioinformatics, HIBIT 2012; Cappadocia; Turkey; 19 April 2012 through 22 April 2012 en_US
dc.description.abstract DNA is often sequenced after being cloned into a vector since this provides the possibility for using standard primers and removes the need to develop custom primers. In this way a certain amount of vector is sequenced along with the sequence of interest. Unfortunately, occasionally these contaminating vector sequences find their way into public databases as part of submitted sequences. It has been pointed out that SeqClean, a program used to remove vector contamination from sequences, does not take into account that vectors are circular structures. A workaround has been presented before, but we were able to simplify the process and, additionally, we provide an implementation. We further applied our method to a test set of EST sequences and also analyzed the amount of contamination found in the EST sequences available on NCBI. © 2012 IEEE. en_US
dc.identifier.citation Bağcı, C., and Allmer, J. (2012, April 19-22). Removing contamination from genomic sequences based on vector reference libraries. Paper presented at the 7th International Symposium on Health Informatics and Bioinformatics, HIBIT 2012. doi:10.1109/HIBIT.2012.6209053 en_US
dc.identifier.doi 10.1109/HIBIT.2012.6209053 en_US
dc.identifier.isbn 9781467308786
dc.identifier.scopus 2-s2.0-84862734476
dc.identifier.uri http://doi.org/10.1109/HIBIT.2012.6209053
dc.identifier.uri http://hdl.handle.net/11147/5046
dc.language.iso en en_US
dc.publisher Institute of Electrical and Electronics Engineers Inc. en_US
dc.relation.ispartof 7th International Symposium on Health Informatics and Bioinformatics, HIBIT 2012 en_US
dc.rights info:eu-repo/semantics/openAccess en_US
dc.subject Circular structures en_US
dc.subject Genomic sequence en_US
dc.subject Public database en_US
dc.subject Vectors en_US
dc.subject Test sets en_US
dc.title Removing Contamination From Genomic Sequences Based on Vector Reference Libraries en_US
dc.type Conference Object en_US
dspace.entity.type Publication
gdc.author.institutional Bağcı, Caner
gdc.author.institutional Allmer, Jens
gdc.bip.impulseclass C5
gdc.bip.influenceclass C5
gdc.bip.popularityclass C5
gdc.coar.access open access
gdc.coar.type text::conference output
gdc.collaboration.industrial false
gdc.description.department İzmir Institute of Technology. Molecular Biology and Genetics en_US
gdc.description.endpage 122 en_US
gdc.description.publicationcategory Conference Item - International - Institutional Faculty Member en_US
gdc.description.scopusquality N/A
gdc.description.startpage 118 en_US
gdc.description.wosquality N/A
gdc.identifier.openalex W2113954317
gdc.index.type Scopus
gdc.oaire.diamondjournal false
gdc.oaire.impulse 0.0
gdc.oaire.influence 2.6660971E-9
gdc.oaire.isgreen true
gdc.oaire.keywords Test sets
gdc.oaire.keywords Public database
gdc.oaire.keywords Circular structures
gdc.oaire.keywords Vectors
gdc.oaire.keywords Genomic sequence
gdc.oaire.popularity 6.362067E-10
gdc.oaire.publicfunded false
gdc.oaire.sciencefields 0301 basic medicine
gdc.oaire.sciencefields 03 medical and health sciences
gdc.oaire.sciencefields 0206 medical engineering
gdc.oaire.sciencefields 02 engineering and technology
gdc.openalex.collaboration National
gdc.openalex.fwci 0.0
gdc.openalex.normalizedpercentile 0.11
gdc.opencitations.count 1
gdc.plumx.crossrefcites 1
gdc.plumx.mendeley 10
gdc.plumx.scopuscites 1
gdc.scopus.citedcount 1
relation.isAuthorOfPublication.latestForDiscovery bf9f97a4-6d62-49cd-a7c8-1bc8463d14d2
relation.isOrgUnitOfPublication.latestForDiscovery 9af2b05f-28ac-4013-8abe-a4dfe192da5e
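
The circularity issue raised in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation and does not reproduce SeqClean's behavior: exact k-mer matching stands in for a real alignment step, and the names `vector_kmers` and `contaminated_spans`, as well as the toy sequences, are hypothetical. The idea shown is only that a circular vector has no true end, so a read can contain vector sequence spanning the point where the reference was linearized; appending the first k-1 bases of the reference to its end makes such junction matches visible.

```python
def vector_kmers(vector: str, k: int, circular: bool = True) -> set:
    """Collect all k-mers of a vector reference (hypothetical helper).

    If `circular` is True, the first k-1 bases are appended to the end so
    that k-mers spanning the linearization point are included as well.
    """
    seq = vector.upper()
    if circular:
        seq += seq[: k - 1]
    return {seq[i : i + k] for i in range(len(seq) - k + 1)}


def contaminated_spans(read: str, kmers: set, k: int) -> list:
    """Return merged half-open (start, end) spans of `read` covered by vector k-mers."""
    read = read.upper()
    spans = []
    for i in range(len(read) - k + 1):
        if read[i : i + k] in kmers:
            if spans and i <= spans[-1][1]:
                # Overlaps or abuts the previous hit: extend that span.
                spans[-1][1] = i + k
            else:
                spans.append([i, i + k])
    return [tuple(s) for s in spans]


# Toy data (assumed, not from the paper): the read starts with vector
# sequence that runs across the end/start junction of the reference.
vector = "ACGTTGCAAGGCTA"
read = "GGCTAACG" + "CCCCCCCC"

circular_hits = contaminated_spans(read, vector_kmers(vector, 6), 6)
linear_hits = contaminated_spans(read, vector_kmers(vector, 6, circular=False), 6)
```

With the reference treated as linear, the junction-spanning contamination in this toy read goes undetected; with the circular extension, the leading vector-derived bases are reported as one span that could then be trimmed or masked.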

Files

Original bundle

Name: 5046.pdf
Size: 279.16 KB
Format: Adobe Portable Document Format
Description: Conference Paper

License bundle

Name: license.txt
Size: 1.71 KB
Format: Item-specific license agreed upon to submission
Description: