Evaluation of Protein Secondary Structure Prediction Algorithms on a New Advanced Benchmark Dataset

dc.contributor.advisor Allmer, Jens
dc.contributor.author Has, Canan
dc.date.accessioned 2014-07-22T13:51:00Z
dc.date.available 2014-07-22T13:51:00Z
dc.date.issued 2011
dc.description Thesis (Master)--Izmir Institute of Technology, Molecular Biology and Genetics, Izmir, 2011 en_US
dc.description Includes bibliographical references (leaves: 39-43) en_US
dc.description Text in English; Abstract: Turkish and English en_US
dc.description xi, 43 leaves en_US
dc.description Full text release delayed at author's request until 2015.01.17 en_US
dc.description.abstract Starting from 1970s, researchers have been studying secondary structure prediction. However the accuracy of state-of art methods reach to approximately 80- 85%. One of the reasons for that is related with the limitations in respect to datasets used for training or testing the algorithm. A number of databases with n number of experimentally determined proteins, which also contain the knowledge of functionality, biochemical properties and location annotation of proteins, will directly show us how the algorithms work on certain groups of proteins. This also ensures opportunity to users to determine the quality of algorithms on those datasets and to decide on which algorithm can be used for which type of proteins. In this thesis, the objective is set through the development of a new and advanced protein benchmark database which contains functional and biochemical information of experimentally defined 64872 proteins in S2C database derived by ProteinDataBank (PDB). With this database, the seven available predictors are evaluated in respect to their performances on different datasets in terms of functionality and subcellular localization of proteins in the benchmark database. According to the results obtained on proposed benchmark datasets in compare to results on one of existing dataset, RS126, it was shown that grouping proteins into functions in their subcellular localizations have a great impact on deciding the accuracies of existing algorithms. en_US
dc.identifier.uri https://hdl.handle.net/11147/3161
dc.language.iso en en_US
dc.publisher Izmir Institute of Technology en_US
dc.rights info:eu-repo/semantics/openAccess en_US
dc.subject.lcsh Bioinformatics en
dc.subject.lcsh Proteins--Computer simulation en
dc.title Evaluation of Protein Secondary Structure Prediction Algorithms on a New Advanced Benchmark Dataset en_US
dc.type Master Thesis en_US
dspace.entity.type Publication
gdc.author.institutional Has, Canan
gdc.coar.access open access
gdc.coar.type text::thesis::master thesis
gdc.description.department Thesis (Master)--İzmir Institute of Technology, Molecular Biology and Genetics en_US
gdc.description.publicationcategory Tez en_US
gdc.description.scopusquality N/A
gdc.description.wosquality N/A
relation.isAuthorOfPublication.latestForDiscovery bf9f97a4-6d62-49cd-a7c8-1bc8463d14d2
relation.isOrgUnitOfPublication.latestForDiscovery 9af2b05f-28ac-4013-8abe-a4dfe192da5e

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Name:
421602.pdf
Size:
1.21 MB
Format:
Adobe Portable Document Format
Description:
MasterThesis

License bundle

Now showing 1 - 1 of 1
Loading...
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: