Scopus İndeksli Yayınlar Koleksiyonu / Scopus Indexed Publications Collection

Permanent URI for this collectionhttps://hdl.handle.net/11147/7148

Browse

Search Results

Now showing 1 - 6 of 6
  • Article
    Citation - WoS: 1
    Citation - Scopus: 1
    Dnmso; an Ontology for Representing De Novo Sequencing Results From Tandem-Ms Data
    (PeerJ Inc., 2020) Takan, Savaş; Allmer, Jens
    For the identification and sequencing of proteins, mass spectrometry (MS) has become the tool of choice and, as such, drives proteomics. MS/MS spectra need to be assigned a peptide sequence for which two strategies exist. Either database search or de novo sequencing can be employed to establish peptide spectrum matches. For database search, mzIdentML is the current community standard for data representation. There is no community standard for representing de novo sequencing results, but we previously proposed the de novo markup language (DNML). At the moment, each de novo sequencing solution uses different data representation, complicating downstream data integration, which is crucial since ensemble predictions may be more useful than predictions of a single tool. We here propose the de novo MS Ontology (DNMSO), which can, for example, provide many-to-many mappings between spectra and peptide predictions. Additionally, an application programming interface (API) that supports any file operation necessary for de novo sequencing from spectra input to reading, writing, creating, of the DNMSO format, as well as conversion from many other file formats, has been implemented. This API removes all overhead from the production of de novo sequencing tools and allows developers to concentrate on algorithm development completely. We make the API and formal descriptions of the format freely available at https://github.com/savastakan/dnmso.
  • Letter
    Citation - Scopus: 9
    A Call for Benchmark Data in Mass Spectrometry-Based Proteomics
    (Proteomass Scientific Society, 2012) Allmer, Jens
    Proteomics is a quickly developing field. New and better mass spectrometers, the platform of choice in proteomics, are being introduced frequently. New algorithms for the analysis of mass spectrometric data and assignment of amino acid sequence to tandem mass spectra are also presented on a frequent basis. Unfortunately, the best application area for these algorithms cannot be established at the moment. Furthermore, even the accuracy of the algorithms and their relative performance cannot be established. This is due to the lack of proper benchmark data. This letter first introduces the field of mass spectrometry-based proteomics and then defines the expectations of a well-designed benchmark dataset. Thereafter, the current situation is compared to this ideal. A call for the creation of a proper benchmark dataset is then placed and it is explained how measurement should be performed. Finally, the benefits for the research community are highlighted. © 2012, Proteomass Scientific Society. All rights reserved.
  • Article
    Citation - WoS: 2
    Citation - Scopus: 5
    Pgminer: Complete Proteogenomics Workflow; From Data Acquisition To Result Visualization
    (Elsevier Ltd., 2017) Has, Canan; Allmer, Jens
    In parallel with the development of nucleotide sequencing an equally important interest in further describing the sequence in terms of function arose and the latter represents the current bottleneck in the overall research question. Sequencing the transcriptome allows determination of expressed nucleotide sequences and using mass spectrometry allows sequencing on the protein level. Both approaches can only sequence a subset of the existing transcripts. Moreover, for example post translational modification events can only be determined on the proteomics level. Therefore, it is essential to combine proteomics and genomics. For that purpose, proteogenomics data analysis pipelines have been described. Here, we describe a novel proteogenomics workflow which encompasses everything from the acquisition of data to result visualization in the Konstanz Information Miner (KNIME), a state of the art workflow management and data analytics platform. We amended KNIME with a number of processes like peptide consensus prediction, peptide mapping, and database equalizing, as well as result visualization. This enabled construction of our new workflow, entitled PGMiner, which not only includes all data analysis steps, but is highly customizable which is rather cumbersome for most existing pipelines. Furthermore, no burdensome installation processes have to be performed making PGMiner the most user friendly tool available.
  • Conference Object
    Citation - Scopus: 1
    Ranking Tandem Mass Spectra: and the Impact of Database Size and Scoring Function on Peptide Spectrum Matches
    (Institute of Electrical and Electronics Engineers Inc., 2013) Has, Canan; Kundakçı, Cemal Ulaş; Altay, Aybuge; Allmer, Jens
    Proteomics is currently driven by mass spectrometry. For the analysis of tandem mass spectra many computational algorithms have been proposed. There are two approaches, one which assigns a peptide sequence to a tandem mass spectrum directly and one which employs a sequence database for looking up possible solutions. The former method needs high quality spectra while the latter can tolerate lower quality spectra. Since both methods are computationally expensive, it is sensible to establish spectral quality using an independent fast algorithm. In this study, we first establish proper settings for database search algorithms for the analysis of spectra in our gold benchmark dataset and then analyze the performance of ScanRanker, an algorithm for quality assessment of tandem MS spectra, on this ground truth data. We found that OMSSA and MSGFDB have limitations in their scoring functions but were able to form a proper consensus prediction using majority vote for our benchmark data. Unfortunately, ScanRanker's results do not correlate well with the consensus and ScanRanker is also too slow to be used in the capacity it is supposed to be used. © 2013 IEEE
  • Article
    Citation - WoS: 91
    Citation - Scopus: 106
    Algorithms for the De Novo Sequencing of Peptides From Tandem Mass Spectra
    (Taylor & Francis, 2011) Allmer, Jens
    Proteomics is the study of proteins, their time- and location-dependent expression profiles, as well as their modifications and interactions. Mass spectrometry is useful to investigate many of the questions asked in proteomics. Database search methods are typically employed to identify proteins from complex mixtures. However, databases are not often available or, despite their availability, some sequences are not readily found therein. To overcome this problem, de novo sequencing can be used to directly assign a peptide sequence to a tandem mass spectrometry spectrum. Many algorithms have been proposed for de novo sequencing and a selection of them are detailed in this article. Although a standard accuracy measure has not been agreed upon in the field, relative algorithm performance is discussed. The current state of the de novo sequencing is assessed thereafter and, finally, examples are used to construct possible future perspectives of the field. © 2011 Expert Reviews Ltd.
  • Conference Object
    Relative Protein Quantitation With Post Translational Modifications in Mass Spectrometry Based Proteomics
    (Institute of Electrical and Electronics Engineers Inc., 2010) Allmer, Jens
    Mass spectrometry has become the tool of choice for most investigations in proteomics. Identification of proteins from complex mixtures has long been achieved and is now routinely used in countless high throughput studies. Quantitation by mass spectrometry is comparably newer and many different strategies have been proposed. One such strategy quantitates the difference in protein expression level among samples via extracted ion chromatograms, or spectral counts or a combination thereof. Another strategy involves mass modifications of the analytes in one or more of the samples under investigation. MSMAG has been developed as an extension to 2DB and it has been shown that it can aid in quantitation of data from experiments employing label-free quantitation. Recently, it has been extended to allow for analysis of data based on labelling strategies. This also makes it possible to quickly visualize and investigate inherent mass differences as presented by post translational modifications. ©2009 IEEE.