Master Degree / Yüksek Lisans Tezleri

Permanent URI for this collectionhttps://hdl.handle.net/11147/3008

Browse

Search Results

Now showing 1 - 2 of 2
  • Master Thesis
    Predicting Software Size From Requirements Written in Natural Language: a Generative Ai Approach
    (01. Izmir Institute of Technology, 2024) Kennouche, Dhıa Eddıne; Demirörs, Onur
    In project management, software size measurement represents a critical process aimed at visualizing a project. This quantification is pursued independently of the specific technologies or technical decisions adopted during the project's development phase. Among the various methodologies employed for this purpose, the COSMIC Functional Size Measurement (FSM) and Event Points are used to facilitate such assessments. These methodologies are instrumental in offering a standardized approach for measuring software size, yet they inherently demand a considerable amount of manual effort. Furthermore, these methods require the manual extraction of Objects of Interest and Event Names, adding to the labor-intensive nature of the process. In response to these challenges, this thesis implements a suite of Artificial Intelligence (AI)-based methods that have dramatically transformed the measurement process. These innovative approaches encompass the creation of a Regression Model that predicts software sizes with remarkable accuracy, a Summarization Model that automates the extraction of Event Names, and a finely tuned Large Language Model (LLM) that generates Objects of Interest with a significant precision. The adoption of these AI-driven techniques has proven to be highly successful, substantially minimizing the manual effort traditionally required in software size measurement and thereby greatly enhancing both efficiency and reliability of estimation practices. Together, these AI-based methodologies represent a significant advancement in software size measurements, offering a more streamlined and efficient approach. By reducing the reliance on manual processes, these methods not only enhance the accuracy and reliability of measurements but also contribute to a more agile project management environment.
  • Master Thesis
    Learning Citation-Aware Representations for Scientific Papers
    (01. Izmir Institute of Technology, 2024) Çelik, Ege Yiğit; Tekir, Selma
    In the field of Natural Language Processing (NLP), the tasks of understanding and generating scientific documents are highly challenging and have been extensively studied. Comprehending scientific papers can facilitate the generation of their contents. Similarly, understanding the relationships between scientific papers and their citations can be instrumental in generating and predicting citations within the text of scientific works. Moreover, language models equipped with citation-aware representations can be particularly robust for downstream tasks involving scientific literature. This thesis aims to enhance the accuracy of citation predictions within scientific texts. To achieve this, we hide citations within the context of scientific papers using mask tokens and subsequently pre-train the RoBERTa-base language model to predict citations for these masked tokens. We ensure that each citation is treated as a single token to be predicted by the mask-filling language model. Consequently, our models function as language models with citation-aware representations. Furthermore, we propose two alternative techniques for our approach. Our base technique predicts citations using only the contexts from scientific papers, while our global technique incorporates the titles and abstracts of papers alongside the contexts to improve performance. Experimental results demonstrate that our models significantly surpass the state-of-the-art results on two out of four benchmark datasets. However, for the remaining two datasets, our models yield suboptimal results, indicating potential for further improvement. Additionally, we conducted experiments on sampled datasets to examine the effects of inherent factors on the datasets and to identify correlations between these factors and our results.