Scopus İndeksli Yayınlar Koleksiyonu / Scopus Indexed Publications Collection

Permanent URI for this collectionhttps://hdl.handle.net/11147/7148

Browse

Search Results

Now showing 1 - 7 of 7

Automating Software Size Measurement From Python Code Using Language Models (Vol 33, 19, 2026)
(Springer, 2025) Demirörs, Onur; Tenekeci, Samet; Gul, Bedir Arda; Keles, Damla; Kucuk, Murat; Demirors, Onur; 03.04. Department of Computer Engineering; 03. Faculty of Engineering; 01. Izmir Institute of Technology
Automating Software Size Measurement from Python Code Using Language Models
(Springer, 2025) Tenekeci, Samet; Demirörs, Onur; Gul, Bedir Arda; Keles, Damla; Kuuk, Murat; Demirors, Onur; 01. Izmir Institute of Technology; 03. Faculty of Engineering; 03.04. Department of Computer Engineering
Software size is a key input for project planning, effort estimation, and productivity analysis. While pre-trained language models have shown promise in deriving functional size from natural-language requirements, measuring size directly from source code remains under-explored. Yet, code-based size measurement is critical in modern workflows where requirement documents are often incomplete or unavailable, especially in Agile development environments. This exploratory study investigates the use of CodeBERT, a pre-trained bimodal transformer model, for measuring software size directly from Python source code according to two measurement methods: COSMIC Function Points and MicroM. We construct two curated datasets from the Python subset of the CodeSearchNet corpus, and manually annotate each function with its corresponding size. Our experimental results show that CodeBERT can successfully measure COSMIC data movements with up to 91.4% accuracy and generalize to the functional, architectural, and algorithmic event types defined in MicroM, reaching up to 81.5% accuracy. These findings highlight the potential of code-based language models for automated functional size measurement when requirement artifacts are absent or unreliable.
Citation - WoS: 1
Citation - Scopus: 1
Automating Software Size Measurement With Language Models: Insights From Industrial Case Studies
(Elsevier Science Inc, 2026) Demirörs, Onur; Tenekeci, Samet; Kennouche, Dhia Eddine; Demirors, Onur; 01. Izmir Institute of Technology; 03. Faculty of Engineering; 03.04. Department of Computer Engineering
Objective software size measurement is critical for accurate effort estimation, yet many organizations avoid it due to high costs, required expertise, and time-consuming manual effort. This often leads to vague predictions, poor planning, and project overruns. To address this challenge, we investigate the use of pre-trained language models - BERT and SE-BERT - to automate size measurement based on textual requirements using COSMIC and MicroM methods. We constructed one heterogeneous dataset and two industrial datasets, each manually measured by experienced analysts. Models were evaluated in three settings: (i) generic model evaluation, where the models are trained and tested on heterogeneous data, (ii) internal evaluation, where the models are trained and tested on organization-specific data, and (iii) external evaluation, where generic models were tested on organization-specific data. Results show that organization-specific models significantly outperform generic models, indicating that aligning training data with the target organization's requirement style is critical for accuracy. SE-BERT, a domain-adapted variant of BERT, improves performance, particularly in low-resource settings. These findings highlight the practical potential of tailoring training data for broader adoption and cost-effective software size measurement in industrial contexts.
Microarc: Event Driven Analysis and Design Method for Microservices
(Elsevier B.V., 2025) Demirörs, Onur; Demirors, Onur; 01. Izmir Institute of Technology; 03. Faculty of Engineering; 03.04. Department of Computer Engineering
The rapid development of the Internet infrastructure has enabled software applications to leverage almost unlimited and scalable resources. Microservice-based architecture has emerged as a solution to harness the benefits of a distributed cloud-based infrastructure. Event-driven architecture is a powerful approach for addressing challenges in distributed systems, such as scalability, distributed data, and sharing of data at scale. In an event-driven microservice architecture, decoupled services interact by responding to events and event streams facilitate data sharing between them. Despite these advantages, there is no de facto method for the analysis and design of systems within microservice architecture. Organizations often face difficulties in developing microservice-based systems, owing to the lack of well-defined methodologies for analysis and design. In this study, we present an analysis and design method for microservice-based systems. MicroArc is a method for analyzing and designing microservice-based systems, and comprises modeling notations, guiding processes to articulate how the method is applied, and a supporting tool for modelling. The MicroArc approach enables the identification of events and microservice candidates by modeling the flow of processes in the early phase of development. © 2025 Elsevier B.V., All rights reserved.
An Alternative Software Benchmarking Dataset: Effort Estimation With Machine Learning
(Elsevier Science Inc, 2026) Demirörs, Onur; Unlu, Huseyin; Demirors, Onur; 01. Izmir Institute of Technology; 03. Faculty of Engineering; 03.04. Department of Computer Engineering
Effort estimation plays a vital role in software project planning, as accurate estimates of required human resources are essential for success. Traditional estimation models often depend on historical size and effort data, yet organizations frequently struggle to access reliable effort records. Public benchmarking datasets like ISBSG offer useful data but may lack coverage or involve licensing fees. To address this issue, we previously introduced a free, extendable benchmarking dataset that integrates functional size and effort data extracted from 18 studies. In this study, we examine the effectiveness of our dataset for predictive effort estimation and compare it with the widely used ISBSG dataset. Our analysis includes 337 records from our dataset and 732 ISBSG projects, focusing on those with COSMIC size data. We first developed and compared models using linear regression and nine machine learning algorithms - Bayesian Ridge, Ridge Regression, Decision Tree, Random Forest, XGBoost, LightGBM, k-Nearest Neighbors, Multi-Layer Perceptron, and Support Vector Regression. Then, we selected the best-performing models and applied them to an unseen evaluation dataset to assess their generalization performance. The results show that machine learning performance varies based on evaluation method and dataset characteristics. Despite having fewer records, our dataset enabled more accurate predictions than ISBSG in most cases, highlighting its potential for effort estimation. This study demonstrates the viability of our dataset for building predictive models and supports the use of machine learning in improving estimation accuracy. Expanding this dataset could offer a valuable, open-access resource for organizations seeking effective and lowcost estimation solutions.
Citation - WoS: 1
Citation - Scopus: 1
Towards the Construction of a Software Benchmarking Dataset Via Systematic Literature Review
(IEEE, 2024) Yurum, Ozan Rasit; Demirörs, Onur; Unlu, Huseyin; Demirors, Onur; 03.04. Department of Computer Engineering; 03. Faculty of Engineering; 01. Izmir Institute of Technology
Effort estimation is a fundamental task during the planning of software projects. Prediction models usually rely on two essential factors: software size and effort data. Measuring the size of the software can be done at various stages of the project with desired accuracy. Nevertheless, the industry faces challenges when it comes to collecting reliable actual effort data. Consequently, organizations encounter difficulties in establishing effort prediction models. Benchmarking datasets are available, but, in most cases, they have huge variances that make them less useful for effort prediction. In this study, we aimed to answer whether creating a software benchmarking dataset is possible by gathering the data from the literature. To the best of our knowledge, a comprehensive dataset that gathers the functional size and effort data of the studies from the literature is unavailable. For this purpose, we performed a systematic literature review to find studies that include projects measured with the COSMIC Functional Size Measurement (FSM) method and the related effort. As a result, we formed a dataset including 337 records from 18 studies that shared the corresponding size and effort data. Although we performed a limited search, we created a larger dataset than many datasets in the literature. In light of our review, we obtained that most studies did not share their dataset, and many lacked case details such as implementation environment and the scope of software development life cycle activities included in the effort data. We also compared the dataset with the ISBSG repository and found that our dataset has less variation in productivity. Our review showed the applicability of creating a software benchmarking dataset is possible by gathering the data from the literature. In conclusion, this study addresses gaps in the literature through a cost-free and easily extendable dataset.
Analysis, Design, Test, and Devops in Microservice-Based Software Architectures: Results From Pakistan
(Springer international Publishing Ag, 2024) Demirörs, Onur; Soylu, Gorkem Kilinc; Ahmad, Isra Shafique; Demirors, Onur; 03.04. Department of Computer Engineering; 03. Faculty of Engineering; 01. Izmir Institute of Technology
In today's software industry, Microservice-based Software Architecture (MSSA) has been a common practice and has been adopted by many companies. MSSA differs from traditional object-oriented architecture in several ways. The architecture moved away from being data-driven and evolved into a behavior-oriented structure. The usage of a single database is replaced by the structures in which each microservice is developed independently and has its own database. Therefore, adaptation demands software organizations to transform their culture. However, there is no de facto method for analyzing, designing, and testing systems for these architectures, similar to object-oriented analysis and design practices. This study aimed to understand how Pakistani software organizations undertake analysis, design, test, and DevOps processes in software projects adopting the MSSA paradigm. To achieve this goal, we surveyed 49 participants from various agile organizations in Pakistan, encompassing different roles and domains. The results reveal that Pakistani software organizations continue using familiar object-oriented analysis and design approaches. However, they have already started exploring event-oriented analysis and design methods for MSSA projects.

Scopus İndeksli Yayınlar Koleksiyonu / Scopus Indexed Publications Collection

Browse

Filters

Settings

Sort By

Results per page

Search Results