Towards the Construction of a Software Benchmarking Dataset Via Systematic Literature Review
Loading...
Date
2024
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
IEEE
Open Access Color
Green Open Access
Yes
OpenAIRE Downloads
OpenAIRE Views
Publicly Funded
No
Abstract
Effort estimation is a fundamental task during the planning of software projects. Prediction models usually rely on two essential factors: software size and effort data. Measuring the size of the software can be done at various stages of the project with desired accuracy. Nevertheless, the industry faces challenges when it comes to collecting reliable actual effort data. Consequently, organizations encounter difficulties in establishing effort prediction models. Benchmarking datasets are available, but, in most cases, they have huge variances that make them less useful for effort prediction. In this study, we aimed to answer whether creating a software benchmarking dataset is possible by gathering the data from the literature. To the best of our knowledge, a comprehensive dataset that gathers the functional size and effort data of the studies from the literature is unavailable. For this purpose, we performed a systematic literature review to find studies that include projects measured with the COSMIC Functional Size Measurement (FSM) method and the related effort. As a result, we formed a dataset including 337 records from 18 studies that shared the corresponding size and effort data. Although we performed a limited search, we created a larger dataset than many datasets in the literature. In light of our review, we obtained that most studies did not share their dataset, and many lacked case details such as implementation environment and the scope of software development life cycle activities included in the effort data. We also compared the dataset with the ISBSG repository and found that our dataset has less variation in productivity. Our review showed the applicability of creating a software benchmarking dataset is possible by gathering the data from the literature. In conclusion, this study addresses gaps in the literature through a cost-free and easily extendable dataset.
Description
Keywords
Software Size Measurement, Effort Estimation, Dataset, Benchmarking, Cosmic, Systematic Literature Review
Fields of Science
Citation
WoS Q
N/A
Scopus Q
Q4

OpenCitations Citation Count
N/A
Source
50th Euromicro Conference on Software Engineering and Advanced Applications -- AUG 28-30, 2024 -- Paris, FRANCE
Volume
Issue
Start Page
194
End Page
200
PlumX Metrics
Citations
Scopus : 1
Captures
Mendeley Readers : 1
SCOPUS™ Citations
1
checked on Apr 27, 2026
Web of Science™ Citations
1
checked on Apr 27, 2026
Page Views
18
checked on Apr 27, 2026
Google Scholar™


