Software cost estimation technique based on bagging ensemble learning algorithm

Document Type : Research Paper

Authors

Software Department, College of Computer & Math, University of Mosul, Iraq.

10.30772/qjes.2024.147128.1138
Abstract
Recently, software cost estimation has become a more important issue in the software project development cycle, software quality, and decision-making management. In view of the common problem of inaccurate and difficult cost estimation in the software industry, in this article, the proposed bagging method is one of the ensemble learning methods to estimate the cost of software development. Five algorithms were used as the basic models: Random Forest, Decision Tree, AdaBoost, K-Nearest Neighbor, and Gradient Boosting, compiled using the bagging method. The proposed method was applied to a data set (ISBSG). The contribution of the paper suggests a more accurate method compared to previous studies, and applying it to high-quality data, which was prepared to obtain more accurate results when applying the proposed model, which showed its superiority over individual models in estimating the cost of software development. The results showed high accuracy in R2 prediction in ratio (97%) and gave a lower error rate (MMRE: Mean Magnitude of Relative Error) compared to previous studies in ratio (0.1). This indicates its accuracy in prediction is closer to the real cost, where the RF model was the basic estimator model in this method, because it surpasses the main models that were used in the proposed method. The KNN model gave the lowest accuracy ratio (73%) among the main models when trained on the ISBSG dataset.

Keywords

Subjects

Volume 19, Issue 2
Summer 2026
Pages 188-194

  • Receive Date 22 April 2024
  • Revise Date 14 March 2025
  • Accept Date 22 April 2026