Cluster-based fuzzy regression trees for software cost prediction

assia najm

Cluster-based fuzzy regression trees for software cost prediction

assia najm

Indonesian Journal of Electrical Engineering and Computer Science

Sign up for access to the world's latest research

checkGet notified about relevant papers

checkSave papers to use in your research

checkJoin the discussion with peers

checkTrack your impact

Abstract

The current paper proposes a novel type of decision tree, which is never used for software development cost prediction (SDCP) purposes, the cluster-based fuzzy regression tree (CFRT). This model uses the fuzzy k-means (FKM), which deals with data uncertainty and imprecision. The tree expansion is based on the variability measure by choosing the node with the highest value of granulation diversity. This paper outlined an experimental study comparing CFRT with four SDCP methods, notably linear regression, multi-layer perceptron, K-nearest-neighbors, and classification and regression trees (CART), employing eight datasets and the leave-one-out cross-validation (LOOCV). The results show that CFRT is among the best, ranked first in 3 datasets according to four accuracy measures. Also, according to the Pred(25%) values, the proposed CFRT model outperformed all the twelve compared techniques in four datasets: Albrecht, constructive cost model (COCOMO), Desharnais, and The International Sof...

Andreas Andreou

2009

This work addresses the issue of software effort prediction via fuzzy decision trees generated using historical project data samples. Moreover, the effect that various numerical and nominal project characteristics used as predictors have on software development effort is investigated utilizing the classification rules extracted. The approach attempts to classify successfully past project data into homogeneous clusters to provide accurate and reliable cost estimates within each cluster. CHAID and CART algorithms are applied on approximately 1000 project cost data records which were analyzed, preprocessed and used for generating fuzzy decision tree instances, followed by an evaluation method assessing prediction accuracy achieved by the classification rules produced. Even though the experimentation follows a heuristic approach, the trees built were found to fit the data properly, while the predicted effort values approximate well the actual effort.

Log In

Cluster-based fuzzy regression trees for software cost prediction

Sign up for access to the world's latest research

Abstract

Related papers

Related topics