2012, International Journal on Computational Science & Applications
Software fault prediction plays a vital role in software quality assurance. Identifying faulty modules helps testers concentrate on those modules and improves the quality of the software. With the increasing complexity of today's software, feature selection is important for removing redundant, irrelevant and erroneous data from the dataset. In general, feature selection is done using either filter or wrapper approaches. In this paper a hybrid feature selection method is proposed which gives better prediction than the traditional methods. NASA's public dataset KC1, available at the PROMISE software engineering repository, is used. To evaluate the performance of the software fault prediction models, Accuracy, Mean Absolute Error (MAE) and Root Mean Squared Error (RMSE) values are used.
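A minimal sketch of how such a fault prediction model might be scored with these three measures, assuming the KC1 metrics and binary defect labels are already loaded into arrays `X` and `y` (the Naive Bayes classifier and the 70/30 split are illustrative choices, not the paper's setup):

```python
# Illustrative scoring of a fault-prediction model with Accuracy, MAE and RMSE.
# Assumes X (module metrics) and y (0 = clean, 1 = faulty) are numpy arrays
# already loaded from the KC1 dataset; the classifier choice is arbitrary.
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB
from sklearn.metrics import accuracy_score, mean_absolute_error, mean_squared_error

def evaluate_fault_predictor(X, y, seed=42):
    X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.3,
                                              stratify=y, random_state=seed)
    model = GaussianNB().fit(X_tr, y_tr)
    y_pred = model.predict(X_te)

    acc = accuracy_score(y_te, y_pred)
    mae = mean_absolute_error(y_te, y_pred)
    rmse = np.sqrt(mean_squared_error(y_te, y_pred))
    return acc, mae, rmse
```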
Journal of Research and Development on Information and Communication Technology, 2021
The rapid growth of data has become a huge challenge for software systems. The quality of a fault prediction model depends on the quality of the software dataset. High-dimensional data is a major problem that affects the performance of fault prediction models. To deal with the dimensionality problem, feature selection has been proposed by various researchers. Feature selection provides an effective solution by eliminating irrelevant and redundant features, reducing computation time and improving the accuracy of the machine learning model. In this study, we focus on research and synthesis of filter-based feature selection with several search methods and algorithms. In addition, five filter-based feature selection methods are analyzed using five different classifiers over datasets obtained from the National Aeronautics and Space Administration (NASA) repository. The experimental results show that Chi-Square and Information Gain methods had the best influence on the results of pre...
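As an illustration of the two filter methods the study singles out, the sketch below ranks features with Chi-Square and Information Gain (mutual information) using scikit-learn; keeping the top ten features is an assumption, not a value from the paper.

```python
# Filter-based feature selection with Chi-Square and Information Gain (mutual
# information), keeping the top-k ranked features. Chi-Square assumes the
# metric values in X are non-negative; y holds the binary defect labels.
from sklearn.feature_selection import SelectKBest, chi2, mutual_info_classif

def chi_square_filter(X, y, k=10):
    selector = SelectKBest(score_func=chi2, k=k).fit(X, y)
    return selector.transform(X), selector.get_support(indices=True)

def info_gain_filter(X, y, k=10):
    selector = SelectKBest(score_func=mutual_info_classif, k=k).fit(X, y)
    return selector.transform(X), selector.get_support(indices=True)
```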
Journal of Communications Technology, Electronics and Computer Science, 2016
Improving software product quality before release through periodic testing is one of the most expensive activities in software projects. Because resources for module testing are limited, it is important to identify fault-prone modules and direct the testing resources toward them. Software fault predictors based on machine learning algorithms are effective tools for identifying fault-prone modules. Extensive studies are being done in this field to find the connection between the features of software modules and their fault-proneness. Some features used in predictive algorithms are ineffective and reduce the accuracy of the prediction process, so feature selection methods are widely used to increase the performance of prediction models for fault-prone modules. In this study, we propose a feature selection method for effective selection of features using a combination of filter feature selection methods. In the proposed filter method, the combination of several filter ...
International Journal on Electrical Engineering and Informatics
Software engineering comprises several activities to ensure that a quality product is achieved in the end. Some of these activities are software testing, inspection, formal verification and software defect prediction. Many researchers have developed models for defect prediction. These models are based on machine learning and statistical analysis techniques. The main objective of these models is to identify defects before delivery of the software to the end user. This prediction helps project managers to effectively utilize resources for better quality assurance. Sometimes a single defect can cause an entire system failure, and most of the time defects drop the quality of the software system drastically. Early identification of defects can also help to make a better process plan which handles defects effectively and increases customer satisfaction. However, accurate prediction of defects in software is not an easy task because it is an indirect measure. Therefore, it is important to find suitable and significant measures which are most relevant for finding defects in the software system. This paper presents a feature selection based model to predict the defects in a given software module. The most relevant features are extracted from all features with the help of seven feature selection techniques, and eight classifiers are used to classify the modules. Six NASA software engineering defect prediction data sets are used in this work. Several performance parameters are also calculated for measuring the performance and validation of this work, and the results of the experiments revealed that the proposed model has greater capability to predict software defects.
International Journal of Computer Applications, 2014
Feature subset selection is the process of choosing a subset of good features with respect to the target concept. A clustering-based feature subset selection algorithm has been applied over software defect prediction data sets. The software defect prediction domain was chosen due to the growing importance of maintaining high reliability and high quality for any software being developed. A software quality prediction model is built using software metrics and defect data collected from a previously developed system release or similar software projects. Upon validation of such a model, it can be used for predicting the fault-proneness of program modules that are currently under development. The proposed clustering-based feature selection algorithm uses a minimum spanning tree based method to cluster features. The algorithm is then applied over four different data sets and its impact is analyzed.
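A rough sketch of clustering features via a minimum spanning tree is shown below; the use of 1 − |Pearson correlation| as the edge weight and the edge-cut threshold are assumptions for illustration, not necessarily the measures used by the algorithm in the paper.

```python
# Sketch of clustering features with a minimum spanning tree: build a complete
# graph over features weighted by a correlation-based distance, take its MST,
# drop long edges, and treat the remaining connected components as clusters.
import numpy as np
from scipy.sparse.csgraph import minimum_spanning_tree, connected_components

def mst_feature_clusters(X, edge_cut=0.7):
    # Distance between features: 1 - |Pearson correlation|.
    corr = np.corrcoef(X, rowvar=False)
    dist = 1.0 - np.abs(corr)
    dist += 1e-9               # keep genuine zero distances as explicit edges
    np.fill_diagonal(dist, 0)

    # MST over the complete feature graph, then remove edges above the cut.
    mst = minimum_spanning_tree(dist).toarray()
    mst[mst > edge_cut] = 0.0

    # Remaining connected components are the feature clusters.
    n_clusters, labels = connected_components(mst, directed=False)
    return n_clusters, labels
```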
International Journal of Embedded Systems
Software is a collection of computer programs written in a programming language. Software contains various modules which make it a complex entity, and this complexity increases the probability of defects during development of the modules. In turn, the cost and time to develop the software can increase. Sometimes these defects can lead to failure of the entire software, which delays delivery of the software to the customer. Such untimely delivery can be responsible for withdrawal or cancellation of the project. Hence, in this research work, several machine learning algorithms are applied to predict defects and help ensure timely delivery. Further, several feature selection techniques are also adopted to determine the features relevant for defect prediction.
Information Systems Frontiers, 2013
Two important problems which can affect the performance of classification models are high dimensionality (an overabundance of independent features in the dataset) and imbalanced data (a skewed class distribution which creates at least one class with many fewer instances than other classes). To resolve these problems concurrently, we propose an iterative feature selection approach which repeatedly applies data sampling (to address class imbalance) followed by feature selection (to address high dimensionality), and finally performs an aggregation step which combines the ranked feature lists from the separate iterations of sampling. This approach is designed to find a ranked feature list which is particularly effective on the more balanced dataset resulting from sampling, while minimizing the risk of losing data through the sampling step and missing important features. To demonstrate this technique, we employ 18 different feature selection algorithms and Random Undersampling with two post-sampling class distributions. We also investigate
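The iterative scheme can be sketched roughly as follows; the mutual-information ranker, the 50:50 post-sampling distribution, the rank-sum aggregation, and the ten iterations are assumptions for illustration, not the 18 rankers or settings used in the study.

```python
# Sketch of the iterative idea: repeatedly undersample the majority class,
# rank features on each balanced sample, then aggregate the per-iteration ranks.
import numpy as np
from sklearn.feature_selection import mutual_info_classif

def undersample(X, y, rng):
    pos = np.flatnonzero(y == 1)            # minority (defective) modules
    neg = np.flatnonzero(y == 0)
    keep_neg = rng.choice(neg, size=len(pos), replace=False)
    idx = np.concatenate([pos, keep_neg])
    return X[idx], y[idx]

def iterative_feature_ranking(X, y, n_iter=10, seed=0):
    rng = np.random.default_rng(seed)
    rank_sum = np.zeros(X.shape[1])
    for _ in range(n_iter):
        Xs, ys = undersample(X, y, rng)
        scores = mutual_info_classif(Xs, ys, random_state=0)
        # double argsort gives rank 0 to the best-scoring feature
        rank_sum += np.argsort(np.argsort(-scores))
    return np.argsort(rank_sum)             # features ordered best-first
```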
Software defect prediction has recently become an important research topic in the software engineering field. Accurate prediction of defect-prone software modules can help direct the software testing effort, reduce costs, and improve the software testing process by focusing on fault-prone modules. Software defect data sets have an imbalanced nature, with very few defective modules compared to defect-free ones. Prediction performance also decreases significantly when the dataset contains noisy attributes. In this research, we propose the combination of a genetic algorithm and the bagging technique for improving the performance of software defect prediction. The genetic algorithm is applied to deal with feature selection, and bagging is employed to deal with the class imbalance problem. The proposed method is evaluated using data sets from the NASA metric data repository. The results indicate that the proposed method achieves an impressive improvement in prediction performance for most classifiers.
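A compact sketch of the two ingredients is given below: a genetic algorithm searching over feature-subset bit masks, with the cross-validated AUC of a bagged decision-tree classifier as fitness. The population size, mutation rate, selection scheme, and base learner are illustrative assumptions rather than the paper's settings.

```python
# Genetic-algorithm feature selection with a bagging classifier as the learner
# whose cross-validated AUC serves as the fitness of a candidate feature mask.
import numpy as np
from sklearn.ensemble import BaggingClassifier
from sklearn.tree import DecisionTreeClassifier
from sklearn.model_selection import cross_val_score

def fitness(mask, X, y):
    if not mask.any():
        return 0.0
    clf = BaggingClassifier(DecisionTreeClassifier(), n_estimators=10, random_state=0)
    return cross_val_score(clf, X[:, mask], y, cv=3, scoring="roc_auc").mean()

def ga_feature_selection(X, y, pop_size=20, n_gen=15, p_mut=0.05, seed=0):
    rng = np.random.default_rng(seed)
    n_feat = X.shape[1]
    pop = rng.random((pop_size, n_feat)) < 0.5            # random bit masks
    for _ in range(n_gen):
        fit = np.array([fitness(ind, X, y) for ind in pop])
        parents = pop[np.argsort(-fit)[: pop_size // 2]]  # truncation selection
        children = []
        for _ in range(pop_size - len(parents)):
            a, b = parents[rng.integers(len(parents), size=2)]
            cut = rng.integers(1, n_feat)                  # one-point crossover
            child = np.concatenate([a[:cut], b[cut:]])
            child ^= rng.random(n_feat) < p_mut            # bit-flip mutation
            children.append(child)
        pop = np.vstack([parents, np.array(children)])
    best = pop[np.argmax([fitness(ind, X, y) for ind in pop])]
    return np.flatnonzero(best)                            # selected feature indices
```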
International Journal of Modern Education and Computer Science, 2020
Testing is considered one of the most expensive activities in the software development process. Fixing defects during the testing process can increase the cost as well as the completion time of the project. The cost of the testing process can be reduced by identifying defective modules during the development (before testing) stage. This process is known as "Software Defect Prediction", which has been widely studied by many researchers over the last two decades. This research proposes a classification framework for the prediction of defective modules using variant-based ensemble learning and feature selection techniques. The variant selection activity identifies the best optimized versions of classification techniques so that their ensemble can achieve high performance, whereas feature selection is performed to remove features which do not participate in classification and cause lower performance. The proposed framework is implemented on four cleaned NASA datasets from the MDP repository and evaluated using three performance measures: F-measure, Accuracy, and MCC. According to the results, the proposed framework outperformed 10 widely used supervised classification techniques, including:
Symmetry
Finding defects early in a software system is a crucial task, as it creates adequate time for fixing such defects using available resources. Strategies such as symmetric testing have proven useful; however, their inability to differentiate incorrect implementations from correct ones is a drawback. Software defect prediction (SDP) is another feasible method that can be used for detecting defects early. Additionally, high dimensionality, a data quality problem, has a detrimental effect on the predictive capability of SDP models. Feature selection (FS) has been used as a feasible solution for solving the high dimensionality issue in SDP. According to the current literature, the two basic forms of FS approaches are filter-based feature selection (FFS) and wrapper-based feature selection (WFS). Between the two, WFS approaches have been deemed superior. However, WFS methods have a high computational cost due to the unknown number of executions available for feature subset search, evalua...
2020
Feature selection is a technique used to select an optimal feature subset from the original input features according to a specific criterion. The criterion is often formulated as an objective function that finds which features are most appropriate for the task at hand. Finding a subset of features is attractive because it is always easier to solve a problem in a lower dimension, and it helps in understanding the nonlinear mapping between input and output variables. This paper reviews the basic feature selection techniques for software defect prediction models and their domain applications. Subset selection methods are categorized into three distinct models and are discussed concisely to provide young researchers with the general methods of subset selection. Support Vector Machine with Recursive Feature Elimination, together with Logistic Regression and Random Forest, was introduced to evaluate the performance of filter, wrapper, and embedded feature select...
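For illustration, Recursive Feature Elimination can be run with different estimators as sketched below; the choice of keeping ten features is an assumption.

```python
# Recursive Feature Elimination (RFE) with two different estimators, in the
# spirit of the wrapper/embedded comparison described above.
from sklearn.feature_selection import RFE
from sklearn.linear_model import LogisticRegression
from sklearn.ensemble import RandomForestClassifier

def rfe_logistic(X, y, n_keep=10):
    rfe = RFE(LogisticRegression(max_iter=1000),
              n_features_to_select=n_keep).fit(X, y)
    return rfe.get_support(indices=True)

def rfe_random_forest(X, y, n_keep=10):
    rfe = RFE(RandomForestClassifier(n_estimators=100, random_state=0),
              n_features_to_select=n_keep).fit(X, y)
    return rfe.get_support(indices=True)
```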
Journal of Systems Engineering and Electronics, 2021
Software defect prediction (SDP) is used to perform statistical analysis of historical defect data to find the distribution rule of historical defects, so as to effectively predict defects in new software. However, software defect datasets contain redundant and irrelevant features that affect the performance of defect predictors. In order to identify and remove these redundant and irrelevant features, we propose ReliefF-based clustering (RFC), a cluster-based feature selection algorithm. The correlation between features is calculated based on symmetric uncertainty. According to the degree of correlation, RFC partitions the features into k clusters using the k-medoids algorithm, and finally selects representative features from each cluster to form the final feature subset. In the experiments, we compare the proposed RFC with classical feature selection algorithms on nine National Aeronautics and Space Administration (NASA) software defect prediction datasets in terms of area under the curve (AUC) and F-value. The experimental results show that RFC can effectively improve the performance of SDP.
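The symmetric uncertainty measure on which the feature correlation is based can be sketched as follows; the equal-width binning used to discretise continuous metrics is an assumption for illustration, and the k-medoids clustering and representative-selection steps are not shown.

```python
# Symmetric uncertainty between two features:
#   SU(X, Y) = 2 * I(X; Y) / (H(X) + H(Y))
import numpy as np

def entropy(values):
    _, counts = np.unique(values, return_counts=True)
    p = counts / counts.sum()
    return -np.sum(p * np.log2(p))

def symmetric_uncertainty(x, y, bins=10):
    xd = np.digitize(x, np.histogram_bin_edges(x, bins=bins))
    yd = np.digitize(y, np.histogram_bin_edges(y, bins=bins))
    hx, hy = entropy(xd), entropy(yd)
    hxy = entropy(xd * (yd.max() + 1) + yd)   # H(X, Y) via unique pair labels
    mi = hx + hy - hxy                        # mutual information I(X; Y)
    return 2.0 * mi / (hx + hy) if (hx + hy) > 0 else 0.0
```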
International Journal of Advanced Science and Technology, 2014
Software is a complex entity composed of various modules, each with a varied range of defect occurrence possibility. Efficient and timely prediction of defect occurrence allows software project managers to effectively utilize people, cost and time for better quality assurance. The presence of defects leads to poor quality software and is also responsible for the failure of a software project. Sometimes it is not possible to identify defects and fix them at the time of development, and such defects must be handled whenever they are noticed by team members. It is therefore important to predict defect-prone software modules prior to deployment of a software project in order to plan a better maintenance strategy. Early knowledge of defect-prone software modules can also help to make an efficient process improvement plan within a justified period of time and cost, which can further lead to better software releases and higher customer satisfaction. Accurate measurement and prediction of defects is a crucial issue for any software because it is an indirect measurement based on several metrics. Therefore, instead of considering all metrics, it is more appropriate to find a suitable set of metrics which are relevant and significant for predicting defects in software modules. This paper proposes a feature selection based Linear Twin Support Vector Machine (LSTSVM) model to predict defect-prone software modules. F-score, a feature selection technique, is used to determine the significant metrics that most prominently affect defect prediction in software modules. The efficiency of the predictive model is enhanced by the reduced metric set obtained after feature selection, which is then used to identify defective modules in a given set of inputs. This paper evaluates the performance of the proposed model and compares it against other existing machine learning models. The experiments have been performed on four PROMISE software engineering repository datasets. The experimental results indicate the effectiveness of the proposed feature selection based LSTSVM predictive model on the basis of standard performance evaluation parameters.
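For illustration, the F-score ranking step might look like the sketch below, using one common definition of the F-score; the paper's exact variant and selection threshold may differ.

```python
# Fisher-style F-score for ranking software metrics: higher scores indicate
# metrics that better separate defective (y == 1) from clean (y == 0) modules.
import numpy as np

def f_score(X, y):
    pos, neg = X[y == 1], X[y == 0]
    mean_all, mean_pos, mean_neg = X.mean(0), pos.mean(0), neg.mean(0)
    numer = (mean_pos - mean_all) ** 2 + (mean_neg - mean_all) ** 2
    denom = pos.var(0, ddof=1) + neg.var(0, ddof=1)
    return numer / np.where(denom == 0, np.finfo(float).eps, denom)

# Example (illustrative threshold): keep metrics scoring above the mean score.
# scores = f_score(X, y)
# selected = np.flatnonzero(scores > scores.mean())
```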
Proceedings of the 7th India Software Engineering Conference on - ISEC '14, 2014
The quality of a fault prediction model depends on the software metrics that are used to build the prediction model. Feature selection is a process of selecting a subset of relevant features that may lead to improved prediction models. Feature selection techniques can be broadly categorized into two subcategories: feature-ranking and feature-subset selection. In this paper, we present a comparative investigation of seven feature-ranking techniques and eight feature-subset selection techniques for improved fault prediction. The performance of these feature selection techniques is evaluated using two popular machine-learning classifiers, Naive Bayes and Random Forest, over fourteen software projects' fault datasets obtained from the PROMISE data repository. Performance was measured using F-measure and AUC values. Our results demonstrate that feature-ranking techniques produced better results than feature-subset selection techniques. Among the feature-ranking techniques used in the study, InfoGain and PCA provided the best performance over all the datasets, while among the feature-subset selection techniques, ClassifierSubsetEval and Logistic Regression produced better results than their peers.
Symmetry
Feature selection (FS) is a feasible solution for mitigating the high dimensionality problem, and many FS methods have been proposed in the context of software defect prediction (SDP). However, empirical studies on the impact and effectiveness of FS methods on SDP models often lead to contradictory experimental results and inconsistent findings. These contradictions can be attributed to study limitations such as small datasets, limited FS search methods, and unsuitable prediction models in the respective scope of studies. It is hence critical to conduct an extensive empirical study to address these contradictions, guide researchers, and strengthen the scientific validity of experimental conclusions. In this study, we investigated the impact of 46 FS methods using Naïve Bayes and Decision Tree classifiers over 25 software defect datasets from 4 software repositories (NASA, PROMISE, ReLink, and AEEEM). The ensuing prediction models were evaluated based on accuracy and AUC va...
Applied Sciences
Software Defect Prediction (SDP) models are built using software metrics derived from software systems. The quality of SDP models depends largely on the quality of the software metrics (dataset) used to build them. High dimensionality is one of the data quality problems that affect the performance of SDP models. Feature selection (FS) is a proven method for addressing the dimensionality problem. However, the choice of FS method for SDP is still a problem, as most empirical studies on FS methods for SDP produce contradictory and inconsistent quality outcomes. These FS methods behave differently due to different underlying computational characteristics. This could be due to the choice of search method used in FS, because the impact of FS depends on the choice of search method. It is hence imperative to comparatively analyze the performance of FS methods based on different search methods in SDP. In this paper, four filter feature ranking (FFR) and fourteen filter feature su...
Software: Practice and Experience, 2011
The selection of software metrics for building software quality prediction models is a search-based software engineering problem. An exhaustive search for such metrics is usually not feasible due to limited project resources, especially if the number of available metrics is large. Defect prediction models are necessary for aiding project managers in better utilizing valuable project resources for software quality improvement. The efficacy and usefulness of a fault-proneness prediction model are only as good as the quality of the software measurement data. This study focuses on the problem of attribute selection in the context of software quality estimation. A comparative investigation is presented for evaluating our proposed hybrid attribute selection approach, in which feature ranking is first used to reduce the search space, followed by feature subset selection. A total of seven different feature ranking techniques are evaluated, while four different feature subset selection approaches are considered. The models are trained using five commonly used classification algorithms. The case study is based on software metrics and defect data collected from multiple releases of a large real-world software system. The results demonstrate that while some feature ranking techniques performed similarly, the automatic hybrid search algorithm performed the best among the feature subset selection methods. Moreover, the performance of the defect prediction models either improved or remained unchanged when over 85% of the software metrics were eliminated.
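The two-stage hybrid idea (ranking to shrink the search space, then subset selection within the reduced space) can be sketched as below; the mutual-information ranker, the Naive Bayes wrapper, and the cut-offs of 15 and 6 features are illustrative assumptions, not the configuration evaluated in the study.

```python
# Hybrid attribute selection sketch: a cheap ranking step first shrinks the
# search space, then a wrapper-style subset search runs over the survivors.
import numpy as np
from sklearn.feature_selection import (SelectKBest, mutual_info_classif,
                                       SequentialFeatureSelector)
from sklearn.naive_bayes import GaussianNB

def hybrid_selection(X, y, pre_k=15, final_k=6):
    # Stage 1: filter ranking keeps the top pre_k metrics.
    ranker = SelectKBest(mutual_info_classif, k=min(pre_k, X.shape[1])).fit(X, y)
    kept = ranker.get_support(indices=True)

    # Stage 2: wrapper subset selection within the reduced space.
    sfs = SequentialFeatureSelector(GaussianNB(), n_features_to_select=final_k,
                                    direction="forward", cv=3)
    sfs.fit(X[:, kept], y)
    return kept[sfs.get_support(indices=True)]
```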
ECTI Transactions on Computer and Information Technology (ECTI-CIT)
Two primary issues have emerged in the machine learning and data mining community: how to deal with imbalanced data and how to choose appropriate features. These are of particular concern in the software engineering domain, and more specifically in the field of software defect prediction. This research highlights a procedure which includes a feature selection technique to single out relevant attributes, and an ensemble technique to handle the class-imbalance issue. In order to determine the advantages of feature selection and ensemble methods, we look at two potential scenarios: (1) ensemble models constructed from the original datasets, without feature selection; (2) ensemble models constructed from the reduced datasets after feature selection has been applied. Four feature selection techniques are employed: Principal Component Analysis (PCA), Pearson’s correlation, Greedy Stepwise Forward selection, and Information Gain (IG). The aim of this research is to assess the effectiveness of ...
Journal of Physics: Conference Series, 2020
Advances in technology have increased the use and complexity of software. The complexity of software can increase the possibility of defects. Defective software can cause high losses, and fixing it requires a high cost because it can consume up to 50% of the project schedule. Most software developers do not document their work properly, making it difficult to analyse software development history data. Software metrics used in cross-project software defect prediction have many features. Software metrics usually consist of various measurement techniques, so some features may be similar or irrelevant, which can decrease the performance of classifiers. In this study, several feature selection techniques were proposed to select the relevant features. The classification algorithm used is Naive Bayes. Based on analysis using ANOVA, the SBS and SBFS models can significantly improve the performance of the Naïve Bayes model.
TELKOMNIKA Telecommunication Computing Electronics and Control, 2024
In software defect prediction, noisy attributes and high-dimensional data remain a critical challenge. This paper introduces a novel approach known as multi correlation-based feature selection (MCFS), which seeks to address these challenges. MCFS integrates two feature selection techniques, namely correlation-based feature selection (CFS) and correlation matrix based feature selection (CMFS), with the aim of reducing data dimensionality and eliminating noisy attributes. To accomplish this, CFS and CMFS are applied independently to filter the datasets, and a weighted average of their outcomes is computed to determine the optimal feature selection. This approach not only reduces data dimensionality but also mitigates the impact of noisy attributes. To further enhance predictive performance, the paper leverages the particle swarm optimization (PSO) algorithm as a feature selection mechanism, specifically targeting improvements in the area under the curve (AUC). The proposed method is evaluated on 12 benchmark datasets sourced from the NASA metrics data program (MDP) corpus, known for their noisy attributes, high dimensionality, and imbalanced class records. The research findings demonstrate that MCFS outperforms CFS and CMFS, yielding an average AUC value of 0.891, thereby emphasizing its efficacy in advancing classification performance in the context of software defect prediction using k-nearest neighbors (KNN) classification.
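A minimal sketch of the score-fusion step in the spirit of MCFS is given below; the per-feature proxies for CFS and CMFS (feature-target correlation, and feature-target correlation penalised by mean inter-feature correlation), the equal weights, and the top-k cut are illustrative assumptions, and the PSO stage is not shown.

```python
# Fuse two correlation-based feature scores by a weighted average and keep the
# top-k features. The two component scores are simple stand-ins, not the
# paper's exact CFS/CMFS formulations.
import numpy as np

def weighted_correlation_selection(X, y, w=0.5, top_k=10):
    n_feat = X.shape[1]
    target_corr = np.abs([np.corrcoef(X[:, j], y)[0, 1] for j in range(n_feat)])
    inter_corr = np.abs(np.corrcoef(X, rowvar=False))
    mean_redundancy = (inter_corr.sum(axis=1) - 1.0) / (n_feat - 1)

    score_a = target_corr                      # relevance-only score
    score_b = target_corr - mean_redundancy    # relevance penalised by redundancy
    fused = w * score_a + (1.0 - w) * score_b
    return np.argsort(-fused)[:top_k]          # indices of the top-k features
```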