Usage of multiple prediction models based on defect categories

Bora Caglayan*, Ayse Tosun, Andriy Miranskyy, Ayse Bener, Nuzio Ruffolo

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

14 Citations (Scopus)

Abstract

Background: Most defect prediction models are built for two purposes: 1) to detect defective and defect-free modules (binary classification), and 2) to estimate the number of defects (regression analysis). It would also be useful to give more information on the nature of defects so that software managers can plan their testing resources more effectively. Aims: In this paper, we propose a defect prediction model that is based on defect categories. Method: We mined the version history of a large-scale enterprise software product to extract churn and static code metrics, and grouped them into three defect categories according to different testing phases. We built a learning-based model for each defect category. We compared the performance of our proposed model with a general one. We applied statistical techniques to evaluate the relationship between defect categories and software metrics. We also tested our hypothesis by replicating the empirical work on Eclipse data. Results: Our results show that building models that are sensitive to defect categories is cost-effective in the sense that it reveals more information and increases detection rates (pd) by 10% while keeping the false alarms (pf) constant. Conclusions: We conclude that slicing defect data and categorizing it for use in a defect prediction model would enable practitioners to take immediate actions. Our results on the Eclipse replication showed that haphazard categorization of defects is not worth the effort.
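The Results above are stated in terms of pd (probability of detection) and pf (probability of false alarm). As a minimal sketch of how these two rates are conventionally computed from binary predictions, assuming the usual confusion-matrix definitions (pd = TP / (TP + FN), pf = FP / (FP + TN)), the following may help. The function name `pd_pf` and the sample labels are hypothetical illustrations and are not taken from the paper or its data.

```python
def pd_pf(actual, predicted):
    """Return (pd, pf) for binary labels: 1 = defective, 0 = defect-free.

    Assumes the conventional definitions:
      pd = TP / (TP + FN)   share of defective modules that were flagged
      pf = FP / (FP + TN)   share of defect-free modules that were flagged
    """
    pairs = list(zip(actual, predicted))
    tp = sum(1 for a, p in pairs if a == 1 and p == 1)
    fn = sum(1 for a, p in pairs if a == 1 and p == 0)
    fp = sum(1 for a, p in pairs if a == 0 and p == 1)
    tn = sum(1 for a, p in pairs if a == 0 and p == 0)
    # Guard against empty classes so the sketch never divides by zero.
    pd = tp / (tp + fn) if (tp + fn) else 0.0
    pf = fp / (fp + tn) if (fp + tn) else 0.0
    return pd, pf


# Hypothetical labels for ten modules (illustrative only).
actual = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
predicted = [1, 1, 1, 0, 1, 0, 0, 0, 0, 0]
pd_rate, pf_rate = pd_pf(actual, predicted)
print(f"pd={pd_rate:.2f} pf={pf_rate:.2f}")
```

Under these definitions, a category-sensitive model improves on a general one when it raises pd at a comparable pf, which is the comparison the abstract reports.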

Original language: English
Title of host publication: 6th International Conference on Predictive Models in Software Engineering, PROMISE 2010
Publication status: Published - 2010
Externally published: Yes
Event: 6th International Conference on Predictive Models in Software Engineering, PROMISE 2010 - Timisoara, Romania
Duration: 12 Sept 2010 to 13 Sept 2010

Publication series

Name: ACM International Conference Proceeding Series

Conference

Conference: 6th International Conference on Predictive Models in Software Engineering, PROMISE 2010
Country/Territory: Romania
City: Timisoara
Period: 12/09/10 to 13/09/10

Keywords

  • Defect categories
  • Defect prediction
  • Software quality
