Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1109/HASE.2005.4guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Analyzing Software Quality with Limited Fault-Proneness Defect Data

Published: 12 October 2005 Publication History

Abstract

Assuring whether the desir ed software quality and reliability is met for a project is as important as deliveringit within scheduled budget and time. This is especially vital for high-assurance software systems where software failures can have severe consequences. To achieve the desired software quality, practitioners utilize software quality models to identify high-risk program modules: e.g., software quality classification models are built using training data consisting of software measurements and fault-proneness data from previous development experiences similar to the project currently under-development. However, various practical issues can limit availability of fault-proneness data for all modules in the training data, leading to the data consisting of many modules with no fault-proneness data, i.e., unlabeled data. To address this problem, we propose a novel semi-supervised clustering scheme for software quality analysis with limited fault-proneness data. It is a constraint-based semi-supervised clustering scheme based on the k-means algorithm. The proposed approach is investigated with software measurement data of two NASA software projects, JM1 and KC2. Empirical results validate the promise of our semi-supervised clustering technique for software quality modeling and analysis in the presence of limited defect data. Additionally, the approach provides some valuable insight into the characteristics of certain program modules that remain unlabeled subsequent to our semi-supervised clustering analysis.

Cited By

View all
  • (2014)An in-depth study of the potentially confounding effect of class size in fault predictionACM Transactions on Software Engineering and Methodology10.1145/255677723:1(1-51)Online publication date: 20-Feb-2014
  • (2012)Comparing the performance of fault prediction models which report multiple performance measuresProceedings of the 8th International Conference on Predictive Models in Software Engineering10.1145/2365324.2365338(109-118)Online publication date: 21-Sep-2012
  • (2011)Application of K-Medoids with Kd-Tree for Software Fault PredictionACM SIGSOFT Software Engineering Notes10.1145/1943371.194338136:2(1-6)Online publication date: 5-May-2011
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Guide Proceedings
HASE '05: Proceedings of the Ninth IEEE International Symposium on High-Assurance Systems Engineering
October 2005
156 pages
ISBN:0769523773

Publisher

IEEE Computer Society

United States

Publication History

Published: 12 October 2005

Author Tags

  1. k-means
  2. semi-supervised clustering
  3. software faults
  4. software measurements
  5. software quality

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 20 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2014)An in-depth study of the potentially confounding effect of class size in fault predictionACM Transactions on Software Engineering and Methodology10.1145/255677723:1(1-51)Online publication date: 20-Feb-2014
  • (2012)Comparing the performance of fault prediction models which report multiple performance measuresProceedings of the 8th International Conference on Predictive Models in Software Engineering10.1145/2365324.2365338(109-118)Online publication date: 21-Sep-2012
  • (2011)Application of K-Medoids with Kd-Tree for Software Fault PredictionACM SIGSOFT Software Engineering Notes10.1145/1943371.194338136:2(1-6)Online publication date: 5-May-2011
  • (2008)ReferencesDependability metrics10.5555/1806170.1806202(267-300)Online publication date: 1-Jan-2008
  • (2008)An extension of fault-prone filtering using precise training and a dynamic thresholdProceedings of the 2008 international working conference on Mining software repositories10.1145/1370750.1370772(89-98)Online publication date: 10-May-2008
  • (2008)Prediction of Fault-Prone Software Modules Using a Generic Text DiscriminatorIEICE - Transactions on Information and Systems10.1093/ietisy/e91-d.4.888E91-D:4(888-896)Online publication date: 1-Apr-2008
  • (2007)Training on errors experiment to detect fault-prone software modules by spam filterProceedings of the the 6th joint meeting of the European software engineering conference and the ACM SIGSOFT symposium on The foundations of software engineering10.1145/1287624.1287683(405-414)Online publication date: 7-Sep-2007

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media