DOI: 10.1109/ESEM.2011.34
Article

How to Find Relevant Data for Effort Estimation?

Published: 22 September 2011

Abstract

Background: Building effort estimators requires training data. How can we find that data? It is tempting to cross the boundaries of development type, location, language, application and hardware to use the existing datasets of other organizations. However, prior results caution that such cross data may not be useful. Aim: We test two conjectures: (1) instance selection can automatically prune irrelevant instances, and (2) retrieval from the remaining examples is useful for effort estimation, regardless of their source. Method: We selected 8 cross-within divisions (21 pairs of within-cross subsets) out of 19 datasets and evaluated these divisions under different analogy-based estimation (ABE) methods. Results: Between the within and cross experiments, there were few statistically significant differences in (i) the performance of effort estimators, or (ii) the number of instances retrieved for estimation. Conclusion: For the purposes of effort estimation, there is little practical difference between cross and within data. After applying instance selection, the remaining examples (be they from within or from cross source divisions) can be used for effort estimation.
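To make the idea of analogy-based estimation concrete, here is a minimal sketch of a plain k-NN estimator over a pooled set of within and cross projects. It is not the paper's method: the abstract does not specify the ABE variants or the instance-selection procedure, and no instance selection is implemented here. The function names (`make_scaler`, `abe_estimate`), the two-feature project records, the choice of min-max scaling with Euclidean distance, and the use of the median as the adaptation step are all assumptions made for illustration, and the data are invented.

```python
import math
from statistics import median


def make_scaler(rows):
    """Return a function that min-max scales a feature row to [0, 1].

    Constant columns are mapped to 0.0 to avoid division by zero.
    """
    cols = list(zip(*rows))
    lo = [min(c) for c in cols]
    hi = [max(c) for c in cols]

    def scale(row):
        return [(v - l) / (h - l) if h > l else 0.0
                for v, l, h in zip(row, lo, hi)]

    return scale


def abe_estimate(train, query, k=3):
    """Estimate effort as the median effort of the k nearest projects.

    train: list of (features, effort, source) tuples (hypothetical layout)
    query: feature list of the project to estimate
    Returns (estimate, neighbors), so the caller can inspect where the
    retrieved analogies came from.
    """
    scale = make_scaler([f for f, _, _ in train] + [query])
    q = scale(query)
    # Rank training projects by Euclidean distance in the scaled space.
    ranked = sorted(train, key=lambda p: math.dist(scale(p[0]), q))
    neighbors = ranked[:k]
    return median(e for _, e, _ in neighbors), neighbors


# Invented toy data: (features, effort, source).
projects = [
    ([10, 2], 100.0, "within"),
    ([12, 3], 120.0, "within"),
    ([50, 8], 900.0, "within"),
    ([11, 2], 110.0, "cross"),
    ([48, 9], 950.0, "cross"),
]

estimate, neighbors = abe_estimate(projects, [11, 3], k=3)
sources = sorted(src for _, _, src in neighbors)
print(estimate, sources)
```

Returning the neighbors alongside the estimate makes it easy to count how many retrieved analogies are within or cross instances, which is the kind of quantity the Results sentence compares.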


Published In

ESEM '11: Proceedings of the 2011 International Symposium on Empirical Software Engineering and Measurement
September 2011
473 pages
ISBN: 9780769546049

Publisher

IEEE Computer Society

United States

Author Tags

  1. cross resource
  2. k-NN
  3. software cost estimation
  4. within resource

Qualifiers

  • Article

Cited By

  • (2019) Applying Cross Project Defect Prediction Approaches to Cross-Company Effort Estimation. Proceedings of the Fifteenth International Conference on Predictive Models and Data Analytics in Software Engineering, pp. 76-79. DOI: 10.1145/3345629.3345638. Online publication date: 18-Sep-2019
  • (2019) Experience. Journal of Data and Information Quality 11:4, pp. 1-38. DOI: 10.1145/3328746. Online publication date: 19-Aug-2019
  • (2018) Data-driven search-based software engineering. Proceedings of the 15th International Conference on Mining Software Repositories, pp. 341-352. DOI: 10.1145/3196398.3196442. Online publication date: 28-May-2018
  • (2017) Research patterns and trends in software effort estimation. Information and Software Technology 91:C, pp. 1-21. DOI: 10.1016/j.infsof.2017.06.002. Online publication date: 1-Nov-2017
  • (2016) Too much automation? the bellwether effect and its implications for transfer learning. Proceedings of the 31st IEEE/ACM International Conference on Automated Software Engineering, pp. 122-131. DOI: 10.1145/2970276.2970339. Online publication date: 25-Aug-2016
  • (2016) Pareto efficient multi-objective optimization for local tuning of analogy-based estimation. Neural Computing and Applications 27:8, pp. 2241-2265. DOI: 10.1007/s00521-015-2004-y. Online publication date: 1-Nov-2016
  • (2015) An empirical evaluation of ensemble adjustment methods for analogy-based effort estimation. Journal of Systems and Software 103:C, pp. 36-52. DOI: 10.1016/j.jss.2015.01.028. Online publication date: 1-May-2015
  • (2014) The potential benefit of relevance vector machine to software effort estimation. Proceedings of the 10th International Conference on Predictive Models in Software Engineering, pp. 52-61. DOI: 10.1145/2639490.2639510. Online publication date: 17-Sep-2014
  • (2013) Better cross company defect prediction. Proceedings of the 10th Working Conference on Mining Software Repositories, pp. 409-418. DOI: 10.5555/2487085.2487161. Online publication date: 18-May-2013
  • (2013) Data science for software engineering. Proceedings of the 2013 International Conference on Software Engineering, pp. 1484-1486. DOI: 10.5555/2486788.2487048. Online publication date: 18-May-2013
