Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1109/ICDM.2005.126guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Ranking-Based Evaluation of Regression Models

Published: 27 November 2005 Publication History

Abstract

We suggest the use of ranking-based evaluation measures for regression models, as a complement to the commonly used residual-based evaluation. We argue that in some cases, such as the case study we present, ranking can be the main underlying goal in building a regression model, and ranking performance is the correct evaluation metric. However, even when ranking is not the contextually correct performance metric, the measures we explore still have significant advantages: They are robust against extreme outliers in the evaluation set; and they are interpretable. The two measures we consider correspond closely to non-parametric correlation coefficients commonly used in data analysis (Spearman's ρ and Kendall's τ); and they both have interesting graphical representations, which, similarly to ROC curves, offer useful "partial" model performance views, in addition to a one-number summary in the area under the curve. We illustrate our methods on a case study of evaluating IT Wallet size estimation models for IBM's customers.

References

[1]
J. Bi and K. P. Bennett. Regression error characteristic curves. In Proceedings of ICML-03, 2003.
[2]
A. P. Bradley. The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recognition, 30(7):1145-1159, 1997.
[3]
J. Egan. Signal Detection Theory and ROC Analysis. Academic Press, 1975.
[4]
R. Garland. Share of wallet's role in customer profitability. Journal of Financial Services Marketing, 8(3):259-268, March 2004.
[5]
F. R. Hampel, E. M. Ronchetti, P. J. Rousseeuw, and W. A. Stahel. Robust Statistics: The Approach Based on Influence Functions. Wiley & Sons, 1986.
[6]
M. Kendall and J. M. Gibbons. Rank Correlation Methods. Edward Arnold, 1990.
[7]
N. E. Noether. Elements of Nonparametric Statistics. Wiley and Sons, 1967.
[8]
V. Vapnik. The Nature of Statistical Learning Theory. Springer, 1995.

Cited By

View all
  • (2017)Using bad learners to find good configurationsProceedings of the 2017 11th Joint Meeting on Foundations of Software Engineering10.1145/3106237.3106238(257-267)Online publication date: 21-Aug-2017

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Guide Proceedings
ICDM '05: Proceedings of the Fifth IEEE International Conference on Data Mining
November 2005
837 pages
ISBN:0769522785

Publisher

IEEE Computer Society

United States

Publication History

Published: 27 November 2005

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 16 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2017)Using bad learners to find good configurationsProceedings of the 2017 11th Joint Meeting on Foundations of Software Engineering10.1145/3106237.3106238(257-267)Online publication date: 21-Aug-2017

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media