Complexity measures of supervised classification problems
TK Ho, M Basu - IEEE transactions on pattern analysis and …, 2002 - ieeexplore.ieee.org
TK Ho, M Basu
IEEE transactions on pattern analysis and machine intelligence, 2002•ieeexplore.ieee.orgWe studied a number of measures that characterize the difficulty of a classification problem,
focusing on the geometrical complexity of the class boundary. We compared a set of real-
world problems to random labelings of points and found that real problems contain
structures in this measurement space that are significantly different from the random sets.
Distributions of problems in this space show that there exist at least two independent factors
affecting a problem's difficulty. We suggest using this space to describe a classifier's domain …
focusing on the geometrical complexity of the class boundary. We compared a set of real-
world problems to random labelings of points and found that real problems contain
structures in this measurement space that are significantly different from the random sets.
Distributions of problems in this space show that there exist at least two independent factors
affecting a problem's difficulty. We suggest using this space to describe a classifier's domain …
We studied a number of measures that characterize the difficulty of a classification problem, focusing on the geometrical complexity of the class boundary. We compared a set of real-world problems to random labelings of points and found that real problems contain structures in this measurement space that are significantly different from the random sets. Distributions of problems in this space show that there exist at least two independent factors affecting a problem's difficulty. We suggest using this space to describe a classifier's domain of competence. This can guide static and dynamic selection of classifiers for specific problems as well as subproblems formed by confinement, projection, and transformations of the feature vectors.
ieeexplore.ieee.org