Abstract
In this paper we show that Rank Distance Aggregation can improve ensemble classifier precision in the classical text categorization task by presenting a series of experiments done on a 20 class newsgroup corpus, with a single correct class per document. We aggregate four established document classification methods (TF-IDF, Probabilistic Indexing, Naive Bayes and KNN) in different training scenarios, and compare these results to widely used fixed combining rules such as Voting, Min, Max, Sum, Product and Median.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Dinu, L.P.: On the classification and aggregation of hierarchies with difererent constitutive elements. Fundamenta Informaticae 55(1), 39–50 (2002)
Dinu, L.P., Manea, F.: An eficient approach for the rank aggregation problem. Theoretical Computer Science 359(1), 455–461 (2006)
Dinu, L.P., Popescu, M.: A multi-criteria decision method based on rank distance. Fundamenta Informaticae 86(1-2), 79–91 (2008)
Duin, R.P.W., Tax, D.M.J.: Experiments with classifier combining rules. In: Kittler, J., Roli, F. (eds.) MCS 2000. LNCS, vol. 1857, pp. 16–29. Springer, Heidelberg (2000)
Duin, R.P.W.: The combining classifier: To train or not to train? Pattern Recognition 2 (2002)
Fukuda, K., Matsui, T.: Finding all minimum cost perfect matchings in bipartite graphs. Networks 22, 461–468 (1992)
Manea, F., Ploscaru, C.: A generalization of the assignment problem, and its application to the rank aggregation problem. Fundamenta Informaticae 81(4), 459–471 (2007)
McCallum, A.K.: Bow: A toolkit for statistical language modeling, text retrieval, classification and clustering (1996), http://www.cs.cmu.edu/~mccallum/bow
Polikar, R.: Ensemble based systems in decision making. IEEE Circuits and Systems Magazine 6(3), 21–45 (2006)
Roli, F., Fumera, G.: Fixed and trained combiners for fusion of imbalanced pattern classifiers. In: Proc. 5th Int. Conf. on Information Fusion, pp. 278–284 (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Dinu, L.P., Rusu, A. (2010). Rank Distance Aggregation as a Fixed Classifier Combining Rule for Text Categorization. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2010. Lecture Notes in Computer Science, vol 6008. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12116-6_54
Download citation
DOI: https://doi.org/10.1007/978-3-642-12116-6_54
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-12115-9
Online ISBN: 978-3-642-12116-6
eBook Packages: Computer ScienceComputer Science (R0)