Abstract
Double bagging is a parallel ensemble method in which an additional classifier is trained on the out-of-bag samples of each replicate; the posterior class probabilities produced by this additional classifier are then appended to the in-bag samples as extra features, and a decision tree is trained on the augmented data. The subsampled version of double bagging depends on two hyperparameters: the subsample ratio (SSR) and the choice of additional classifier. In this paper we propose an embedded cross-validation based selection technique that chooses one of these parameters automatically. During the training phase, the technique builds an ensemble for each candidate value of one parameter (keeping the other fixed) and finally selects the value yielding the highest accuracy. We consider four additional classifiers, the radial basis function support vector machine (RSVM), the linear support vector machine (LSVM), and the nearest neighbor classifiers 5-NN and 10-NN, together with five subsample ratios: 0.1, 0.2, 0.3, 0.4 and 0.5. We report the performance of the subsampled double bagging ensemble for each combination of SSR and additional classifier on UCI benchmark datasets. The results indicate that LSVM is the most effective additional classifier for enhancing the predictive power of double bagging, and that SSRs of 0.4 and 0.5 yield better performance than the smaller ratios. We also compare the resulting ensembles with bagging, AdaBoost, the original double bagging and rotation forest; the experimental results show that the subsampled double bagging ensemble outperforms these methods.
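To make the procedure concrete, the following is a minimal Python sketch of subsampled double bagging with the embedded cross-validation selection described above. It assumes scikit-learn-style estimators and NumPy arrays; the class and function names (DoubleBagging, select_by_embedded_cv), the fold count, and the ensemble size are illustrative choices, not taken from the paper's implementation.

import numpy as np
from sklearn.base import clone
from sklearn.svm import SVC
from sklearn.neighbors import KNeighborsClassifier
from sklearn.tree import DecisionTreeClassifier
from sklearn.model_selection import StratifiedKFold
from sklearn.metrics import accuracy_score


class DoubleBagging:
    """Subsampled double bagging: each member fits the additional
    classifier on the out-of-bag samples and a decision tree on the
    in-bag samples augmented with that classifier's class probabilities."""

    def __init__(self, extra_clf, ssr=0.4, n_members=50, seed=0):
        self.extra_clf, self.ssr = extra_clf, ssr
        self.n_members, self.seed = n_members, seed

    def fit(self, X, y):
        rng = np.random.default_rng(self.seed)
        self.members_, self.classes_ = [], np.unique(y)
        n = len(X)
        for _ in range(self.n_members):
            # Subsample an in-bag set of size SSR * n without replacement;
            # the remaining samples form the out-of-bag set.
            inbag = rng.choice(n, size=max(2, int(self.ssr * n)), replace=False)
            oob = np.setdiff1d(np.arange(n), inbag)
            extra = clone(self.extra_clf).fit(X[oob], y[oob])
            # Augment the in-bag features with the posterior class
            # probabilities of the additional classifier.
            aug = np.hstack([X[inbag], extra.predict_proba(X[inbag])])
            tree = DecisionTreeClassifier().fit(aug, y[inbag])
            self.members_.append((extra, tree))
        return self

    def predict(self, X):
        # Majority vote over the member trees, each seeing the test
        # features augmented by its own additional classifier.
        votes = np.zeros((len(X), len(self.classes_)))
        for extra, tree in self.members_:
            pred = tree.predict(np.hstack([X, extra.predict_proba(X)]))
            for c, cls in enumerate(self.classes_):
                votes[:, c] += pred == cls
        return self.classes_[votes.argmax(axis=1)]


def select_by_embedded_cv(candidates, X, y, n_folds=5, **db_kwargs):
    """Build one ensemble per candidate additional classifier, estimate
    its accuracy by cross-validation on the training data, and refit the
    best configuration on the full training set. Selecting among SSRs
    with a fixed classifier works the same way."""
    best, best_acc = None, -1.0
    for extra in candidates:
        accs = []
        for tr, te in StratifiedKFold(n_folds, shuffle=True, random_state=0).split(X, y):
            model = DoubleBagging(extra, **db_kwargs).fit(X[tr], y[tr])
            accs.append(accuracy_score(y[te], model.predict(X[te])))
        if np.mean(accs) > best_acc:
            best, best_acc = extra, np.mean(accs)
    return DoubleBagging(best, **db_kwargs).fit(X, y)


# The four additional classifiers considered in the paper.
candidates = [SVC(kernel="rbf", probability=True),     # RSVM
              SVC(kernel="linear", probability=True),  # LSVM
              KNeighborsClassifier(5),                 # 5-NN
              KNeighborsClassifier(10)]                # 10-NN

# Example (X_train, y_train are the user's NumPy arrays):
# model = select_by_embedded_cv(candidates, X_train, y_train, ssr=0.4, n_members=25)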
References
Blake, C.L., Merz, C.J.: UCI Repository of Machine Learning Databases, http://www.ics.uci.edu/mlearn/MLRepository.html
Breiman, L.: Bagging predictors. Machine Learning 24(2), 123–140 (1996a)
Breiman, L.: Out-of-bag estimation. Technical Report, Statistics Department, University of California, Berkeley (1996b)
Breiman, L.: Random Forests. Machine Learning 45(1), 5–32 (2001)
Caruana, R., Niculescu-Mizil, A., Crew, G., Ksikes, A.: Ensemble selection from libraries of models. In: Proceedings of the 21st Int’l Conf. on Machine Learning (2004)
Caruana, R., Niculescu-Mizil, A.: Getting the most out of ensemble selection. In: Proceedings of the Int’l Conf. on Data Mining, ICDM (2006)
Demšar, J.: Statistical comparisons of classifiers over multiple datasets. J. Mach. Learn. Research 7, 1–30 (2006)
Dietterich, T.G.: Machine-learning research: Four current directions. AI Magazine 18(4), 97–136 (1997)
Freund, Y., Schapire, R.: Experiments with a new boosting algorithm. In: Machine Learning: Proceedings of the Thirteenth International Conference, pp. 148–156. Morgan Kaufmann, San Francisco (1996)
Freund, Y., Schapire, R.: A decision-theoretic generalization of online learning and an application to boosting. J. Comput. System Sci. 55, 119–139 (1997)
Hastie, T., Tibshirani, R., Friedman, J.: The elements of statistical learning: data mining, inference and prediction. Springer, New York (2001)
Hothorn, T., Lausen, B.: Double-bagging: combining classifiers by bootstrap aggregation. Pattern Recognition 36(6), 1303–1309 (2003)
Hothorn, T., Lausen, B.: Bundling classifiers by bagging trees. Comput. Statist. Data Anal. 49, 1068–1078 (2005)
Kuncheva, L.I.: Combining Pattern Classifiers: Methods and Algorithms. Wiley-Interscience, Hoboken (2004)
Rodríguez, J., Kuncheva, L.: Rotation forest: A new classifier ensemble method. IEEE Trans. Patt. Analys. Mach. Intell. 28(10), 1619–1630 (2006)
Zaman, F., Hirose, H.: Double SVMbagging: A subsampling approach to SVM ensemble. To appear in: Intelligent Automation and Computer Engineering. Springer, Heidelberg (2009)
Zaman, F., Hirose, H.: Effect of Subsampling Rate on Subbagging and Related Ensembles of Stable Classifiers. In: Chaudhury, S., et al. (eds.) PReMI 2009. LNCS, vol. 5909, pp. 44–49. Springer, Heidelberg (2009)
Zaman, F., Hirose, H.: A Comparative Study on the Performance of Several Ensemble Methods with Low Subsampling Ratio. In: 2nd Asian Conference on Intelligent Information and Database Systems, ACIIDS 2010 (2010)
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
Cite this paper
Faisal, Z., Uddin, M.M., Hirose, H. (2010). On Selecting Additional Predictive Models in Double Bagging Type Ensemble Method. In: Taniar, D., Gervasi, O., Murgante, B., Pardede, E., Apduhan, B.O. (eds) Computational Science and Its Applications – ICCSA 2010. ICCSA 2010. Lecture Notes in Computer Science, vol 6019. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12189-0_18
DOI: https://doi.org/10.1007/978-3-642-12189-0_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-12188-3
Online ISBN: 978-3-642-12189-0