Abstract
Data mining schemes, or workflows, are collections of interconnected machine learning models, including preprocessing procedures, and ensembles methods combinations. The proposal of data mining schemes for a task at hand has always been a task for experienced data scientists. We will study generating and testing workflows by automated procedures. Two representations of data mining schemes are used in this paper – a linear one, and a one based on direct acyclic graphs. Efficient procedures for generating schemes are presented and evaluated by testing the generated schemes on real data.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Wolpert, D.H., Macready, W.G.: No free lunch theorems for optimization. IEEE Trans. Evol. Comput. 1(1), 67–82 (1997)
Brazdil, P., Giraud-Carrier, C.G., Soares, C., Vilalta, R.: Metalearning – Applications to Data Mining. Cognitive Technologies. Springer, Heidelberg (2009)
Clarke, B., Fokoue, E., Zhang, H.H.: Principles and Theory for Data Mining and Machine Learning. Springer Series in Statistics. Springer, Heidelberg (2009)
Neruda, R., Beuster, G.: Toward dynamic generation of computational agents by means of logical descriptions. Int. Trans. Syst. Sci. Appl., 139–144 (2008)
Bache, K., Lichman, M.: UCI machine learning repository (2013)
Kazík, O., Neruda, R.: Data mining process optimization in computational multi-agent systems. In: Cao, L., Zeng, Y., An, B., Symeonidis, A.L., Gorodetsky, V., Coenen, F., Yu, P.S. (eds.) ADMI 2014. LNCS (LNAI), vol. 9145, pp. 93–103. Springer International Publishing, Cham (2015). doi:10.1007/978-3-319-20230-3_8
Pešková, K., Šmíd, J., Pilát, M., Kazík, O., Neruda, R.: Hybrid multi-agent system for metalearning in data mining. In: Vanschoren, J., Brazdil, P., Soares, C., Kotthoff, L. (eds.) Proceedings of the MetaSel@ECAI 2014. CEUR Workshop Proceedings, vol. 1201, pp. 53–54. CEUR-WS.org (2014)
Acknowledgment
This work was supported by the Czech Science Foundation project no. P103-15-19877S. and the institutional support of the Institute of Computer Science, Czech Academy of Sciences RVO 67985807.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing AG
About this paper
Cite this paper
Neruda, R. (2016). Search Techniques for Automated Proposal of Data Mining Schemes. In: Figueroa-García, J., López-Santana, E., Ferro-Escobar, R. (eds) Applied Computer Sciences in Engineering. WEA 2016. Communications in Computer and Information Science, vol 657. Springer, Cham. https://doi.org/10.1007/978-3-319-50880-1_8
Download citation
DOI: https://doi.org/10.1007/978-3-319-50880-1_8
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-50879-5
Online ISBN: 978-3-319-50880-1
eBook Packages: Computer ScienceComputer Science (R0)