Abstract
Smart Grids are critical infrastructure networks. They play a critical role in the survival of our postmodern economies, as all other areas depend on their availability. An interruption in their operation may have a direct impact on the availability of other services (e.g., health, transportation). The problem is particularly intense when no backup networks are available, or the required recovery time is beyond backup autonomy. The transition to a decentralized management and control system for these networks requires digital technologies, advanced interconnected system communications, and Internet access. These technologies expose critical infrastructure networks to external threats that require careful assessment of cyber-security risks and appropriate countermeasures. An important factor that enhances the range of threats is the heterogeneity of Smart Grids, which incorporate industrial control systems such as the SCADA, distributed control system, and programmable logic controllers to which security improvements may not have been made since they were installed. Υet, another serious problem arises from the fact that older technologies were designed at times when cyber-security was not part of their technical design specifications. At the same time, it should be seriously considered that many of the systems of these networks that can be cyber-attacked may not be easily disconnected, as this could potentially lead to generalized operational problems. In this scientific research, a sophisticated active security framework is proposed, which is based solely on advanced computational intelligence methods and concerns the digital security of critical infrastructure networks. Specifically, this research introduces a sophisticated adaptive ensemble classification framework for real-time data streams by distributed control systems. It is a “Kappa” architecture framework that is based on a two-step online ensemble learning model based on bagging and boosting methods. The aim is performance of real-time analysis and evaluation of data flows from Smart Grids, toward the effective identification of APT attacks.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Babu B, Ijyas T, Muneer P, Varghese J (2017) Security issues in SCADA based industrial control systems. In: 2017 2nd international conference on anti-cyber crimes (ICACC), Abha, 2017, pp 47–51. https://doi.org/10.1109/anti-cybercrime.2017.7905261
Haq EU, Xu H, Pan L, Khattak MI Smart grid security: threats and solutions. In: 2017 13th international conference on semantics, knowledge and grids (SKG), Beijing, China, 2017, pp 188–193. https://doi.org/10.1109/skg.2017.00039
Tan S, De D, Song WZ, Yang J, Das SK (2017) Survey of security advances in smart grid: a data driven approach. IEEE Commun Surv Tutor 19(1):397–422. https://doi.org/10.1109/comst.2016.2616442
Raj VS, Chezhian RM, Mrithulashri M (2014) Advanced persistent threats & recent high profile cyber threat encounters. Int J Innovative Res Comput Commun Eng (An ISO 3297: 2007 Certified Organization) 2(1)
Virvilis N, Gritzalis D, Apostolopoulos T (2013) Trusted computing vs. advanced persistent threats: can a defender win this game?, In: Proceedings of 10th IEEE international conference on autonomic and trusted computing (ATC-2013), IEEE Press, Italy, pp 396–403
www.damballa.com. Accessed 5 Nov 2019
www.crowdstrike.com. Accessed 5 Nov 2019
Aretz K, Bartram SM, Pope PF (2011) Asymmetric loss functions and the rationality of expected stock returns. Int J Forecast 27(2):413–437. https://doi.org/10.1016/j.ijforecast.2009.10.008.SSRN889323
Kushner H, Yin GG (1997) Stochastic approximation algorithms and applications. Springer, New York 2nd ed., titled Stochastic approximation and recursive algorithms and applications, 2003. ISBN: 0-387-00894-2. ISBN 0-387-94916-X
Kurlej B, Wozniak M (2012) Active learning approach to concept drift problem. Logic J IGPL 20(3):550–559. https://doi.org/10.1093/jigpal/jzr011
Kiran M, Murphy P, Monga I, Dugan J, Baveja SS, Lambda architecture for cost-effective batch and speed big data processing. In: 2015 IEEE International conference on big data (big data), Santa Clara, CA, 2015, pp 2785–2792. https://doi.org/10.1109/bigdata.2015.7364082
Yamato Y, Kumazaki H, Fukumoto Y (2016) Proposal of Lambda architecture adoption for real time predictive maintenance. In: 2016 fourth international symposium on computing and networking (CANDAR), pp 713–715. https://doi.org/10.1109/CANDAR.2016.0130
Lin J (2017) The lambda and the kappa. IEEE Int Comput 21(5):60–66. https://doi.org/10.1109/MIC.2017.3481351
Breiman L (2001) Random forests. Mach Learn 45(1):5–32
Gomes HM, Bifet A, Read J, Barddal JP, Enembreck F, Pfharinger B, Holmes G, Abdessalem T (2017) Adaptive random forests for evolving data stream classification. Mach Learn. https://doi.org/10.1007/s10994-017-5642-8
Zhou ZH (2012) Ensemble methods foundations and algorithms. CRC Press, Boca Raton
Kuncheva L (2004) Combining pattern classifiers: methods and algorithms. Wiley, Hoboken
Bonab HR, Can F (2016) A theoretical framework on the ideal number of classifiers for online ensembles in data streams. In: CIKM. ACM, USA, p 2053
Dietterich TG (2001) Ensemble methods in machine learning. In: Kittler J, Roli F (eds) Multiple classifier systems. LNCS vol 1857. Springer, Cham, pp 1–15
Webb GI, Zheng Z (2004) Multistrategy ensemble learning: Reducing error by combining ensemble learning techniques. IEEE Trans Knowl Data Eng 16(8):980–991. https://doi.org/10.1007/s00521-016-2591-2
Tsoumakas G, Angelis L, Vlahavas IP (2005) Selective fusion of heterogeneous classifiers. Intell Data Anal 9(6):511–525
Strutz T (2010) Data fitting and uncertainty (a practical introduction to weighted least squares and beyond). Vieweg + Teubner, Berlin. ISBN 978-3-8348-1022-9
Brzezinski D, Stefanowski J (2014) Combining block-based and online methods in learning ensembles from concept drifting data streams. Inf Sci 265:50–67
Barddal JP, Gomes HM, Enembreck F (2015) SNCStream: a social network-based data stream clustering algorithm. In: Proceedings of the 30th annual ACM symposium on applied computing, SAC’15. ACM, New York, NY, pp 935–940
Parker BS, Khan L (2015). Detecting and tracking concept class drift and emergence in non-stationary fast data streams. In: Twenty-ninth AAAI conference on artificial intelligence
Bifet A, Holmes G, Pfahringer B, Kirkby R, Gavaldà R (2009). New ensemble methods for evolving data streams. In: Proceedings of the 15th ACM SIGKDD international conference on knowledge discovery and data mining. ACM SIGKDD, pp 139–148
Bifet A, Holmes G, Pfahringer B (2010) Leveraging bagging for evolving data streams. In: PKDD, pp 135–150
Baena-Garcia M, del Campo-Avila J, Fidalgo R, Bifet A, Gavalda R, Morales-Bueno R (2006) Early drift detection method. In: ECML PKDD 2006 workshop on knowledge discovery from data streams
Gomes HM, Enembreck F (2014) Sae2: advances on the social adaptive ensemble classifier for data streams. In: Proceedings of the 29th annual ACM symposium on applied computing (SAC), SAC 2014. ACM, pp 199–206
Pan S, Morris T, Adhikari U (2014) Developing a hybrid intrusion detection system using data mining for power systems. IEEE Trans Smart Grid. https://doi.org/10.1109/tsg.2015.2409775
Pan S, Morris T, Adhikari U (2015) Classification of disturbances and cyber-attacks in power systems using heterogeneous time-synchronized data. IEEE Trans Ind Inf. https://doi.org/10.1109/tii.2015.2420951
Pan S, Morris T, Adhikari U (2015) A specification-based intrusion detection framework for cyber-physical environment in electric power system. Int J Netw Secur (IJNS) 17(2):174–188
Beaver J, Borges R, Buckner M, Morris T, Adhikari U, Pan S (2014) Machine learning for power system disturbance and cyber-attack discrimination. In: Proceedings of the 7th international symposium on resilient control systems, August 19–21, 2014, Denver, CO, USA
Dodge Y (2003) The oxford dictionary of statistical terms. OUP, Oxford (entry for normalization of scores). ISBN: 0-19-920613-9
Zwillinger D, Kokoska S (2000) CRC standard probability and statistics tables and formulae. CRC Press, Boca Raton, p 18. ISBN: 1-58488-059-7
Žliobaitė I, Bifet A, Read J et al (2015) Evaluation methods and decision theory for classification of streaming data with temporal dependence. Mach Learn 98(3):455–482
Vinagre J, Jorge AM, Gama J (2014) Evaluation of recommender systems in streaming environments. In: Workshop on recommender systems evaluation: dimensions and design (REDD 2014), held in conjunction with RecSys. October 10, 2014, Silicon Valley, United States https://doi.org/10.13140/2.1.4381.5367
Demertzis K, Iliadis L, Spartalis S (2017) A spiking one-class anomaly detection framework for cyber-security on industrial control systems. In: Boracchi G, Iliadis L, Jayne C, Likas A (eds) Engineering applications of neural networks. EANN 2017. Communications in Computer and Information Science, vol 744. Springer, Cham
Cruz T, Proença J, Simões P, Aubigny M, Ouedrago M, Graziano A, Yasakhetu L (2014) Improving cyber-security awareness on industrial control systems: the CockpitCI approach. J Inf Warf 13(4):27–41
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Sufang, W. An adaptive ensemble classification framework for real-time data streams by distributed control systems. Neural Comput & Applic 32, 4139–4149 (2020). https://doi.org/10.1007/s00521-020-04759-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-020-04759-0