One-Dimensional Convolutional Auto-Encoder for Predicting Furnace Blowback Events from Multivariate Time Series Process Data—A Case Study
<p>Illustration of PCA-data reconstruction [<a href="#B20-minerals-11-01106" class="html-bibr">20</a>]. A new data point (green star) is projected onto the first principal component (blue arrow), yielding the projected data point (dark crimson star). The difference between the new data point and the projected data point is the reconstruction error.</p> "> Figure 2
<p>Typical AE architecture, with input (yellow), encoding (blue), bottleneck (red), decoding (purple), and output (green) layers [<a href="#B20-minerals-11-01106" class="html-bibr">20</a>].</p> "> Figure 3
<p>Simple 2-dimensional CAE with one convolutional filter per layer [<a href="#B20-minerals-11-01106" class="html-bibr">20</a>]. Shaded areas represent the subsets of each layer output used as receptive field for subsequent convolutional filters. In the example, the first convolutional filter maps nine input features to a single feature in the first convolved layer (red blocks), the second convolutional filter maps four features to a single feature in the second convolved layer (green blocks), and the deconvolutional filter maps a single feature into eight separate features in the reconstructed pattern (blue blocks).</p> "> Figure 4
<p>Submerged arc furnace model layout, with distinct bulk and smelting concentrate, liquid slag and matte, trapped reaction gas, cooling water, and freeboard zones. Each zone is modelled as a separate lumped parameter system.</p> "> Figure 5
<p>Illustration of how increasing the concentrate bed thickness (blue) causes PPEs, where the freeboard gauge pressure (red) becomes positive [<a href="#B20-minerals-11-01106" class="html-bibr">20</a>]. A PPE-causing fault is introduced at 2 days of simulated operation; during this time the bed thickness is maintained at levels where PPEs occur when the bed ruptures.</p> "> Figure 6
<p>Illustration of the simulated process dataset partitioned into separate training (gold) and testing (red) datasets. The dashed blue line indicates zero gauge pressure. Note that all variables in <math display="inline"><semantics> <mrow> <msub> <mstyle mathvariant="bold" mathsize="normal"> <mi>X</mi> </mstyle> <mi>t</mi> </msub> </mrow> </semantics></math> are partitioned in the same way, not just the freeboard pressure.</p> "> Figure 7
<p>Illustration of the ground truth of the simulated data w.r.t. the presence of blowback-causing faults. Fault-free observations are found in the area shaded in gold. Faulty observations are found in the area shaded red. Unshaded areas contain blowback-causing faults but sounding the alarm here would either be redundant due to blowbacks already occurring or would not provide sufficient warning before the blowback. The dashed blue line indicates zero gauge pressure.</p> "> Figure 8
<p>Illustration of online event prediction. After an event is predicted, an event is assumed to occur within the prediction period (red arrow). The prediction is valid if an event occurs within this period. The prediction should provide a minimum warning period (blue arrow) for plant operators to prepare for the event. Only warnings given in the gold shaded area will be both valid and provide plant operators with sufficient time to prepare for the event. (1) Invalid prediction as no fault occurs within the prediction period, (2) valid prediction, (3) invalid prediction as the minimum warning period is exceeded.</p> "> Figure 9
<p>Illustration of observations selected for the target dataset, <math display="inline"><semantics> <mrow> <msub> <mstyle mathvariant="bold" mathsize="normal"> <mi>X</mi> </mstyle> <mi>t</mi> </msub> </mrow> </semantics></math>. <math display="inline"><semantics> <mrow> <msub> <mstyle mathvariant="bold" mathsize="normal"> <mi>X</mi> </mstyle> <mi>t</mi> </msub> </mrow> </semantics></math> is constructed from observations in the gold-shaded region, but event-preceding patterns may still be present outside this window (red shaded area). The dashed blue line indicates zero gauge pressure.</p> "> Figure 10
<p>Illustration of a lagged AE network architecture. A lagged input, with <math display="inline"><semantics> <mrow> <mi>m</mi> <mrow> <mo>(</mo> <mrow> <mi>l</mi> <mo>+</mo> <mn>1</mn> </mrow> <mo>)</mo> </mrow> </mrow> </semantics></math> variables, is projected to a high dimensional encoding layer. The hidden layer extracts representative features from this encoding layer, yielding the nonlinear AE subspace (3 features in this example). The decoding and output layers are used to reconstruct the lagged input from the subspace.</p> "> Figure 11
<p>Illustration of the CAE architecture evaluated in this project [<a href="#B20-minerals-11-01106" class="html-bibr">20</a>]. Convolutions that are applied vertically convolve a feature in the time dimension. Horizontal convolutions convolve across the variables in a feature. In the example, the first convolutional filter maps three input features to a single feature in the first feature layer (orange blocks), the second convolutional filter maps three features to a single feature in the second feature layer (green blocks), the third convolutional filter maps nine features into a single feature (red blocks), and the final deconvolutional filter maps single features into a reconstructed output with 45 features (blue blocks). See <a href="#minerals-11-01106-t007" class="html-table">Table 7</a> for further details.</p> "> Figure 12
<p>Illustration of dPCA- (top graph), AE- (middle graph) and CAE-fault pattern recognition for 9 days of simulated operation. The areas shaded in gold indicate fault-free observations, red areas indicate blowback-preceding observations. The blue line is the discriminant value generated by each model, and the solid black line indicates freeboard pressure. The black dashed horizontal line indicates zero gauge pressure. The coloured dashed horizontal lines correspond to the different recognition thresholds defined in <a href="#minerals-11-01106-t008" class="html-table">Table 8</a>.</p> ">
Abstract
:1. Introduction
2. Methods
2.1. Reconstruction-Based One-Class Classifiers
2.1.1. Principal Component Analysis
2.1.2. Auto-Encoders
2.1.3. Convolutional Auto-Encoders
2.2. Case Study
2.3. Performance Evaluation
2.4. Data Partitioning
2.5. Model Development
2.5.1. Dynamic Principal Component Analysis
2.5.2. Auto-Encoder
2.5.3. Convolutional Auto-Encoder
3. Results and Discussions
4. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Nell, J. Melting of platinum group metal concentrates in South Africa. J. South Afr. Inst. Min. Metall. 2004, 104, 423–428. [Google Scholar]
- Jones, R.T. An overview of South African PGM smelting. Nickel and Cobalt 2005: Challenges in Extraction and Production. In Proceedings of the 44th Annual Conference of Metallurgists, Calgary, AB, Canada, 21–24 August 2005; pp. 147–178. [Google Scholar]
- Eksteen, J. A mechanistic model to predict matte temperatures during the smelting of UG2-rich blends of platinum group metal concentrates. Miner. Eng. 2011, 24, 676–687. [Google Scholar] [CrossRef]
- Thethwayo, B. Sulphidation of Copper Coolers in PGM Smelters. Ph.D. Thesis, University of Pretoria, Pretoria, South Africa, 2010. [Google Scholar]
- Yin, S.; Ding, S.X.; Naik, A.; Deng, P.; Haghani, A. On PCA-based fault diagnosis techniques. In Proceedings of the 2010 Conference on Control and Fault-Tolerant Systems, Nice, France, 6–8 October 2010; pp. 179–184. [Google Scholar]
- Gredilla, A.; de Vallejuelo, S.F.-O.; de Diego, A.; Madariaga, J.M.; Amigo, J. Unsupervised pattern-recognition techniques to investigate metal pollution in estuaries. TrAC Trends Anal. Chem. 2013, 46, 59–69. [Google Scholar] [CrossRef]
- Zhang, Z.; Jiang, T.; Li, S.; Yang, Y. Automated feature learning for nonlinear process monitoring—An approach using stacked denoising autoencoder and k-nearest neighbor rule. J. Process. Control. 2018, 64, 49–61. [Google Scholar] [CrossRef]
- Tax, D.M.J. One-Class Classification: Concept-Learning in the Absence of Counter-Examples. Ph.D. Thesis, Universiteit Delft, Delft, The Netherlands, 2001; p. 202. [Google Scholar]
- Mazhelis, O. One-class classifiers: A review and analysis of suitability in the context of mobile-masquerader detection. South African Comput. J. 2006, 36, 29–48. [Google Scholar]
- Charte, D.; Charte, F.; del Jesus, M.J.; Herrera, F. An analysis on the use of autoencoders for representation learning: Fundamentals, learning task case studies, explainability and challenges. Neurocomputing 2020, 404, 93–107. [Google Scholar] [CrossRef]
- Ku, W.; Storer, R.H.; Georgakis, C. Disturbance detection and isolation by dynamic principal component analysis. Chemom. Intell. Lab. Syst. 1995, 30, 179–196. [Google Scholar] [CrossRef]
- MacGregor, J.; Kourti, T. Statistical process control of multivariate processes. Control. Eng. Pr. 1995, 3, 403–414. [Google Scholar] [CrossRef]
- Misra, M.; Yue, H.; Qin, S.; Ling, C. Multivariate process monitoring and fault diagnosis by multi-scale PCA. Comput. Chem. Eng. 2002, 26, 1281–1293. [Google Scholar] [CrossRef]
- Zhang, Z.; Jiang, T.; Zhan, C.; Yang, Y. Gaussian feature learning based on variational autoencoder for improving nonlinear process monitoring. J. Process. Control. 2019, 75, 136–155. [Google Scholar] [CrossRef]
- Hu, Y.; Ping, B.; Zeng, D.; Niu, Y.; Gao, Y.; Zhang, D. Research on fault diagnosis of coal mill system based on the simulated typical fault samples. Measurement 2020, 161, 107864. [Google Scholar] [CrossRef]
- Ko, T.; Kim, H. Fault Classification in High-Dimensional Complex Processes Using Semi-Supervised Deep Convolutional Generative Models. IEEE Trans. Ind. Inform. 2020, 16, 2868–2877. [Google Scholar] [CrossRef]
- Fawaz, H.I.; Forestier, G.; Weber, J.; Idoumghar, L.; Muller, P.-A. Deep learning for time series classification: A review. Data Min. Knowl. Discov. 2019, 33, 917–963. [Google Scholar] [CrossRef] [Green Version]
- Wang, F.; Ma, Q.; Liu, W.; Chang, S.; Wang, H.; He, J.; Huang, Q. A novel ECG signal compression method using spindle convolutional auto-encoder. Comput. Methods Programs Biomed. 2019, 175, 139–150. [Google Scholar] [CrossRef]
- Chen, S.; Yu, J.; Wang, S. One-dimensional convolutional auto-encoder-based feature learning for fault diagnosis of multivariate processes. J. Process. Control. 2020, 87, 54–67. [Google Scholar] [CrossRef]
- Theunissen, C.D. Fault Pattern Recognition in Simulated Furnace Data. Ph.D. Thesis, Stellenbosch University, Stellenbosch, South Africa, 2021. [Google Scholar]
- Villalba, S.D.; Cunningham, P. An evaluation of dimension reduction techniques for one-class classification. Artif. Intell. Rev. 2007, 27, 273–294. [Google Scholar] [CrossRef]
- Bishop, C.M. Pattern Recognition and Machine Learning, 1st ed.; Springer: New York, NY, USA, 2006; pp. 227–272. [Google Scholar]
- Merelli, E.; Luck, M. Technical Forum Group on Agents in Bioinformatics. Knowl. Eng. Rev. 2004, 20, 117–125. [Google Scholar]
- Shyu, M.L.; Chen, S.C.; Sarinnapakorn, K.; Chang, L. A Novel Anomaly Detection Scheme Based on Principal Component Classifier. In Proceedings of the 3rd IEEE International Conference on Data Mining, Melbourne, FL, USA, 19–22 November 2003; pp. 353–365. [Google Scholar]
- Singhal, A.; Seborg, D. Pattern matching in historical batch data using PCA. IEEE Control. Syst. 2002, 22, 53–63. [Google Scholar]
- Dong, Y.; Qin, S.J. A novel dynamic PCA algorithm for dynamic data modeling and process monitoring. J. Process. Control. 2018, 67, 1–11. [Google Scholar] [CrossRef]
- Calli, E. Faster Convolutional Neural Networks. Master’s Thesis, Radboud University, Nijmegen, The Netherlands, 2017. [Google Scholar]
- Glorot, X.; Bordes, A.; Bengio, Y. Deep Sparse Rectifier Neural Networks. In Proceedings of the International Conference on Artificial Intelligence and Statistics, Ft. Lauderdale, FL, USA, 11–13 April 2011; Volume 15, pp. 315–323. [Google Scholar]
- Sheng, Y.Y.; Irons, G.A.; Tisdale, D.G. Transport phenomena in electric smelting of nickel matte: Part I. Electric potential distribution. Metall. Mater. Trans. B Process Metall. Mater. Process. Sci. 1998, 29, 77–83. [Google Scholar] [CrossRef]
- Sheng, Y.Y.; Irons, G.A.; Tisdale, D.G. Transport phenomena in electric smelting of nickel matte: Part II. Mathematical modeling. Metall. Mater. Trans. B Process Metall. Mater. Process. Sci. 1998, 29, 85–94. [Google Scholar] [CrossRef]
- Salfner, F.; Lenk, M.; Malek, M. A survey of online failure prediction methods. ACM Comput. Surv. 2010, 42, 1–42. [Google Scholar] [CrossRef]
- Wise, B.M.; Ricker, N.L.; Veltkamp, D.F.; Kowalski, B.R. Theoretical basis for the use of principal component models for monitoring multivariate processes. Process Control Qual. 1990, 1, 41–51. [Google Scholar]
Monitored Variable | Symbol | Units | |
---|---|---|---|
1 | Slag zone height | ||
2 | Matte zone height | ||
3 | Slag zone temperature | ||
4 | Matte zone temperature | ||
5 | Bulk concentrate temperature | ||
6 | Freeboard temperature | ||
7 | Cooling water temperature | ||
8 | Freeboard pressure | ||
9 | Reaction gas concentration in freeboard |
Pattern Present | Pattern Absent | |
---|---|---|
Recognition | True positives— | False positives— |
No recognition | False negatives— | True negatives— |
Step Description | Output | Equation | |
---|---|---|---|
1 | Standardize | - | |
2 | Lag the standardized dataset | 5 | |
3 | Optimize model parameters to reconstruct from | 3 |
Step Description | Output | Equation | |
---|---|---|---|
1 | Standardize | - | |
2 | Lag the standardized dataset | 5 | |
3 | Construct corrupted | 4 | |
4 | Optimize model parameters to reconstruct from | 3 |
Step Description | Output | Equation | |
---|---|---|---|
1 | Standardize | - | |
2 | Lag with observations | 5 | |
3 | Reconstruct | 1 | |
4 | Calculate the reconstruction error | 2 |
Design Parameter | Investigated Value | |
---|---|---|
1 | Network architecture, | See Figure 10 |
2 | Lag dimension, | 4 |
3 | Input corruption variance, | 0.1 |
4 | Number of bottleneck neurons, | 3 |
5 | Regularization parameter, | |
6 | Learning rate, | 0.01 |
7 | Momentum parameter, | 0.001 |
Layer Type | Output Size | Filter Shape | No. of Filters | Parameters | |
---|---|---|---|---|---|
1 | Input | ||||
2 | Convolution + ReLU | 24 weights, 8 biases | |||
3 | Convolution + ReLU | 192 weights, 8 biases | |||
4 | Convolution + ReLU | 288 weights, 4 biases | |||
5 | Deconvolution | 252 weights, 1 bias | |||
6 | Output |
Recognition Threshold | Motivation | |
---|---|---|
1 | Threshold where 95% precision is achieved. | Useful to evaluate detection delay and specificity at high precision |
2 | Maximum threshold where all PPEs are predicted. | Enables prediction of each blowback |
3 | Minimum threshold where 100% specificity is achieved. | No false alarms |
95 % Precision: | dPCA | AE | CAE | |
1 | Average time-to-event (hrs) | 3.08 | 2.65 | 4.43 |
2 | Sensitivity (%) | 24.73 | 27.80 | 52.12 |
3 | Specificity (%) | 99.49 | 99.40 | 98.99 |
4 | Failed predictions | 40 | 20 | 0 |
No failed predictions: | dPCA | AE | CAE | |
5 | Average time-to-event (hrs) | 3.99 | 4.06 | 2.29 |
6 | Sensitivity (%) | 43.94 | 46.23 | 24.95 |
7 | Specificity (%) | 86.70 | 90.14 | 99.86 |
8 | Failed predictions | 0 | 0 | 0 |
No false predictions: | dPCA | AE | CAE | |
9 | Average time-to-event (hrs) | 1.45 | 2.24 | 1.26 |
10 | Sensitivity (%) | 2.39 | 22.18 | 11.37 |
11 | Specificity (%) | 100.00 | 100.00 | 100.00 |
12 | Failed predictions | 46 | 24 | 16 |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Theunissen, C.D.; Bradshaw, S.M.; Auret, L.; Louw, T.M. One-Dimensional Convolutional Auto-Encoder for Predicting Furnace Blowback Events from Multivariate Time Series Process Data—A Case Study. Minerals 2021, 11, 1106. https://doi.org/10.3390/min11101106
Theunissen CD, Bradshaw SM, Auret L, Louw TM. One-Dimensional Convolutional Auto-Encoder for Predicting Furnace Blowback Events from Multivariate Time Series Process Data—A Case Study. Minerals. 2021; 11(10):1106. https://doi.org/10.3390/min11101106
Chicago/Turabian StyleTheunissen, Carl Daniel, Steven Martin Bradshaw, Lidia Auret, and Tobias Muller Louw. 2021. "One-Dimensional Convolutional Auto-Encoder for Predicting Furnace Blowback Events from Multivariate Time Series Process Data—A Case Study" Minerals 11, no. 10: 1106. https://doi.org/10.3390/min11101106
APA StyleTheunissen, C. D., Bradshaw, S. M., Auret, L., & Louw, T. M. (2021). One-Dimensional Convolutional Auto-Encoder for Predicting Furnace Blowback Events from Multivariate Time Series Process Data—A Case Study. Minerals, 11(10), 1106. https://doi.org/10.3390/min11101106