NILMPEds: A Performance Evaluation Dataset for Event Detection Algorithms in Non-Intrusive Load Monitoring
<p>Underlying folder and file organization of NILMPEds.</p> "> Figure 2
<p>Distribution of detected power events across the five algorithms and four datasets.</p> "> Figure 2 Cont.
<p>Distribution of detected power events across the five algorithms and four datasets.</p> "> Figure 3
<p>Median F<math display="inline"><semantics> <msub> <mrow/> <mn>1</mn> </msub> </semantics></math>-Score, for each dataset, under the different detection tolerance values.</p> ">
Abstract
:1. Summary
2. Methods
2.1. Event Detection Dataset
2.2. Event Detection Algorithms and Models
2.3. Power Events and Performance Metrics
3. Data Description
3.1. Ground Truth
3.2. Event Detection Models
3.3. Power Events
3.4. Performance Metrics
4. Data Exploration
Performance Metrics
5. Future Work and Source Code Release
Funding
Acknowledgments
Conflicts of Interest
References
- Goodman, S.N.; Fanelli, D.; Ioannidis, J.P.A. What does research reproducibility mean? Sci. Transl. Med. 2016, 8, 341ps12. [Google Scholar] [CrossRef] [PubMed]
- Hart, G. Nonintrusive appliance load monitoring. Proc. IEEE 1992, 80, 1870–1891. [Google Scholar] [CrossRef]
- Pereira, L.; Nunes, N. Performance evaluation in non-intrusive load monitoring: Datasets, metrics, and tools—A review. Wiley Interdiscip. Rev. Data Min. Knowl. Discov. 2018, 8, e1265. [Google Scholar] [CrossRef]
- Bergés, M.; Kolter, Z. Non-Intrusive Load Monitoring: A Review of the State of the Art. In Proceedings of the 1st International Workshop on Non-Intrusive Load Monitoring, Pittsburgh, PA, USA, 7 May 2012. [Google Scholar]
- Zeifman, M.; Roth, K. Nonintrusive appliance load monitoring: Review and outlook. IEEE Trans. Consum. Electron. 2011, 57, 76–84. [Google Scholar] [CrossRef]
- Anderson, K.; Ocneanu, A.; Benitez, D.; Carlson, D.; Rowe, A.; Berges, M. BLUED: A Fully Labeled Public Dataset for Event-Based Non-Intrusive Load Monitoring Research. In Proceedings of the 2nd KDD Workshop on Data Mining Applications in Sustainability (SustKDD), Beijing, China, 12 August 2012; pp. 1–5. [Google Scholar]
- Jazizadeh, F.; Afzalan, M.; Becerik-Gerber, B.; Soibelman, L. EMBED: A Dataset for Energy Monitoring Through Building Electricity Disaggregation. In Proceedings of the Ninth International Conference on Future Energy Systems, Karlsruhe, Germany, 12–15 June 2018; ACM: New York, NY, USA, 2018; pp. 230–235. [Google Scholar] [CrossRef]
- Makonin, S.; Ellert, B.; Bajić, I.V.; Popowich, F. Electricity, water, and natural gas consumption of a residential house in Canada from 2012 to 2014. Sci. Data 2016, 3, 160037. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Murray, D.; Stankovic, L.; Stankovic, V. An electrical load measurements dataset of United Kingdom households from a two-year longitudinal study. Sci. Data 2017, 4, 160122. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Makonin, S.; Wang, Z.J.; Tumpach, C. RAE: The Rainforest Automation Energy Dataset for Smart Grid Meter Data Analysis. Data 2018, 3, 8. [Google Scholar] [CrossRef]
- Kelly, J.; Knottenbelt, W. The UK-DALE dataset, domestic appliance-level electricity demand and whole-house demand from five UK homes. Sci. Data 2015, 2, 150007. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Pereira, L.; Nunes, N. An Experimental Comparison of Performance Metrics for Event Detection Algorithms in NILM. In Proceedings of the 4th International NILM Workshop, Austin, TX, USA, 7–8 March 2018. [Google Scholar]
- Makonin, S.; Popowich, F. Nonintrusive load monitoring (NILM) performance evaluation. Energy Effic. 2014, 8, 809–814. [Google Scholar] [CrossRef]
- Mayhorn, E.T.; Sullivan, G.P.; Fu, T.; Petersen, J.M. Non-Intrusive Load Monitoring Laboratory-Based Test Protocols; Technical Report 26184; Pacific Northwest National Laboratory (PNNL): Richland, WA, USA, 2017. [Google Scholar]
- Pereira, L. Developing and Evaluating a Probabilistic Event Detector for Non-Intrusive Load Monitoring. In Proceedings of the fifth IFIP Conference on Sustainable Internet and ICT for Sustainability, Funchal, Portugal, 6–7 December 2017; pp. 1–10. [Google Scholar]
- Cao, H.A.; Wijaya, T.K.; Aberer, K.; Nunes, N. A collaborative framework for annotating energy datasets. In Proceedings of the 2015 IEEE International Conference on Big Data (Big Data), Santa Clara, CA, USA, 29 October–1 November 2015; pp. 2716–2725. [Google Scholar] [CrossRef]
- Pereira, L.; Nunes, N. Semi-automatic labeling for public non-intrusive load monitoring datasets. In Proceedings of the 2015 Sustainable Internet and ICT for Sustainability (SustainIT), Madrid, Spain, 14–15 April 2015; pp. 1–4. [Google Scholar] [CrossRef]
- Völker, B.; Scholl, P.M.; Becker, B. Semi-Automatic Generation and Labeling of Training Data for Non-intrusive Load Monitoring. In Proceedings of the Tenth ACM International Conference on Future Energy Systems, Phoenix, AZ, USA, 25–28 June 2019; pp. 17–23. [Google Scholar] [CrossRef]
- Pereira, M.; Velosa, N.; Pereira, L. dsCleaner: A Python Library to Clean, Preprocess and Convert Non-Instrusive Load Monitoring Datasets. Data 2018, 4, 123. [Google Scholar] [CrossRef]
- Berges, M.; Goldman, E.; Matthews, H.; Soibelman, L.; Anderson, K. User-Centered Nonintrusive Electricity Load Monitoring for Residential Buildings. J. Comput. Civ. Eng. 2011, 25, 471–480. [Google Scholar] [CrossRef]
- Meehan, P.; McArdle, C.; Daniels, S. An Efficient, Scalable Time-Frequency Method for Tracking Energy Usage of Domestic Appliances Using a Two-Step Classification Algorithm. Energies 2014, 7, 7041–7066. [Google Scholar] [CrossRef] [Green Version]
- Pereira, L. Hardware and Software Platforms to Deploy and Evaluate Non-Intrusive Load Monitoring Systems. Ph.D. Thesis, Universidade da Madeira, Funchal, Portugal, 2016. [Google Scholar]
- Anderson, K. Non-Intrusive Load Monitoring: Disaggregation of Energy by Unsupervised Power Consumption Clustering. Ph.D. Thesis, Carnegie Mellon University, Pittsburgh, PA, USA, 2014. [Google Scholar]
1. | NIMPEds on OSF, https://osf.io/bxudn/?view_only=0b17ac2890b0464d80f9bfc0e2a17f58 |
Dataset Name | Dataset ID | Rate | P.E. | Power Change (W) | Elapsed Time (s) | ||||||
---|---|---|---|---|---|---|---|---|---|---|---|
Mean | 25% | 50% | 75% | Mean | 25% | 50% | 75% | ||||
UK-DALE H1 | 1 | 50 Hz | 5440 | 268 | 48 | 100 | 273 | 111 | 4 | 7 | 28 |
UK-DALE H2 | 2 | 50 Hz | 2842 | 365 | 45 | 74 | 137 | 212 | 6 | 15 | 172 |
BLUED PA | 3 | 60 Hz | 887 | 274 | 84 | 116 | 582 | 690 | 18 | 294 | 892 |
BLUED PB | 4 | 60 Hz | 1562 | 351 | 40 | 170 | 428 | 383 | 7 | 35 | 83 |
Event Detection Algorithm | Models | Model-Dataset Pairs | |
---|---|---|---|
Name | ID | ||
Simplified LLR Detector with Maxima [15] | 1 | 1 k | 4 k |
LLR Detector with Maxima [15] | 2 | 1 k | 4 k |
Simplified LLR Detector with Voting [15] | 3 | 1.1 k (50 Hz); 9.5 k (60 Hz) | 22 k; 19 k |
LLR Detector with Voting [20] | 4 | 1.1 k (50 Hz); 9.5 k (60 Hz) | 22 k; 19 k |
Expert Heuristic Detector [21] | 5 | 4.95 k | 19.8 k |
47.95 k | 109.8 k |
Column | Description |
---|---|
Position of the power event in number of samples from the beginning of the dataset. | |
Difference in power before and after the power event, considering the average of one second of samples before and after the event position. | |
Distance in samples to the previous power event. | |
The corresponding day in the dataset. The days are numbered from 1 to the total number of days in the dataset. |
Column | Description |
---|---|
All Models (dm_*.csv) | |
Unique model identifier for each event detection algorithm. | |
Length of the pre-event window in seconds. | |
Length of the post-event window in seconds. | |
Models for Algorithms 1 and 2 (dm_a1a2.csv) | |
Maxima precision in seconds. | |
Models for Algorithms 3 and 4 (dm_a3a4_<?>Hz.csv) | |
Length of the voting window in seconds. | |
Minimum number of votes necessary to trigger a power event. | |
Models for Algorithm 5 (dm_a5_<?>Hz.csv) | |
Number of seconds before the second under evaluation | |
Minimum elapsed time between events | |
Sample index inside the second where the event occurred |
Column | Description |
---|---|
Power event unique identifier. | |
Model identifier. This corresponds to the Model_ID in the detection models data. | |
Position of the power event in number of samples from the beginning of the dataset. | |
Difference in power before and after the power event, considering the average of one second of samples before and after the event position. | |
Value of the detection statistics. This field is not available for the models of Algorithm 5 since these are not probabilistic. |
Column | Description | Values Range | |
---|---|---|---|
Best | Worst | ||
Model identifier. This corresponds to the Model_IDin the power events data. | - | - | |
A tolerance value (in samples) that was set to account for eventual ambiguity when labeling the event detection datasets. | - | - | |
The number of real power events in the dataset. | - | - | |
True Positives | Events | 0 | |
False Positives | 0 | S- | |
True Negatives | S- | 0 | |
False Negatives | 0 | Events | |
A | Accuracy | 1 | 0 |
E | Error-rate | 0 | 1 |
P | Precision | 1 | 0 |
R | Recall | 1 | 0 |
False Positive Rate | 0 | 1 | |
True Positive Percentage | 1 | 0 | |
False Positive Percentage | 0 | ||
True Negative Rate | 1 | 0 | |
False Discovery Rate | 0 | 1 | |
-Score | 1 | 0 | |
-Score | 1 | 0 | |
-Score | 1 | 0 | |
Matthews Correlation Coefficient | 1 | −1 | |
Standardized MCC | 1 | 0 | |
Distance to Perfect Score between Precision and Recall | 0 | 2 | |
Distance to Perfect Score between TPR (i.e., Recall) and FPR | 0 | ||
Distance to Perfect Score between TPP and FPP. | 0 | ||
Wilcoxon statistics based Area Under Curve | 1 | 0 | |
Wilcoxon statistics based AUC Balanced | 1 | 0 | |
Geometric Mean AUC | 1 | 0 | |
Biased AUC | 1 | 0 | |
Total Power Change - False Positives | 0 | ||
Total Power Change - False Negatives | 0 | ||
Average Power Change - False Positives | 0 | ||
Average Power Change - False Negatives | 0 | ||
Distance to Perfect Score TPC | 0 | ||
Distance to Perfect Score APC | 0 |
© 2019 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
Pereira, L. NILMPEds: A Performance Evaluation Dataset for Event Detection Algorithms in Non-Intrusive Load Monitoring. Data 2019, 4, 127. https://doi.org/10.3390/data4030127
Pereira L. NILMPEds: A Performance Evaluation Dataset for Event Detection Algorithms in Non-Intrusive Load Monitoring. Data. 2019; 4(3):127. https://doi.org/10.3390/data4030127
Chicago/Turabian StylePereira, Lucas. 2019. "NILMPEds: A Performance Evaluation Dataset for Event Detection Algorithms in Non-Intrusive Load Monitoring" Data 4, no. 3: 127. https://doi.org/10.3390/data4030127