
Few-Shot Log Anomaly Detection Based on Matching Networks

Authors: Chunjing Han, Bohai Guan, Tong Li, Di Kang, Jifeng Qin, Yulei Wu

IEEE Transactions on Network and Service Management, Volume 21, Issue 3

Pages 2909 - 2925

https://doi.org/10.1109/TNSM.2024.3363626

Published: 08 February 2024

Abstract

To address log anomaly detection in scenarios where labeled log data are limited, this paper proposes Log-MatchNet, a novel few-shot log anomaly detection method. Because log data are unstructured, diverse, and evolve over time, we apply structured processing and log parsing to convert log content and template IDs into vectors, and we extract features with the BERT model. By integrating multiple datasets and post-training the BERT model for domain adaptation, we also obtain $BERT\_Post$, a module with universal feature extraction capabilities in the log domain. Compared to $BERT_{base}$ and CyBERT, our method demonstrates superior performance in log anomaly detection, especially when labeled data are limited. With only 2 annotated normal logs and 2 annotated abnormal logs, $BERT\_Post$ achieves a 16.14% increase in F1-score. To address imbalanced data, we introduce a matching network that learns similarity scores between input and prototype vectors; it generalizes well, with an average accuracy of 99.6%. In few-shot scenarios, Log-MatchNet outperforms traditional methods and the Proto-Siamese network in terms of F1-score. In an unstable log evolution environment, our method is robust to noisy data, achieving an F1-score of 81.2% even with 20% injected noise. Compared to LogAnMeta, our approach yields a 31.71% increase in F1-score. Experimental results demonstrate that Log-MatchNet detects anomalies effectively with limited labeled log data and remains robust in log evolution scenarios.
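The idea of scoring an input against class prototypes can be illustrated with a minimal sketch. This is not the Log-MatchNet implementation: it assumes a fixed cosine similarity in place of the learned similarity the abstract describes, and it uses hand-written three-dimensional toy vectors in place of BERT features. The names `cosine`, `prototype`, and `classify` are hypothetical helpers introduced only for illustration. The support set mirrors the 2-normal, 2-abnormal setting mentioned in the abstract.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def prototype(vectors):
    """Element-wise mean of the support vectors for one class."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def classify(query, support):
    """Return the best-matching label and the per-class similarity scores.

    `support` maps a class label to its list of labeled example vectors.
    Assumption: fixed cosine similarity stands in for a learned score.
    """
    scores = {
        label: cosine(query, prototype(vectors))
        for label, vectors in support.items()
    }
    best_label = max(scores, key=scores.get)
    return best_label, scores


# Toy support set: 2 labeled normal logs and 2 labeled abnormal logs.
# These vectors are invented placeholders, not real log embeddings.
support = {
    "normal": [[1.0, 0.1, 0.0], [0.9, 0.2, 0.1]],
    "anomalous": [[0.0, 0.9, 1.0], [0.1, 1.0, 0.8]],
}

label, scores = classify([0.8, 0.3, 0.1], support)
print(label, scores)
```

In this sketch a query is assigned to whichever class prototype it is most similar to, so only a handful of labeled examples per class are needed to form the prototypes. A learned similarity function, as the abstract describes, would replace `cosine` with a trained scoring model.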

