ConAnomaly: Content-Based Anomaly Detection for System Logs.

Journal: Sensors (Basel, Switzerland)
Published Date:

Abstract

Enterprise systems typically produce a large number of logs to record runtime states and important events. Log anomaly detection is efficient for business management and system maintenance. Most existing log-based anomaly detection methods use log parser to get log event indexes or event templates and then utilize machine learning methods to detect anomalies. However, these methods cannot handle unknown log types and do not take advantage of the log semantic information. In this article, we propose ConAnomaly, a log-based anomaly detection model composed of a log sequence encoder (log2vec) and multi-layer Long Short Term Memory Network (LSTM). We designed log2vec based on the Word2vec model, which first vectorized the words in the log content, then deleted the invalid words through part of speech tagging, and finally obtained the sequence vector by the weighted average method. In this way, ConAnomaly not only captures semantic information in the log but also leverages log sequential relationships. We evaluate our proposed approach on two log datasets. Our experimental results show that ConAnomaly has good stability and can deal with unseen log types to a certain extent, and it provides better performance than most log-based anomaly detection methods.

Authors

  • Dan Lv
    The First Affiliated Hospital of Ningbo University, Ningbo, People's Republic of China.
  • Nurbol Luktarhan
    College of Information Science and Engineering, Xinjiang University, Urumqi 830046, China.
  • Yiyong Chen
    College of Information Science and Engineering, Xinjiang University, Urumqi 830046, China.