Abstract: Software-intensive systems produce logs for troubleshooting purposes. Recently, many deep learning models have been proposed to automatically detect system anomalies based on log data. These models typically claim very high detection accuracy; for example, most models report an F-measure greater than 0.9 on the commonly used HDFS dataset. To gain a deeper understanding of how far we are from solving the problem of log-based anomaly detection, in this paper we conduct an in-depth analysis of five state-of-the-art deep learning-based models for detecting system anomalies on four public log datasets. Our experiments focus on several aspects of model evaluation, including training data selection, data grouping, class distribution, data noise, and early detection ability. Our results show that all of these aspects have a significant impact on the evaluation, and that none of the studied models consistently works well: the problem of log-based anomaly detection has not been solved yet. Based on our findings, we also suggest possible directions for future work. This repository provides the implementations of recent log-based anomaly detection methods.
| Model | Paper |
|---|---|
| Unsupervised | |
| DeepLog (CCS '17) | DeepLog: Anomaly Detection and Diagnosis from System Logs through Deep Learning |
| LogAnomaly (IJCAI '19) | LogAnomaly: Unsupervised Detection of Sequential and Quantitative Anomalies in Unstructured Logs |
| LogBERT (IJCNN '21) | LogBERT: Log Anomaly Detection via BERT |
| Semi-supervised | |
| PLELog (ICSE '21) | Semi-Supervised Log-Based Anomaly Detection via Probabilistic Label Estimation |
| Supervised | |
| CNN (DASC '18) | Detecting Anomaly in Big Data System Logs Using Convolutional Neural Network |
| LogRobust (ESEC/FSE '19) | Robust Log-Based Anomaly Detection on Unstable Log Data |
| NeuralLog (ASE '21) | Log-based Anomaly Detection Without Log Parsing |
- Python 3
- NVIDIA GPU + CUDA cuDNN
- PyTorch
The required packages are listed in requirements.txt. Install them with:
pip install -r requirements.txt
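As a quick sanity check (not part of the pipeline), you can verify that PyTorch is installed and can see the GPU before training:

```python
import torch

# Print the installed PyTorch version and whether CUDA is usable.
print("PyTorch:", torch.__version__)
print("CUDA available:", torch.cuda.is_available())
if torch.cuda.is_available():
    # Name of the first visible GPU device.
    print("GPU:", torch.cuda.get_device_name(0))
```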
Raw and preprocessed datasets (including parsed logs and their embeddings) are available at https://zenodo.org/record/8115559.
We use datasets collected by LogPAI for evaluation. The datasets are available at loghub. The details of the datasets are shown below:
| Dataset | Size | # Logs | # Anomalies | Anomaly Ratio |
|---|---|---|---|---|
| HDFS | 1.5 GB | 11,175,629 | 16,838 | 2.93% |
| BGL | 743 MB | 4,747,963 | 348,460 | 7.34% |
| Thunderbird | 1.4 GB | 10,000,000 | 4,934 | 0.49% |
| Spirit | 1.4 GB | 5,000,000 | 764,500 | 15.29% |
We use log parsers from logparser to parse raw logs. We use AEL, Spell, Drain, and IPLoM for our experiments. The configuration for each parser used in our experiments can be found here.
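As an illustration, the snippet below shows how Drain from the logparser toolkit is typically invoked, following the logparser demo scripts; the log format, regex, and parameter values shown here are the demo settings for HDFS and are assumptions for illustration only (the exact settings used in our experiments are those in the linked configuration, and depending on the logparser version the import may instead be `from logparser.Drain import LogParser`):

```python
from logparser import Drain

# Settings below follow the logparser Drain demo for HDFS (illustrative only).
input_dir  = "path/to/raw/logs/"   # directory containing the raw log file
output_dir = "path/to/parsed/"     # directory for the parsing results
log_file   = "HDFS.log"
log_format = "<Date> <Time> <Pid> <Level> <Component>: <Content>"
regex      = [r"blk_(|-)[0-9]+", r"(\d+\.){3}\d+(:\d+)?"]  # block ids, IP addresses
st         = 0.5   # similarity threshold
depth      = 4     # depth of the parse tree

parser = Drain.LogParser(log_format, indir=input_dir, outdir=output_dir,
                         depth=depth, st=st, rex=regex)
parser.parse(log_file)
```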
For a fair comparison, we use the same fastText-based embedding method for all models. Use the following command to generate embeddings for log templates:
$ cd dataset
# download the pre-trained fastText word vectors
$ wget https://dl.fbaipublicfiles.com/fasttext/vectors-english/crawl-300d-2M.vec.zip && unzip crawl-300d-2M.vec.zip
# generate embeddings for log templates
$ python generate_embeddings.py <dataset> <strategy>
# where <dataset> is one of {HDFS, BGL, Thunderbird, Spirit}
# and <strategy> is one of {average, tfidf}
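For reference, here is a minimal sketch of what the average strategy computes, assuming the pre-trained vectors have been unzipped to crawl-300d-2M.vec; the template and file names below are hypothetical, and the actual generate_embeddings.py script also handles tokenization details and the tfidf strategy:

```python
import json
import numpy as np

def load_fasttext_vectors(path, vocab):
    """Load only the fastText word vectors needed for the given vocabulary."""
    vectors = {}
    with open(path, encoding="utf-8", errors="ignore") as f:
        next(f)  # skip the header line (vocabulary size, dimension)
        for line in f:
            tokens = line.rstrip().split(" ")
            if tokens[0] in vocab:
                vectors[tokens[0]] = np.asarray(tokens[1:], dtype=np.float32)
    return vectors

def average_embedding(template, vectors, dim=300):
    """Average the fastText vectors of a template's tokens (the 'average' strategy)."""
    words = template.lower().split()
    word_vecs = [vectors[w] for w in words if w in vectors]
    return np.mean(word_vecs, axis=0) if word_vecs else np.zeros(dim)

# Hypothetical usage: templates would come from the parsed log templates.
templates = {"E1": "Receiving block <*> src <*> dest <*>"}
vocab = {w for t in templates.values() for w in t.lower().split()}
vectors = load_fasttext_vectors("crawl-300d-2M.vec", vocab)
embeddings = {eid: average_embedding(t, vectors).tolist() for eid, t in templates.items()}
with open("embeddings_average.json", "w") as f:
    json.dump(embeddings, f)
```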
The configuration files used to set up the hyperparameters for training and testing can be found at /config. The main parameters are described as follows:
- `data_dir`: the directory of the dataset
- `log_file`: the path to the log file
- `dataset_name`: the name of the dataset
- `grouping`: the log grouping technique (session or sliding)
- `session_level`: whether sliding windows are defined over log entries or time (i.e., entry or minute)
- `window_size`: the window size for sliding grouping
- `step_size`: the step size for sliding grouping (if step_size = window_size, this is equivalent to fixed grouping; see the sketch below)
- `is_chronological`: whether to use chronological order for the train/test split (only applies to sliding grouping)
- `model_name`: the name of the model (e.g., DeepLog, LogAnomaly, LogRobust, PLELog, CNN)
- `sequential`: whether to use sequential features (i.e., indexes of log templates)
- `quantitative`: whether to use quantitative features (i.e., event count vectors)
- `semantic`: whether to use semantic features (i.e., log template embeddings)
- `embedding_dim`: the dimension of log template embeddings
- `embeddings`: the path to the JSON file containing log template embeddings
Training parameters such as batch_size, lr, max_epoch, optimizer, etc. are also defined in the configuration files.
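To make the grouping parameters concrete, the sketch below shows how entry-based sliding grouping could partition a chronological sequence of log template ids for a given window_size and step_size; this is a simplified illustration, not the repository's exact implementation:

```python
def sliding_window_grouping(log_ids, window_size, step_size):
    """Group a chronological sequence of log template ids into (possibly
    overlapping) windows. With step_size == window_size this degenerates
    to fixed (non-overlapping) grouping."""
    windows = []
    for start in range(0, len(log_ids), step_size):
        window = log_ids[start:start + window_size]
        if window:
            windows.append(window)
    return windows

# Example: 10 log entries, windows of 4 entries sliding by 2.
print(sliding_window_grouping(list(range(10)), window_size=4, step_size=2))
# [[0, 1, 2, 3], [2, 3, 4, 5], [4, 5, 6, 7], [6, 7, 8, 9], [8, 9]]
```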
python main_run.py --config_file <config_file>
# where `<config_file>` is the path to the configuration file
To see all the options, run `python main_run.py -h`.
If you find the code and models useful for your research, please cite the following paper:
@inproceedings{le2022log,
title={Log-based Anomaly Detection with Deep Learning: How Far Are We?},
author={Le, Van-Hoang and Zhang, Hongyu},
  booktitle={2022 IEEE/ACM 44th International Conference on Software Engineering (ICSE)},
year={2022}
}