2021 51st Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN), 2021
This paper presents a novel sentiment analysis-based approach for error detection in large-scale systems, particularly high-performance computing (HPC) systems designed for exascale computing. It offers a machine learning framework to automatically create a sentiment lexicon from system log messages and uses this lexicon to accurately identify system errors and problematic nodes with an average F-score of 96%. The approach outperforms traditional machine learning methods, indicating the effectiveness of leveraging sentiment in failure log analysis.
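The abstract does not spell out the lexicon-construction pipeline; the sketch below only conveys the flavor of the idea, bootstrapping token polarities from a handful of labeled log lines. The labels, tokenizer, and scoring rule are all invented for illustration and are not the paper's method.

```python
# Hypothetical sketch: derive a "sentiment" lexicon for log tokens from a
# small set of labeled log lines, then score unseen messages with it.
from collections import Counter

labeled_logs = [
    ("machine check interrupt asserted on node 12", "error"),
    ("correctable memory error threshold exceeded", "error"),
    ("job 4417 started on 128 nodes", "normal"),
    ("link established, negotiated 100G full duplex", "normal"),
]

err_counts, ok_counts = Counter(), Counter()
for line, label in labeled_logs:
    target = err_counts if label == "error" else ok_counts
    target.update(line.lower().split())

def token_polarity(tok, smoothing=1.0):
    """Positive score => token leans toward error messages."""
    e = err_counts[tok] + smoothing
    n = ok_counts[tok] + smoothing
    return (e - n) / (e + n)

def message_score(line):
    toks = line.lower().split()
    return sum(token_polarity(t) for t in toks) / max(len(toks), 1)

print(message_score("uncorrectable memory error on node 7"))  # leans error
print(message_score("job 88 started on 64 nodes"))            # leans normal
```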
Empirical Software Engineering, 2015
Predicting system failures can be of great benefit to managers, who gain better command over system performance. The logs that systems generate are a valuable source of information for predicting system reliability, so there is an increasing demand for tools that mine logs and provide accurate predictions. However, interpreting the information in logs poses some challenges. This study discusses how to effectively mine sequences of logs and provide correct predictions. The approach integrates different machine learning techniques to control for data brittleness, ensure sound model selection and validation, and increase the robustness of classification results. We apply the proposed approach to log sequences of 25 different applications of a software system for car telemetry and performance. On this system, we discuss the ability of support vector machines with three well-known kernels (multilayer perceptron, radial basis function, and linear) to fit and predict defective log sequences. Our results show that a good analysis strategy provides stable, accurate predictions; such a strategy must at least require high fitting ability of the models used for prediction. We demonstrate that such models give excellent predictions both on individual applications (e.g., 1% false positive rate, 94% true positive rate, and 95% precision) and across system applications (on average, 9% false positive rate, 78% true positive rate, and 95% precision). We also show that these results hold for different degrees of sequence defectiveness. To put our results in context, we compare them with recent studies in system log analysis. We conclude with recommendations drawn from our study.
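As a hedged illustration of the kernel comparison described above (not the study's code; the features, labels, and data below are synthetic stand-ins), scikit-learn's SVC can be cross-validated with linear, RBF, and sigmoid kernels, the sigmoid kernel being the one commonly referred to as the MLP kernel:

```python
# Illustrative only: compare SVM kernels on fixed-length feature vectors
# that stand in for per-log-sequence features.
import numpy as np
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 10))                  # stand-in sequence features
y = (X[:, 0] + 0.5 * X[:, 1] > 0).astype(int)   # stand-in defect labels

for kernel in ("linear", "rbf", "sigmoid"):
    clf = make_pipeline(StandardScaler(), SVC(kernel=kernel))
    scores = cross_val_score(clf, X, y, cv=5, scoring="precision")
    print(f"{kernel:8s} precision: {scores.mean():.2f} +/- {scores.std():.2f}")
```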
IBM Journal of Research and Development, 2000
Modern computer systems generate an enormous number of logs. IBM Mining Effectively Large Output Data Yield (MELODY) is a unique and innovative solution for handling these logs and filtering out anomalies and failures. MELODY can detect system errors early and avoid subsequent crashes by identifying the root causes of such errors. By analyzing the logs leading up to a problem, MELODY can pinpoint when and where things went wrong and present them visually to the user, ensuring that corrections are made accurately and effectively. We present the MELODY solution and describe its architecture, algorithmic components, functions, and benefits. After being trained on a large portion of relevant data, MELODY provides alerts of abnormalities in newly arriving log files or in streams of logs. The solution is used regularly by IBM services groups that support IBM xSeries servers. MELODY was recently tested with ten large IBM customers who use zSeries machines and was found to be extremely useful for the information technology experts in those companies. They found that the solution's ability to reduce extensively large log data to manageable sets of highlighted messages saved them time and helped them make better use of the data.
Proceedings of the 2nd …, 2007
System logs, such as the Windows Event log or the Linux system log, are an important resource for computer system management. We present a method for ranking system log messages by their estimated value to users, and generating a log view that displays the most important messages. The ranking process uses a dataset of system logs from many computer systems to score messages. For better scoring, unsupervised clustering is used to identify sets of systems that behave similarly. We propose a new feature construction scheme that measures the difference in the ranking of messages by frequency, and show that it leads to better clustering results. The expected distribution of messages in a given system is estimated using the resulting clusters, and log messages are scored using this estimation. We show experimental results from tests on xSeries servers. A tool based on the described methods is being used to aid support personnel in the IBM xSeries support center.
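A minimal sketch of the scoring idea, under the assumption that a set of similarly behaving systems has already been identified by clustering (the smoothing constant and message vocabulary are invented for illustration):

```python
# Rough sketch, not the paper's code: estimate each message type's expected
# frequency from a cluster of similar systems, then rank a target system's
# messages by how surprising they are relative to that expectation.
from collections import Counter

cluster_logs = [  # message streams from systems judged to behave similarly
    ["svc_start", "net_up", "svc_start", "disk_ok"],
    ["svc_start", "net_up", "disk_ok", "disk_ok"],
]
target_log = ["svc_start", "net_up", "disk_fail", "disk_ok"]

expected = Counter()
for log in cluster_logs:
    expected.update(log)
total = sum(expected.values())

def surprise(msg, smoothing=0.5):
    # Rare-in-cluster messages score high and float to the top of the view.
    p = (expected[msg] + smoothing) / (total + smoothing * len(expected))
    return 1.0 / p

ranked = sorted(set(target_log), key=surprise, reverse=True)
print(ranked)  # 'disk_fail' (never seen in the cluster) ranks first
```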
IEEE Transactions on Cloud Computing, 2017
High performance computing systems comprising hundreds or thousands of computational nodes can generate a high volume of system log entries at a high data velocity. Analyzing these logs soon after they are generated is a significant challenge, due to the complexity of log messages, the speed at which they are produced, and the lack of a method to quickly map or categorize messages into meaningful sets. The impact of this problem is that it is not possible to comprehensively glean timely information from logs about the overall system or the health of individual nodes. In this paper, we address this problem through the development of a novel approach for system log analysis based on a Markov random field (MRF) that can quickly categorize system log messages into multiple categories based on representative training examples provided by a user. We present a theoretical model of our approach, followed by an extensive evaluation of the accuracy and performance of the implementation of our model. We found that our MRF-based approach can quickly categorize system log messages with a high degree of accuracy.
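One classical way to run MRF inference of this kind is iterated conditional modes (ICM). The toy sketch below is an assumption-laden illustration, not the paper's model: it uses a Jaccard-similarity unary term against user-provided examples and a pairwise term that rewards agreeing with similar neighboring messages.

```python
# Simplified MRF-style categorization via ICM sweeps; all data, weights,
# and the similarity measure are invented for illustration.
def jaccard(a, b):
    a, b = set(a.split()), set(b.split())
    return len(a & b) / max(len(a | b), 1)

examples = {  # representative training examples per category
    "memory": "correctable memory error detected",
    "network": "link down on port eth0",
}
messages = [
    "memory error corrected on dimm 3",
    "uncorrectable memory error",
    "port eth1 link down",
    "link flap detected on port eth0",
]

# Initial labels from the unary term alone.
labels = [max(examples, key=lambda c: jaccard(m, examples[c])) for m in messages]

# ICM sweeps: each message re-picks the category maximizing unary + pairwise.
for _ in range(3):
    for i, m in enumerate(messages):
        def score(cat):
            unary = jaccard(m, examples[cat])
            pairwise = sum(jaccard(m, messages[j])
                           for j in range(len(messages))
                           if j != i and labels[j] == cat)
            return unary + 0.5 * pairwise
        labels[i] = max(examples, key=score)

print(list(zip(messages, labels)))
```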
2009
Log preprocessing, a process applied to the raw log before applying a predictive method, is of paramount importance to failure prediction and diagnosis. While existing filtering methods have demonstrated good compression rates, they fail to preserve important failure patterns that are crucial for failure analysis. To address this problem, in this paper we present a log preprocessing method. It consists of three integrated steps: (1) event categorization, to uniformly classify system events and identify fatal events; (2) event filtering, to remove temporally and spatially redundant records while preserving failure patterns necessary for failure analysis; (3) causality-related filtering, to combine correlated events for filtering through Apriori association rule mining. We demonstrate the effectiveness of our preprocessing method using real failure logs collected from the Cray XT4 at ORNL and the Blue Gene/L system at SDSC. Experiments show that our method preserves more failure patterns for failure analysis, thereby improving failure prediction by up to 174%.
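A minimal sketch of the temporal/spatial filtering step (step 2), assuming simplified log records and an invented dedup window; the paper's actual thresholds and record format differ:

```python
# Drop repeats of the same event type within a fixed window, across nodes
# (spatial) and in time (temporal), while keeping distinct event types so
# failure patterns such as warning-then-failure survive filtering.
WINDOW = 60.0  # seconds; illustrative value

events = [  # (timestamp, node, event_type)
    (0.0,   "n1", "ECC_WARN"),
    (5.0,   "n1", "ECC_WARN"),   # temporal repeat on the same node -> drop
    (5.0,   "n2", "ECC_WARN"),   # spatial repeat within the window -> drop
    (120.0, "n1", "ECC_WARN"),   # outside the window -> keep
    (121.0, "n1", "NODE_FAIL"),  # different type -> keep (failure pattern)
]

last_seen = {}   # event_type -> timestamp of last kept occurrence (any node)
filtered = []
for ts, node, etype in sorted(events):
    prev = last_seen.get(etype)
    if prev is not None and ts - prev < WINDOW:
        continue                 # redundant in time (and across nodes)
    last_seen[etype] = ts
    filtered.append((ts, node, etype))

print(filtered)
```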
2009
Supercomputers are prone to frequent faults that adversely affect their performance, reliability and functionality. System logs collected on these systems are a valuable resource of information about their operational status and health. However, their massive size, complexity, and lack of standard format make it difficult to automatically extract information that can be used to improve system management. In this work we propose a novel method to succinctly represent the contents of supercomputing logs, by using textual clustering to automatically find the syntactic structures of log messages. This information is used to automatically classify messages into semantic groups via an online clustering algorithm. Further, we describe a methodology for using the temporal proximity between groups of log messages to identify correlated events in the system. We apply our proposed methods to two large, publicly available supercomputing logs and show that our technique features nearly perfect accuracy for online log-classification and extracts meaningful structural and temporal message patterns that can be used to improve the accuracy of other log analysis techniques.
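The clustering algorithm itself is not reproduced here; the heuristic sketch below only conveys how variable tokens can be masked to expose the syntactic structure (template) shared by messages. The regexes and sample messages are assumptions, not the paper's patterns.

```python
# Toy template extraction: mask numbers and node-id-looking tokens so that
# messages with the same fixed structure collapse into one group.
import re
from collections import defaultdict

raw = [
    "job 4417 started on node c3-0c1s2n1",
    "job 9001 started on node c1-0c0s7n3",
    "link error on port 12, retrying",
    "link error on port 7, retrying",
]

def template(msg):
    msg = re.sub(r"\bc\d+-\d+c\d+s\d+n\d+\b", "<NODE>", msg)  # Cray-style ids
    msg = re.sub(r"\d+", "<NUM>", msg)                        # plain numbers
    return msg

groups = defaultdict(list)
for msg in raw:
    groups[template(msg)].append(msg)

for tpl, members in groups.items():
    print(f"{len(members):2d}x  {tpl}")
```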
2002
The ability to track and analyze every possible fault condition, whether transient (soft) or permanent (hard), is one of the most critical requirements for large-scale cluster computer systems. All such events are generally termed "RAS events" (RAS for Reliability, Availability, and Serviceability).
Information and Software Technology, 2019
Context: A large amount of information about system behavior is stored in logs that record system changes. Such information can be exploited to discover anomalies of a system and the operations that cause them. Given their large size, manual inspection of logs is hard and infeasible within a desired timeframe (e.g., real time), especially for critical systems. Objective: This study proposes a semi-automated method for reconstructing sequences of tasks of a system, revealing system anomalies, and associating tasks and anomalies to code components. Method: The proposed approach uses unsupervised machine learning (Latent Dirichlet Allocation) to discover latent topics in messages of log events and introduces a novel technique based on pattern recognition to derive the semantics of such topics (topic labelling). The approach has been applied to the big data generated by the ALMA telescope system, consisting of more than 2,000 log events collected in about five hours of telescope operation. Results: With the application of our approach to such data, we were able to model the behavior of the telescope over 16 different observations. We found five different behavior models and three different types of errors. We use the models to interpret each error and discuss its cause. Conclusions: With this work, we have also been able to discuss some of the known challenges in log mining. The experience we gathered is summarized as lessons learned.
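For the topic-discovery step, a hedged sketch with scikit-learn's LatentDirichletAllocation; the event texts, topic count, and top-term printout are illustrative, and the paper's topic-labelling technique is not reproduced:

```python
# Discover latent topics in a handful of made-up telescope-style log events.
from sklearn.decomposition import LatentDirichletAllocation
from sklearn.feature_extraction.text import CountVectorizer

log_events = [
    "antenna az drive started tracking source",
    "antenna el drive started tracking source",
    "correlator buffer overflow, dropping frames",
    "correlator buffer resync completed",
    "antenna drive stopped, stow position reached",
]

vec = CountVectorizer()
X = vec.fit_transform(log_events)

lda = LatentDirichletAllocation(n_components=2, random_state=0)
lda.fit(X)

terms = vec.get_feature_names_out()
for k, comp in enumerate(lda.components_):
    top = comp.argsort()[-4:][::-1]   # four highest-weight terms per topic
    print(f"topic {k}:", ", ".join(terms[i] for i in top))
```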
ArXiv, 2020
Log-based analysis and troubleshooting has remained a prevalent and commonly used approach for centralized and time-sharing systems. However, for parallel and distributed systems, where happen-before relations are not directly available between events, it becomes a challenge to depend fully on log-based analysis. This article provides solutions using log-based performance analysis of centralized systems, demonstrates the results and their effectiveness, and presents the challenges and proposes solutions for performance analysis in distributed and parallel systems.
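As background on the happen-before problem the article raises (this is the standard Lamport logical-clock technique, not necessarily the article's proposed solution), logical clocks recover a partial ordering that raw wall-clock log timestamps cannot provide:

```python
# Lamport clocks: every event increments a counter; a receive advances the
# receiver's clock past the sender's stamp, so send happens-before receive.
class Process:
    def __init__(self, name):
        self.name, self.clock = name, 0

    def local_event(self, what):
        self.clock += 1
        print(f"[{self.name} @ {self.clock}] {what}")

    def send(self, what):
        self.clock += 1
        print(f"[{self.name} @ {self.clock}] send: {what}")
        return self.clock, what

    def receive(self, stamped):
        ts, what = stamped
        self.clock = max(self.clock, ts) + 1
        print(f"[{self.name} @ {self.clock}] recv: {what}")

p, q = Process("P"), Process("Q")
p.local_event("open file")
msg = p.send("request block")
q.local_event("unrelated work")
q.receive(msg)  # Q's clock jumps past P's send stamp
```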
IEEE Access, 2022
System logs are the first source of information available to system designers to analyze and troubleshoot their cluster systems. For example, High-Performance Computing (HPC) systems generate a large volume of heterogeneous data from multiple subsystems, so the idea of using a single source of data to achieve a given goal, such as identification of failures, is losing its validity. System log-analysis tools help system designers gain insight into a large volume of system logs. They enable system designers to perform various analyses (e.g., diagnosing or predicting node failures). Current system log-analysis tools vary significantly in their function and design. We conduct a systematic review of the literature on system log-analysis tools and select 46 representative articles out of 3,758 initial articles. To the best of our knowledge, no prior work has studied the characteristics of log-correlation tools (LogCTs) with respect to four quality attributes: (a) spurious correlations, (b) correlation threshold settings, (c) outliers in the data, and (d) missing data. In this paper, we (a) propose a quality model to evaluate LogCTs and (b) use this quality model to evaluate and recommend current LogCTs. Through our review, we (a) identify papers on LogCTs, (b) build a quality model consisting of the four quality attributes and (c) discuss several open challenges for future research. Our study highlights the advantages and limitations of existing LogCTs and identifies research opportunities that could facilitate better failure handling in large cluster systems.
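As a toy illustration of one of the four quality attributes (correlation threshold settings; the data, threshold, and event types below are invented), a log-correlation tool might correlate per-window event counts and flag a pair only above a chosen threshold:

```python
# Correlate hourly counts of two event types; a threshold set too low
# invites exactly the spurious correlations the quality model warns about.
import numpy as np

rng = np.random.default_rng(1)
ecc_errors = rng.poisson(3, size=48)               # hourly counts, 2 days
node_fails = (ecc_errors > 4).astype(float) + rng.normal(0, 0.3, size=48)

r = np.corrcoef(ecc_errors, node_fails)[0, 1]
THRESHOLD = 0.7
print(f"r = {r:.2f} ->", "correlated" if abs(r) >= THRESHOLD else "not flagged")
```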