FaultNet A Deep Convolutional Neural Network For B
FaultNet A Deep Convolutional Neural Network For B
fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/ACCESS.2021.3056944, IEEE Access
Date of publication xxxx 00, 0000, date of current version xxxx 00, 0000.
Digital Object Identifier 10.1109/ACCESS.2017.Doi Number
ABSTRACT The increased presence of advanced sensors on the production floors has led to the collection
of datasets that can provide significant insights into machine health. An important and reliable indicator of
machine health, vibration signal data can provide us a greater understanding of different faults occurring in
mechanical systems. In this work, we analyze vibration signal data of mechanical systems with bearings
by combining different signal processing methods and coupling them with machine learning techniques to
classify different types of bearing faults. We also highlight the importance of using different signal
processing methods and their effect on accuracy for bearing fault detection. Apart from the traditional
machine learning algorithms we also propose a convolutional neural network FaultNet which can
effectively determine the type of bearing fault with a high degree of accuracy. The distinguishing factor of
this work is the idea of channels proposed to extract more information from the signal, we have stacked the
‘Mean’ and ‘Median’ channels to raw signal to extract more useful features to classify the signals with
greater accuracy.
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/ACCESS.2021.3056944, IEEE Access
[6][7][8]. As determined by Zhang et al., for rotating machine are highly compatible and effective when addressing the
health monitoring, vibration signal is very important as it bearing fault diagnostics problem.
contains rich information regarding machine health [9]. Most of the deep learning architectures used for bearing fault
Therefore, the analysis of vibration data may help us in the diagnosis are based on Convolutional Neural Network
detection and prevention of faults in bearing. In their study, (CNN). Guo et al. propose a hierarchical adaptive deep
Samanta et al. used time domain statistical features extracted convolution network for bearing fault size prediction. In their
from vibrations signals to classify faults using an artificial paper, they convert the signal data into a 32x32 array and use
neural network[10]. This study was one of the earliest CNNs to accomplish the task. However, their work does not
attempts to utilize the capabilities of deep learning for bearing use other information available from signal data like
fault detection using vibration signal data. Apart from signal skewness, kurtosis, impulse factor, RMS value[16]. Another
featurization, some researchers have also used wavelet work done by Pham et al. proposes a method that converts the
decomposition to extract relevant information from the signal. signal data into its spectrogram which is then fed to VGG16
In a study conducted on wavelet transformation of vibrations for classification[17], [18]. In their paper, they used only four
signals for fault diagnosis, Sun W et al., use a combination of classes and achieved 98.8% accuracy. When compared to
discrete wavelet transforms and envelope analysis using their work, we achieved a comparable accuracy on 10 classes
which they extract the characteristic spectrum of rolling with computationally inexpensive architecture. Pan et al.,
bearing vibration data. Subsequently, a spectrum cross- employed 1D CNN and LSTM, in order to take advantage of
correlation coefficient is then applied to identify different the signal data, in their paper one-dimensional CNN and
operating conditions of rolling bearings[11]. Based on this LSTM are combined into one unified structure by using the
coefficient, different vibration signals are then classified. CNNs output as input to the LSTM to identify the bearing
Building upon the previous works, we use different signal fault types[19]. They also compare the usage of nine different
featurization methods to extract 14 features from the raw featurization techniques and using them with different
vibration signals to classify bearing faults using machine traditional machine learning algorithms. However, Pan et al.,
learning and deep learning approaches. In order to do not use stacked median and mean channels in their work
comprehensively analyze the signal data, we also and use a more computationally heavy framework by
implemented wavelet decomposition on the raw signal and combining the CNN and LSTM approaches. Most of the
couple it with machine learning approaches to evaluate its state-of-the-art works report an accuracy of more than 98% in
performance for bearing fault classification. With the bearing fault detection. Guo et al., in their paper, used
motivation of developing a generalized model, we evaluate Stacked denoising Autoencoders have obtained an accuracy
our machine learning and deep learning approaches on two of 99.83%. However, they separate the data as per the size of
major publicly available datasets for bearing fault the fault and then make predictions and have only 4 classes in
classification. their predictions[20]. As the dataset created by the CWRU
The first dataset that we analyze has been developed by Case bearing center is able to mimic the actual operating
Western Reserve University (CWRU) bearing center[12]. conditions the dataset contains some noisy signals as is
The dataset from the CWRU bearing center will be referred expected in the actual environment. Therefore, the use of
to as the CWRU dataset throughout the paper. The CWRU SDAE has been made particularly by researchers to make
dataset is one of the important datasets in this research area their predictions more resilient to the noise in the
and has been widely used by researchers to benchmark the dataset[21][22]. Another approach used by Li et al. combines
performance of their models. In their study, Smith et al., have the convolutional neural network and Dempster-Shafer
proposed a benchmark for the CWRU dataset using three theory-based evidence fusion. In their work, they demonstrate
different techniques. They have carefully analyzed the adaptability to different loads and report an accuracy of
different ball faults and compared the signal data amongst the 98.92% [23]. LiftingNet by Pan et al. proposes split, predict
faults. Thus, articulating the difference among signals data and, update blocks that are accurately able to predict the
when different types of fault occur. However, they do not use bearing faults and are adaptable to different motor speeds and
the signal featurization techniques that we have employed and loads. However, their approach is not able to gauge the size
don’t use any deep learning models[13]. Many researchers of the fault[24]. Our FaultNet can not only predict the type of
recently have used different deep learning models on the fault but also the size of the fault based on the input signal.
CWRU bearing dataset. In their review paper, Zhang et al., Wang et al. propose the creation of a time-frequency image of
have compiled a comprehensive list of different methods used the signal and classifying them with AlexNet based
by researchers working in this area. Based on their review of architecture[25][26]. When compared to AlexNet which has 5
different methods, it is evident that the best performing deep convolutional layers ours is a relatively inexpensive
learning models have accuracies in the range of (97%- architecture computationally. Roy et al. propose an
99%)[14]. Another recent review paper by Neupane et al., autocorrelation-based methodology for feature extraction
also discusses different bearing fault classification datasets, from a raw signal and then use the random forest classifier for
signal feature extraction techniques, and some of the highly fault classification. They achieve comparable accuracies to
accurate deep learning architectures[15]. Based on both the the deep learning methods discussed earlier[27].
review papers we can conclude that deep learning methods
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/ACCESS.2021.3056944, IEEE Access
The second dataset that we have considered in this work is II. DATASET PREPROCESSING
the Paderborn University Data Center bearing dataset[28].
From here onwards, the dataset from Paderborn University A. Case Western Reserve University Bearing Dataset
will be referred to as the Paderborn dataset throughout the The test rig to generate the dataset consists of a 2 hp electric
paper. The dataset has vibration as well as motor current motor to the left, driving a shaft on which a torque transducer
signal captured on the test-rig. In the paper proposing the and encoder are mounted in the middle coupled to a
dataset, to extract the features, Fast Fourier Transform (FFT) dynamometer in the right. The torque is applied to the shaft
and power spectral density (PSD) are performed on vibration via a dynamometer and electronic control system. The test
and motor current signal. After feature extraction and feature rig also includes bearings at both the drive end (DE) and fan
selection,18 features emerge for motor current signals, and 15 end (FE) of the motor. The bearing at the DE and FE are
features are extracted for the vibration signal data[29]. Using 6205-2RS JEM and 6203-2RS JEM, respectively. The 6205-
conventional machine learning approaches, Karatzinis et al, bearing used for data collection is a Single Row Deep
achieved the highest accuracy of 98%. However, they do not Groove Radial Ball Bearing with an inner diameter of 25mm,
use advanced deep learning techniques which may possibly an outer diameter of 52mm, and 15mm in width. To collect
increase the accuracy. In another study Zhong et al., the vibration signal data single point faults were artificially
transform the signal using Short-time Fourier Transform
induced using electro-discharge machining (EDM) with fault
(STFT) and use CNN to classify the bearing fault [30]. On
diameters from 7 to 28 mils (0.18 to 0.71mm). The motor
the transformed signal domain, they apply CNN. The average
loads varied from 0 to 3 hp (approximate motor speeds of
accuracy achieved is 97.4%. Compared to their work, our
model yields better results by directly using the raw signal. 1720 to 1797 rpm). The vibration data was collected using
Bin Li et al., have implemented 1 dimensional CNN accelerometers, which were attached to the housing with
architecture and the best result achieved by them is 98.3 % magnetic bases. The data was collected with two sampling
accuracy in fault classification. However, they have not frequencies, one with 12,000 samples per second, and 48,000
explored the 2D CNN method to improve the results. In samples per second, and was processed using MATLAB®.
another study, Pandhare et al., have implemented 2D CNN In their study, the DE & FE bearing data for the normal (N),
for the bearing fault classification on the Paderborn inner race fault (IF), outer race fault (OF), and the rolling
dataset[31]. In their work, they have demonstrated 2D CNN element(ball) fault (BF) conditions was acquired for fault
on 3 different signal types – raw time domain signal, pattern classification where the fault diameters were selected
envelope spectrum, and spectrogram. The maximum accuracy to be 7 mils, 14 mils, and 21 mils.
achieved is with a spectrogram. For raw signals, the achieved TABLE I: CWRU BEARING HEALTH CONDITIONS AND CLASS LABELS
accuracy of 95% is slightly lower when compared to the other Health Fault size Total class
Condition (mm) dataset labels
studies. Another group of researchers, Wang et al, have
proposed a method to use 1D CNN as well as 2D CNN Normal - 280 0
together to predict the fault class in the Paderborn dataset ball fault 0.18 280 1
[32]. They have concatenated the 1D CNN output with 2D ball fault 0.36 280 2
CNN output before passing it on to a fully connected neural ball fault 0.53 280 3
network for classification. Their resultant accuracy for the inner race fault 0.18 280 4
classification task is 98.58%. However, their approach is inner race fault 0.36 280 5
computationally expensive and hence may not be very inner race fault 0.53 280 6
suitable for online deployment. outer race fault 0.18 280 7
In this paper, we propose FaultNet, a CNN based model to outer race fault 0.36 280 8
determine different types of bearing faults with high outer race fault 0.53 280 9
accuracy. The aim of this paper is to set a benchmark for Ten different conditions are investigated to verify the
bearing fault detection using conventional machine learning accuracy of the proposed method in consideration of multiple
algorithms and deep learning techniques on CWRU and fault patterns. The vibration signals of ten health conditions
Paderborn datasets. It is important to note that the base are referred to in table 1. In this paper, we used the data from
architecture for both the datasets is the same and the
the drive end of the test rig. The sampling frequency chosen
performance of FaultNet is not dataset specific, suggesting
is 48 kHz with the load condition being 2 HP at 1750 rpm.
wide applicability and deployability of the model to detect
To analyze and classify different bearing faults we do some
different types of bearing faults. We achieve state-of-the-art
accuracies for both datasets while proposing a different preprocessing steps on the dataset. The rotating speed of the
methodology to extract features from the data. We also study shaft is 1750 rpm and the sampling frequency is 48 kHz
different signal processing techniques and compare implies that approximately 1670 data points will be collected
accuracies of the traditional machine learning algorithms for one revolution. Out of 1670 data points, the first 35 points
when combining different types of signal features and our and last 35 points are ignored to account for the noise in the
own 2D CNN model. data. Thus, 467600 data points of each fault class are chosen
and divided into 280 samples, with 1670 data points. Finally,
we have 2800 samples with 10 different classes with 280
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/ACCESS.2021.3056944, IEEE Access
samples each. Further details, which introduce the test set-up paper, the features extracted from raw signal data include
and other data collected, can be found at the CWRU Bearing multiple time and time-frequency domain features. Some of
Data Center website. the statistical time domain features that we extract include
mean, variance, standard deviation, root mean square (RMS).
B. Paderborn University Dataset Moreover, features such as kurtosis and skewness are also
This dataset is generated using 32 bearings. The bearing type extracted as these signals are not stationary. In their paper,
used for this dataset generation is 6203, which stands for Caesarendra et al., give us some physical insights into the
Deep Groove Ball Bearings with dimensions (inner diameter, features as they report the approximate values of kurtosis and
outer diameter, and width) – 17X40X12mm. Out of 32, 6 skewness for a normal bearing to be 3 and 1 respectively
bearings are healthy, 12 bearings have artificially created [34]. Hence, for bearings that are faulty, we expect to have
defects and the remaining 14 bearings are naturally damaged. kurtosis and skewness values shifted from 3 and 1. Another
The artificial defects have been created by using drilling, important observation we made was that for faulty bearings,
EDM, and electric engraving machine. The artificial defects the bearing signal amplitude undergoes abrupt changes when
are produced on both, inner and outer race. The natural rolling elements pass over the defective region of the bearing.
damages are produced by accelerated lifetime tests. A These abrupt changes are responsible for disturbing the
detailed description can be found in the paper[28]. overall distribution of signal and therefore can act as an
important clue in detecting faulty bearings. Generally, the
Further, the bearings’ samples can be divided into 3 classes, value of kurtosis increases and skewness may change to the
healthy, inner race fault, and outer race fault. By this negative or positive side for faulty bearings. Apart from these
classification, there are 6 healthy bearings, 11 inner race fault features, dimensionless features such as crest factor, shape
bearings, and 12 outer race fault bearings. This amounts to factor, impulse factor are also extracted. The shape factor is
29 bearings in total. The Remaining 3 bearings are omitted affected by the shape but is independent of the dimension.
due to their nature of the fault. These 3 bearings have inner The crest factor is a measure of an impact when a rolling
as well as outer race fault. In the study conducted by element comes in contact with the raceway. Table 2
Paderborn University, the authors have classified these summarizes all the 14 features extracted from the raw signal
bearings on the grounds of the maximum contributing fault. If data along with their mathematical formulae used.
the inner race damage is more compared to the outer race, the TABLE II: FEATURES AND THE MATHEMATICAL FORMULAE USED TO
bearing is classified as inner race fault bearing. For the CALCULATE VALUES FOR EACH SIGNAL
current model, we used 29 bearings data which can be No Feature Formula
classified distinctively. The data set is generated with 1 Mean Mean =
multiple combinations of rpm, torque, and load. For the 2 Absolute Abs Mean =
purpose of this study, we use the following combination. mean
3 Maximum
N=1500 rpm, load torque=0.7 Nm and Radial force=1000 N.
4 Minimum
Each bearing is used 20 times to generate 20 signals with 5 Peak to Peak Maximum - Minimum
one fixed combination. The signal generated is a vibration 6 Absolute
signal for 4 sec with a sampling frequency of 64kHz. That max
means, in a signal, there are 256,000 data points. To avoid 7 Root Mean
initial and ending noise and disturbance, the sample signal is Square
clipped off for the first 1/16th part and the last 1/16th part.
Eventually, the signal used has 2,24,000 data points which 8 Variance
are used further for featurization. In total 2320 signals have
been used for classification.
9 Clearance
factor
III. FEATURIZATION
In every machine learning process, feature engineering plays
a very important role and can significantly affect the 10 Kurtosis
performance of an algorithm. Feature engineering can directly 11 Skewness
help the machine learning algorithm to identify the
underlying patterns and effectively improve the accuracy of
the model. For signal data, featurization includes deriving
different domains’ features from raw signals such as time 12 Impulse
domain, time-frequency domain, etc. The vibration signals Factor
from machinery components are in general considered to be
non-stationary. The non-stationary signals mean that the 13 Crest Factor
frequencies present in a signal vary with time [33]. Therefore,
it is important to extract features from the time domain as
14 Shape Factor
well as the time-frequency domain to capture the time-
varying nature of frequencies present in a signal. In this
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/ACCESS.2021.3056944, IEEE Access
Time-frequency domain representation methods such as CWRU dataset (figure 1(b)). Similar to CWRU multiple
short-time Fourier transform (STFT), wavelet transform, and combinations of different features
Wigner-Ville distribution (WVD) are commonly used for the
non-stationary or transient signal. These methods implement
a mapping of one-dimensional time-domain signals to a two-
dimensional function of time and frequency. The objective is
to provide a true time-frequency representation of a signal.
Similar, to the methods presented in the review [35]
conducted by Feng et al., on time-frequency analysis
methods for machinery fault diagnosis, we decomposed these
raw signals by employing wavelet decomposition package
(WPD) using Haar wavelet as a mother wavelet for the
extraction of time-frequency domain features. The wavelet
decomposed signal consists of approximation coefficients
and detailed coefficients. In this work, we use the
approximation coefficients as they are more sensitive
towards bearing conditions as suggested in [28] for the
extraction of statistical features mentioned in table II.
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/ACCESS.2021.3056944, IEEE Access
(b)
FIGURE 3. Classification accuracy using all 14 features on different
signal wavelet decomposition level (a) shows accuracy for CWRU
dataset. Decomposing signals at different result in slightly higher
accuracy. Also, for some ML algorithms decomposed signals gives
(a) better accuracy compared to raw signals.
Similarly (b) shows the results obtained for Paderborn University
dataset.
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/ACCESS.2021.3056944, IEEE Access
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/ACCESS.2021.3056944, IEEE Access
FIGURE 4. Convolutional Neural Network Architecture. 2D CNN architecture used for Paderborn University dataset. Different colors represent different
operations. There are 2 convolution and 2 max-pool layers along with fully connected neural network with 3 outputs. After each convolution layer, a
‘Batch-norm’ and ‘ReLU’ activation is applied.
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/ACCESS.2021.3056944, IEEE Access
FaultNet performs reasonably well with an accuracy of working exactly opposite to the image conversion from RGB
82.12% and 89.3% for CWRU and Paderborn respectively. to grayscale.
When compared with deep learning-based architecture by The novelty of this work is in the concise CNN structure,
Zhang et al., FaultNet achieves comparable accuracy within also, in the concept of augmenting 2D raw signal with its
1% for SNR values of 8 and 10 and considerably mean and median value channels to extract more meaningful
outperforms it when the SNR values are less than 2 on the features for CNN. We have demonstrated that the CNN
CWRU dataset [38]. We would like to note that CNN structure devised here improves upon previous methods and
proposed by Zhang et al., consists of 5 convolutional layers has a highly competitive performance compared with state-
whereas our lightweight architecture FaultNet only has 2 of-the-art methods. We believe this work can pave the way
convolutional layers, making it more suitable for an online for online fault detection in the case of bearings which could
industrial setting. FaultNet is able to achieve high accuracy be extremely beneficial for industries. Our approach can be
because of the novel way in which it is able to use extended to similar types of datasets.
information from signals through mean and median
channels.
TABLE IV: PERFORMANCE EVALUATION OF FAULTNET FOR NOISE
ROBUSTNESS ON CWRU DATASET
SNR RF SVC LR kNN MLP FaultNet
-4 75.16 67.14 68.75 66.25 73.64 82.12
-2 77.27 69.01 70.14 68.32 74.89 84.14
0 79.04 71.52 71.71 69.39 76.03 87.68
2 81.65 73.24 72.17 68.75 76.21 90.74
6 82.38 73.89 72.78 69.25 76.42 93.17
8 84.29 74 74.07 69.10 76.62 96.21
10 85.62 74.78 74.32 69.32 77 97.77
VIII. CONCLUSION
In this paper, a systematic approach towards a data-driven
vibration-based diagnosis of faults in rolling element bearings
is demonstrated. We have benchmarked the performance of
different machine learning algorithms by using the featurized
signal data and deep learning approaches for the CWRU and
Paderborn datasets. Five-fold accuracies of ~99% are
obtained for both the datasets indicating the state-of-the-art
performance is achieved by the FaultNet architecture. For
classification, it is important to have enough differentiating
features between classes. As we stack a greater number of
channels, the algorithm is able to extract more features
compared to single-channel input. Each additional channel
could be considered as a feature map of the input which
provides more information about the input. This is analogous
to grayscale and RGB images. If we convert an RGB image
to a grayscale image, often, there happens to be information
loss leading to poor performance [39]. Hence, adding more
information improves the performance of our model by
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/ACCESS.2021.3056944, IEEE Access
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/ACCESS.2021.3056944, IEEE Access
Applied Soft Computing, vol. 73, pp. 898–913, Dec. [32] D. Wang, Q. Guo, Y. Song, S. Gao, and Y. Li,
2018, doi: 10.1016/j.asoc.2018.09.037. “Application of Multiscale Learning Neural Network
[23] S. Li, G. Liu, X. Tang, J. Lu, and J. Hu, “An Based on CNN in Bearing Fault Diagnosis,” Journal
Ensemble Deep Convolutional Neural Network of Signal Processing Systems, vol. 91, no. 10, pp.
Model with Improved D-S Evidence Fusion for 1205–1217, Oct. 2019, doi: 10.1007/s11265-019-
Bearing Fault Diagnosis,” Sensors (Basel), vol. 17, 01461-w.
no. 8, p. 1729, Jul. 2017, doi: 10.3390/s17081729. [33] James Kuria Kimotho, Walter Sextro, “An approach
[24] J. Pan, Y. Zi, J. Chen, Z. Zhou, and B. Wang, for feature extraction and selection from non-trending
“LiftingNet: A Novel Deep Learning Network With data for machinery prognosis,” Second European
Layerwise Feature Learning From Noisy Mechanical Conference of the Prognostics and Health
Data for Fault Classification,” IEEE Transactions on Management Society 2014, 2014.
Industrial Electronics, vol. 65, no. 6, pp. 4973–4982, [34] W. Caesarendra and T. Tjahjowidodo, “A Review of
Jun. 2018, doi: 10.1109/TIE.2017.2767540. Feature Extraction Methods in Vibration-Based
[25] J. Wang, Z. Mo, H. Zhang, and Q. Miao, “A Deep Condition Monitoring and Its Application for
Learning Method for Bearing Fault Diagnosis Based Degradation Trend Estimation of Low-Speed Slew
on Time-Frequency Image,” IEEE Access, vol. 7, pp. Bearing,” Machines, vol. 5, no. 4, p. 21, Sep. 2017.
42373–42383, 2019, doi: [35] Z. Feng, M. Liang, and F. Chu, “Recent advances in
10.1109/ACCESS.2019.2907131. time–frequency analysis methods for machinery fault
[26] A. Krizhevsky, I. Sutskever, and G. E. Hinton, diagnosis: A review with application examples,”
“ImageNet Classification with Deep Convolutional Mechanical Systems and Signal Processing, vol. 38,
Neural Networks,” in Advances in Neural no. 1, pp. 165–205, Jul. 2013, doi:
Information Processing Systems 25, F. Pereira, C. J. 10.1016/j.ymssp.2013.01.017.
C. Burges, L. Bottou, and K. Q. Weinberger, Eds. [36] Vinod Nair, Geoffrey E. Hinton, “Rectified Linear
Curran Associates, Inc., 2012, pp. 1097–1105. Units Improve Restricted Boltzmann Machines,”
[27] S. S. Roy, S. Dey, and S. Chatterjee, ICML, 2006.
“Autocorrelation Aided Random Forest Classifier- [37] Bridle J.S., “Probabilistic Interpretation of
Based Bearing Fault Detection Framework,” IEEE Feedforward Classification Network Outputs, with
Sensors Journal, vol. 20, no. 18, pp. 10792–10800, Relationships to Statistical Pattern Recognition,”
2020, doi: 10.1109/JSEN.2020.2995109. Soulié F.F., Hérault J. (eds) Neurocomputing. NATO
[28] Christian Lessmeier, James Kuria Kimotho, Detmar ASI Series (Series F: Computer and Systems
Zimmer, Walter Sextro, “Condition Monitoring of Sciences), vol 68. Springer, Berlin, Heidelberg, 1990,
Bearing Damage in Electromechanical Drive Systems doi: https://doi.org/10.1007/978-3-642-76153-9_28.
by Using Motor Current Signals of Electric Motors: [38] A. Zhang, S. Li, Y. Cui, W. Yang, R. Dong and J.Hu,
A Benchmark Data Set for Data-Driven “Limited Data Rolling Bearing Fault Diagnosis With
Classification,” Conference: European Conference of Few-Shot Learning,” in IEEE Access Vol. 7, pp.
the Prognostics and Health Management Society, Jul. 110895-110904, 2019, doi:
2016. 10.1109/ACCESS.2019.2934233
[29] G. Karatzinis, Y. S. Boutalis, and Y. L. Karnavas, [39] C. Lau, W. Heidrich, and R. Mantiuk, “Cluster-based
“Motor Fault Detection and Diagnosis Using Fuzzy color space optimizations,” in 2011 International
Cognitive Networks with Functional Weights,” in Conference on Computer Vision, Nov. 2011, pp.
2018 26th Mediterranean Conference on Control and 1172–1179, doi: 10.1109/ICCV.2011.6126366.
Automation (MED), Jun. 2018, pp. 709–714, doi:
10.1109/MED.2018.8443043.
[30] D. Zhong, W. Guo, and D. He, “An Intelligent Fault
Diagnosis Method based on STFT and Convolutional
Neural Network for Bearings Under Variable
Working Conditions,” in 2019 Prognostics and
System Health Management Conference (PHM-
Qingdao), Oct. 2019, pp. 1–6, doi: 10.1109/PHM-
Qingdao46334.2019.8943026.
[31] V. Pandhare, J. Singh, and J. Lee, “Convolutional
Neural Network Based Rolling-Element Bearing
Fault Diagnosis for Naturally Occurring and
Progressing Defects Using Time-Frequency Domain
Features,” in 2019 Prognostics and System Health
Management Conference (PHM-Paris), May 2019,
pp. 320–326, doi: 10.1109/PHM-Paris.2019.00061.
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/ACCESS.2021.3056944, IEEE Access
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/