Infrastructures 10 00003
Infrastructures 10 00003
Abstract: Developing reliable railway fault detection systems is crucial for ensuring both
safety and operational efficiency. Various artificial intelligence frameworks, especially deep
learning models, have shown significant potential in enhancing fault detection within
railway infrastructure. This study explores the application of deep learning models for
railway fault detection, focusing on both transfer learning architectures and a novel classifi-
cation framework. Transfer learning was utilized with architectures such as ResNet50V2,
Xception, VGG16, MobileNet, and InceptionV3, which were fine-tuned to classify railway
Academic Editor: Maciej Kruszyna track images into defective and non-defective categories. Additionally, the state-of-the-art
Received: 10 November 2024 YOLOv11 model was adapted for the same classification task, leveraging advanced data
Revised: 10 December 2024 augmentation techniques to achieve high accuracy. Among the transfer learning models,
Accepted: 24 December 2024 VGG16 demonstrated the best performance with a test accuracy of 89.18%. However,
Published: 28 December 2024
YOLOv11 surpassed all models, achieving a test accuracy of 92.64% while maintaining
Citation: Rodríguez-Abreo, O.; significantly lower computational demands. These findings underscore the versatility of
Quiroz-Juárez, M.A.; Macías-Socarras,
deep learning models and highlight the potential of YOLOv11 as an efficient and accurate
I.; Rodríguez-Reséndiz, J.; Camacho-
solution for railway fault classification tasks.
Pérez, J.M.; Carcedo-Rodríguez, G.;
Camacho-Pérez, E. Automatic
Detection of Railway Faults Using Keywords: deep learning; transfer learning; YOLOv11; railway fault detection; safety;
Neural Networks: A Comparative maintenance
Study of Transfer Learning Models
and YOLOv11. Infrastructures 2025, 10,
3. [Link]
infrastructures10010003 1. Introduction
Copyright: © 2024 by the authors. Railway infrastructure is essential for global economic and sustainable development,
Licensee MDPI, Basel, Switzerland. providing efficient transport for goods, people, and services while contributing to reduced
This article is an open access article
carbon emissions. A critical component of maintaining this infrastructure is Structural
distributed under the terms and
Health Monitoring (SHM), which evaluates the performance of various subsystems and
conditions of the Creative Commons
Attribution (CC BY) license
enables the early detection of faults to prevent major damage and costly shutdowns.
([Link] Railway systems operate in diverse environments and are exposed to threats posed by the
licenses/by/4.0/). time, location, and weather conditions. Cracks and poor track conditions are among the
tracks using a deep classifier network and 2-D Singular Spectrum Analysis, achieving 85.2%
accuracy under varying illumination conditions. Thendral et al. [17] introduced a computer
vision-based crack detection system using Gabor transforms and neural networks, attaining
94.9% accuracy. Wang et al. [18] proposed the YOLOv5s-VF network for detecting rail
surface defects, featuring a sharpening attention mechanism and adaptive spatial feature
fusion, resulting in 93.5% accuracy and a 114.9 fps detection speed. Jiang et al. [19] leveraged
MobileNet and YOLO-inspired architectures for real-time defect detection, demonstrating
the method’s potential in industrial applications. Dequiang et al. [20] enhanced obstacle
detection with an improved YOLOv4 network, optimizing performance for medium- and
long-distance obstacles with an mAP of 93% on NVIDIA Jetson AGX.
Jassada et al. [21] employed deep learning models, including CNN and RNN, to
detect and evaluate the severity of combined rail defects, achieving 99% accuracy for both
settlement and dipped joint detection, with CNN providing high accuracy for the dipped
joint severity and RNN for the settlement severity. These studies collectively highlight
the advancements and diverse methodologies in railway fault detection, emphasizing
the importance of automation and real-time monitoring to ensure safety and efficiency in
railway systems.
The YOLOv11 model implementation in our study offers an advanced approach for
automated fault detection in railways, especially in tasks that require real-time processing
and high accuracy. In contrast to previous versions such as YOLOv4 or YOLOv5, which
have been adapted for the detection of defects in specific track elements such as rail
fasteners, our model was optimized with data augmentation techniques and lightweight
configurations to address a wide range of structural defects and railway contexts. In
comparison, Brintha et al. [22] describe the FOD-YOLO model that specializes in detecting
objects and defects in fasteners, achieving an accuracy of 98.14% and an mAP of 96.85%,
thanks to the incorporation of modules such as SE-CSPDarkNet53 and SE-EfficientNet,
designed to improve attention to specific features and the processing efficiency. Although
both approaches employ YOLO-based architectures, our implementation of YOLOv11
stands out by integrating an enhanced generalization capability, with a test accuracy of
92.64% and a lightweight framework of 2.6 million parameters, facilitating its deployment
in low-power devices and diverse operating environments.
This study aims to develop a robust and reliable system for railway fault detection,
specifically targeting critical issues like track cracks through advanced image analysis with
transfer learning. By focusing on image-based analysis, this approach provides an efficient
solution for real-time track condition monitoring, enabling the early detection of structural
defects that could lead to derailments or costly shutdowns.
previous stage. In this step, the upper layers of the pre-trained models were unlocked
to allow their weights to be specifically adjusted to the rail dataset, optimizing the final
accuracy. Through fine-tuning, the VGG16, MobileNet, and InceptionV3 models achieved
adequate performance for real-world applications, providing accuracy and reliability in
detecting tracks in poor conditions.
While the current dataset is diverse, the inclusion of railway images captured under
varying climatic conditions is recommended for future research. This enhancement would
Infrastructures 2025, 10, 3 5 of 17
further validate the adaptability and robustness of the proposed models across a wider
range of real-world scenarios.
2.2. Stage 2: Architecture Definition and Model Selection for Transfer Learning
A transfer learning process was performed using the following architectures:
ResNet50V2, Xception, VGG16, MobileNet, and InceptionV3. All of these architectures
were accessed through the Keras Applications module in Python. The models utilized
pre-trained weights from the ImageNet dataset, which are readily available within the
Keras Applications module. These pre-trained weights provided a strong starting point for
the models, enabling them to leverage prior knowledge and adapt efficiently to the task of
railway fault detection.
Table 2 describes each pre-trained model along with the performance metrics on the
ImageNet database. This information was sourced from the Keras documentation [27]. As
explained in [27], the accuracy values represent the models’ performance on the ImageNet
validation dataset, while the time per inference step was calculated as the average over
30 batches, each with a size of 32, repeated 10 times. The hardware specifications used
included an AMD EPYC Processor (with IBPB) with 92 cores, 1.7 TB of RAM, and a Tesla
A100 GPU.
Table 2. Performance, on Image-Net, of pre-trained models considered for the transfer learning.
From these pre-trained models, their fully connected top layers were not used. Instead,
a custom model was built on top of these pre-trained models. This custom model consisted
of three layers: a flatten layer, a fully connected layer, and an output layer. Additionally, an
input layer was implemented before the pre-trained models. Figure 3 illustrates the final
structure of the defined models.
Infrastructures 2025, 10, 3 7 of 17
Figure 3. The diagram illustrates the overall structure of the deep learning model used for railway
track fault detection.
As a result, five models were generated, with the accuracy and loss values for both the
training and validation data recorded for subsequent analysis. Each model was evaluated
using a sample of 231 images.
During the training phase, the pre-trained model embedded within each defined
architecture was blocked, meaning its weights remained unchanged throughout the train-
ing process.
The number of epochs for fine-tuning was set to approximately half of the epochs
used for training in Stage 4 for all models, except MobileNet and VGG16. For these two
models, the number of epochs was increased beyond half because they achieved better
results in Stage 4. More details on this can be found in the Section 3.
The models were fine-tuned using the augmented training dataset (see Table 1), vali-
dated on the same set of 477 images, and tested on a sample of 231 images, similarly to the
setup in Stage 4.
The training and test results were recorded to evaluate whether the fine-tuning process
positively impacted model performance.
3. Results
3.1. Analysis of Stage 3: Initial Training and Overfitting
To analyze the performance of the models, the test results were compared with the
learning metrics (accuracy and loss) obtained during training. Figure 4 summarizes the
learning process of each model from Stage 3. The final values of accuracy and loss recorded
at the last epoch for each model are presented in Table 4.
According to Figure 4, all the models exhibited signs of overfitting, as evidenced by
the gap between the validation and training curves for both accuracy and loss. Table 4
further confirms this, with test accuracy values being significantly lower than the training
accuracy values recorded at the last epoch.
While most models achieved acceptable training accuracy values of over 85% (except
ResNet50V2), the poor performance on validation and test data indicated that these models
were not suitable for real-world applications in their current form.
Infrastructures 2025, 10, 3 10 of 17
Table 4. Training, validation, and testing results per stage for each model.
Models Training Loss Validation Loss Training Accuracy Validation Accuracy Test Accuracy
ResNet50V2 0.8003 4.0292 0.8154 0.6541 59.74
Xception 0.1221 0.6924 0.9618 0.7442 73.16
Stage 3 VGG16 0.1641 0.3751 0.9341 0.8491 77.06
MobileNet 0.2407 0.4387 0.8985 0.8029 79.65
InceptionV3 0.2381 0.9418 0.9038 0.6646 64.07
ResNet50V2 1.4049 1.1115 0.7121 0.7400 69.70
Xception 0.4775 0.6022 0.7615 0.7191 66.23
Stage 4 VGG16 0.2009 0.2764 0.9103 0.9140 85.71
MobileNet 0.3216 0.3953 0.8543 0.8176 80.09
InceptionV3 0.5548 0.5729 0.7050 0.6897 67.97
ResNet50V2 0.4190 1.2387 0.8131 0.7044 58.44
Xception 0.3942 0.4959 0.8153 0.7547 77.49
Stage 5 VGG16 0.0539 0.1747 0.9789 0.9560 89.18
MobileNet 0.2036 0.2903 0.9100 0.8826 85.71
InceptionV3 0.2591 0.3838 0.8856 0.8407 85.71
The results for the Stage 4 models are presented in Table 4 and Figure 5. As shown in
Figure 5, the validation and training curves tended to converge, with no significant gap,
indicating a reduction in overfitting. This conclusion is further supported by the closer
alignment between the test, training, and validation accuracy values in Table 4.
Despite improvements, the training accuracy for ResNet50V2, Xception, and Incep-
tionV3 remained below 85%, which is generally insufficient for real-world applications. On
the other hand, VGG16 and MobileNet showed promising performance, achieving over
90% accuracy during training and validation.
Table 4 summarizes the final loss and accuracy values for each model. The most
notable observation is the consistent improvement in VGG16’s performance, achieving the
highest test accuracy of 89.18% after fine-tuning in Stage 5. MobileNet and InceptionV3
also demonstrated robust performance, with test accuracies of 85.71%, positioning them
as reliable alternatives for deployment. Noteworthy as well is the marked reduction in
overfitting observed in Stage 4 for all models, evidenced by the convergence of training
and validation metrics. These trends highlight the effectiveness of data augmentation and
fine-tuning in enhancing the generalization capabilities of the models.
3.4. Performance of the YOLOv11 Model
The loss curves for the YOLOv11 model demonstrated consistent convergence without
any signs of overfitting, indicating that the training process was well regularized and robust,
as shown in Figure 7. While the accuracy curves did not fully converge, the stable behavior
of the loss curves suggested that the model remained viable for practical applications,
particularly in scenarios requiring reliable and efficient detection.
At the end of the training process, the YOLOv11 model achieved final accuracy values
of 99.87% on the original training dataset, 96.24% on the validation dataset, and 92.64%
on the test dataset. These results surpassed those obtained by all other models in this
study, including VGG16, which had previously shown the highest accuracy. The superior
performance of YOLOv11, especially on the validation and test datasets, underscores its
effectiveness in detecting railway faults with high precision.
In addition to its accuracy, YOLOv11 offered significant computational advantages.
The nano version of the model, YOLOv11n, contains only 2.6 million parameters, which is
considerably fewer than VGG16 (138.4 million parameters) and even less than MobileNetV2
(3.5 million parameters). This reduced parameter count translates to lower computational
Infrastructures 2025, 10, 3 13 of 17
Figure 7. Training and validation loss and accuracy curves for the YOLOv11 model.
Table 5. Comparison of the best transfer learning model (VGG16) and YOLOv11n.
This balance between high accuracy and a reduced computational demand positions
YOLOv11n as the most viable option for real-time railway fault detection, while still
acknowledging the strong performance of VGG16.
4. Discussion
In this study, a binary classification approach was utilized to distinguish between
defective and non-defective railway tracks as a foundational step toward automated fault
detection. This simplification enabled an efficient method for detecting structural defects,
particularly cracks, which represent a significant safety concern. However, this approach
Infrastructures 2025, 10, 3 14 of 17
inherently limits the ability to identify a broader spectrum of defect types, such as fastener
issues and wear patterns, which are critical for comprehensive railway maintenance.
The dataset primarily included images labeled as either cracked or non-defective,
which constrained the capacity of deep learning models such as VGG16 and MobileNet
to generalize beyond these categories. While fine-tuning improved model performance
significantly, achieving an accuracy above 89% in VGG16, the absence of diverse labeled ex-
amples for other defect types impeded the exploration of more granular classification tasks.
Future work will focus on expanding the dataset to incorporate a wider variety of defect
categories and implementing multiclass classification frameworks. These advancements
aim to provide a deeper understanding of defect characteristics and enhance the practical
utility of fault detection systems in real-time scenarios.
The incorporation of YOLOv11 in this study demonstrated its ability to surpass trans-
fer learning models in both accuracy and computational efficiency. By leveraging advanced
data augmentation techniques and a lightweight architecture with only 2.6 million pa-
rameters, YOLOv11 achieved a test accuracy of 92.64%, outperforming VGG16 (89.18%).
This indicates that models specifically designed for efficient image processing, such as
YOLOv11, can provide robust performance for classification tasks while requiring fewer
computational resources, making them more suitable for practical deployment in con-
strained environments.
Compared with other studies on railway fault detection, our findings support the
growing consensus that deep learning models, including transfer learning architectures
and specialized frameworks like YOLOv11, can effectively enhance fault detection accuracy
and efficiency. Previous research, such as the study by Gibert et al. [28], utilized a fully
convolutional DCNN for material classification and semantic segmentation in railway
images. In contrast, our methodology focused on transfer learning with pre-trained models
like ResNet50V2, Xception, and VGG16, as well as adapting YOLOv11 for classification
tasks. While Gibert et al. achieved 93.35% accuracy in material classification, our approach
using YOLOv11 demonstrated that advanced frameworks can achieve similar or better
performance with a lower computational overhead.
Compared to Chandran et al. [29], who combined image processing with CNN and
ResNet-50 models to detect missing railway fasteners, our study compared transfer learn-
ing and YOLOv11 to classify railway tracks into defective and non-defective categories.
YOLOv11, in particular, achieved superior accuracy with a more efficient architecture, high-
lighting its suitability for real-world deployment. Similarly, Yilmazer and Karakose [30]
achieved high accuracy for fastener defect detection using drone-acquired images and
ResNet-based models. Our study, relying on ground-level imagery and simpler setups,
demonstrated comparable accuracy, underscoring the versatility of YOLOv11 for classifica-
tion tasks across diverse imaging methods.
In a similar study, Yilmazer and Karakose [30] utilized an autonomous drone to collect
aerial railway images, applying Faster R-CNN for object detection and ResNet101v2 for
defect classification. Their approach achieved an accuracy of 95% for fastener defects and
98% for rail surface defects, benefiting from drone-acquired stability and video preprocess-
ing. In contrast, our approach, which used ground-level images, demonstrated comparable
accuracy in fault detection, particularly for fastener issues, by optimizing transfer learning
strategies without requiring aerial imaging setups. This highlights that model fine-tuning
can yield effective results across diverse data collection methods.
Studies such as those by Guo et al. [31] and Min et al. [32] focused on YOLO-based ob-
ject detection frameworks optimized for real-time applications. While Guo et al. achieved
a mean average precision of 94.4% with YOLOv4, and Min et al. achieved 96.1% accuracy
with an improved YOLOX, their methodologies prioritized real-time performance. In con-
Infrastructures 2025, 10, 3 15 of 17
trast, our adaptation of YOLOv11 for classification highlights its effectiveness in scenarios
where computational efficiency and accuracy are prioritized over the real-time speed.
Overall, these comparisons emphasize the complementary strengths of transfer learn-
ing models and frameworks like YOLOv11. While transfer learning provides a practical and
adaptable approach for fault detection across multiple defect types, YOLOv11’s lightweight
architecture and high accuracy offer an efficient alternative for real-world deployment in
constrained environments. Future work should explore integrating multiclass classifica-
tion capabilities into YOLO-based frameworks and transfer learning models to address a
broader range of railway defects with enhanced precision.
5. Conclusions
This research has demonstrated the effectiveness of using deep learning architectures,
including both transfer learning models and specialized frameworks like YOLOv11, in the
development of an automated and reliable system for railway defect detection, a critical
area for safety and operational efficiency. A methodology involving a combination of multi-
phase training and a controlled fine-tuning process was applied to the selected models,
enabling the identification of optimal architectures for the successful classification of rail de-
fects while minimizing the training time. Notable transfer learning models include VGG16,
MobileNet, and InceptionV3. Data augmentation and a fine-tuning process were essential
to improve model generalization and mitigate overfitting. The high accuracy achieved
by VGG16 underscores its feasibility for real-world applications in railway monitoring
systems, while MobileNet and InceptionV3 also demonstrated strong performance.
The inclusion of YOLOv11 further highlighted its potential as a state-of-the-art frame-
work for railway fault classification. The model achieved a test accuracy of 92.64%, surpass-
ing the best results obtained by VGG16 and other deep learning models used in this research.
Additionally, the nano version of YOLOv11, with only 2.6 million parameters, proved to
be the most computationally efficient architecture, requiring significantly fewer resources
while maintaining superior performance. These results position YOLOv11 as the most
viable option for real-world applications, particularly in scenarios where computational
resources are limited, such as in edge devices or embedded systems.
Despite some limitations, such as the constrained dataset focused on the binary clas-
sification of defects, our proposed approach confirms the potential of deep learning for
monitoring railway infrastructure. Expanding the dataset to include a broader range of
defect categories and implementing multiclass classification frameworks could further
enhance the system’s practical utility. Future work should also focus on refining YOLOv11’s
training processes, exploring real-time implementation, and optimizing deployment on
embedded systems to improve the scalability and efficiency. These advancements would
contribute significantly to ensuring railway safety and operational reliability while reducing
maintenance costs through predictive fault detection.
Author Contributions: Conceptualization, O.R.-A., E.C.-P. and G.C.-R.; methodology, O.R.-A., J.M.C.-P.
and I.M.-S.; software, O.R.-A., E.C.-P. and G.C.-R.; validation, O.R.-A., M.A.Q.-J., E.C.-P. and I.M.-S.;
formal analysis, O.R.-A., E.C.-P. and I.M.-S.; investigation, O.R.-A., E.C.-P. and I.M.-S.; resources,
O.R.-A., M.A.Q.-J., E.C.-P. and G.C.-R.; data curation, J.M.C.-P., E.C.-P. and G.C.-R.; writing—original
draft preparation, O.R.-A., E.C.-P. and I.M.-S.; writing—review and editing, O.R.-A., E.C.-P., M.A.Q.-J.
and I.M.-S.; visualization, O.R.-A., E.C.-P. and I.M.-S.; supervision, M.A.Q.-J., J.R.-R., O.R.-A., E.C.-P.
and I.M.-S.; project administration, O.R.-A., E.C.-P. and I.M.-S.; funding acquisition, O.R.-A., J.R.-R.,
M.A.Q.-J., J.M.C.-P., E.C.-P. and I.M.-S. All authors have read and agreed to the published version of
the manuscript.
Infrastructures 2025, 10, 3 16 of 17
Funding: This work was funded by the Consejo Nacional de Humanidades, Ciencias y Tecnologías
(CONAHCyT) of Mexico under Grant CF-2023-I-1496 and the Dirección General de Asuntos del
Personal Académico (DGAPA), National Autonomous University of Mexico (UNAM), under Project
UNAMPAPIIT TA101023 and the postdoctoral fellowship DGAPA-UNAM for O.R.-A.
Conflicts of Interest: The authors declare that they have no known competing financial interests or
personal relationships that could have appeared to influence the work reported in this paper.
References
1. Rampriya, R.; Jenefa, A.; Prathiba, S.B.; Julus, L.J.; Selvaraj, A.K.; Rodrigues, J.J. Fault Detection and Semantic Segmentation on
Railway Track Using Deep Fusion Model. IEEE Access 2024, 12, 136183–136201. [CrossRef]
2. Zheng, D.; Li, L.; Zheng, S.; Chai, X.; Zhao, S.; Tong, Q.; Wang, J.; Guo, L. A Defect Detection Method for Rail Surface and
Fasteners Based on Deep Convolutional Neural Network. Comput. Intell. Neurosci. 2021, 2021, 2565500. [CrossRef] [PubMed]
3. Shafique, R.; Siddiqui, H.U.R.; Rustam, F.; Ullah, S.; Siddique, M.A.; Lee, E.; Ashraf, I.; Dudley, S. A novel approach to railway
track faults detection using acoustic analysis. Sensors 2021, 21, 6221. [CrossRef]
4. Rifat, A.; Pandao, P.; Babu, B.S. Solar powered fault detection system for railway tracks. Eur. J. Electr. Eng. Comput. Sci. 2022,
6, 39–43. [CrossRef]
5. Gálvez, A.; Diez-Olivan, A.; Seneviratne, D.; Galar, D. Fault detection and RUL estimation for railway HVAC systems using a
hybrid model-based approach. Sustainability 2021, 13, 6828. [CrossRef]
6. Liu, Q.; Liang, T.; Dinavahi, V. Real-time hierarchical neural network based fault detection and isolation for high-speed railway
system under hybrid AC/DC grid. IEEE Trans. Power Deliv. 2020, 35, 2853–2864. [CrossRef]
7. Ghosh, C.; Verma, A.; Verma, P. Real time fault detection in railway tracks using Fast Fourier Transformation and Discrete
Wavelet Transformation. Int. J. Inf. Technol. 2022, 14, 31–40. [CrossRef]
8. Andrusca, M.; Adam, M.; Dragomir, A.; Lunca, E.; Seeram, R.; Postolache, O. Condition monitoring system and faults detection
for impedance bonds from railway infrastructure. Appl. Sci. 2020, 10, 6167. [CrossRef]
9. Siddiqui, H.U.R.; Saleem, A.A.; Raza, M.A.; Zafar, K.; Munir, K.; Dudley, S. IoT based railway track faults detection and
localization using acoustic analysis. IEEE Access 2022, 10, 106520–106533. [CrossRef]
10. Yu, Q.; Liu, A.; Yang, X.; Diao, W. An Improved Lightweight Deep Learning Model and Implementation for Track Fastener Defect
Detection with Unmanned Aerial Vehicles. Electronics 2024, 13, 1781. [CrossRef]
11. Tong, L.; Jia, L.; Geng, Y.; Liu, K.; Qin, Y.; Wang, Z. Anchor-adaptive railway track detection from unmanned aerial vehicle
images. Comput.-Aided Civ. Infrastruct. Eng. 2023, 38, 2666–2684. [CrossRef]
12. Moriello, R.S.L.; Caputo, E.; Gargiulo, F.; de Alteriis, G.; Donvito, A.; Bitonto, P.; Alfano, A. An Internet of Things-based Solution
for Monitoring Freight Train Carriages. In Proceedings of the 2024 IEEE International Workshop on Metrology for Industry 4.0 &
IoT (MetroInd4.0 & IoT), Firenze, Italy, 29–31 May 2024; IEEE: Piscataway, NJ, USA, 2024; pp. 94–99.
13. Oh, K.; Yoo, M.; Jin, N.; Ko, J.; Seo, J.; Joo, H.; Ko, M. A review of deep learning applications for railway safety. Appl. Sci. 2022,
12, 10572. [CrossRef]
14. Wang, T.; Yang, F.; Tsui, K.L. Real-time detection of railway track component via one-stage deep learning networks. Sensors 2020,
20, 4325. [CrossRef] [PubMed]
15. Shim, J.; Koo, J.; Park, Y.; Kim, J. Anomaly detection method in railway using signal processing and deep learning. Appl. Sci.
2022, 12, 12901. [CrossRef]
16. Kapoor, R.; Goel, R.; Sharma, A. An intelligent railway surveillance framework based on recognition of object and railway track
using deep learning. Multimed. Tools Appl. 2022, 81, 21083–21109. [CrossRef] [PubMed]
17. Thendral, R.; Ranjeeth, A. Computer vision system for railway track crack detection using deep learning neural network. In
Proceedings of the 2021 3rd International Conference on Signal Processing and Communication (ICPSC), Coimbatore, India,
13–14 May 2021; IEEE: Piscataway, NJ, USA, 2021; pp. 193–196.
18. Wang, M.; Li, K.; Zhu, X.; Zhao, Y. Detection of surface defects on railway tracks based on deep learning. IEEE Access 2022,
10, 126451–126465. [CrossRef]
19. Feng, J.H.; Yuan, H.; Hu, Y.Q.; Lin, J.; Liu, S.W.; Luo, X. Research on deep learning method for rail surface defect detection. IET
Electr. Syst. Transp. 2020, 10, 436–442. [CrossRef]
20. He, D.; Zou, Z.; Chen, Y.; Liu, B.; Yao, X.; Shan, S. Obstacle detection of rail transit based on deep learning. Measurement 2021,
176, 109241. [CrossRef]
21. Sresakoolchai, J.; Kaewunruen, S. Detection and severity evaluation of combined rail defects using deep learning. Vibration 2021,
4, 341–356. [CrossRef]
Infrastructures 2025, 10, 3 17 of 17
22. Brintha, K.; Joseph Jawhar, S. FOD-YOLO NET: Fasteners fault and object detection in railway tracks using deep yolo network 1.
J. Intell. Fuzzy Syst. 2024, 46, 1–15. [CrossRef]
23. Adnan, A.; Hossain, S.; Shihab, R.; Ibne, S. Railway Track Fault Detection, Dataset2 (Fastener). Available online: [Link]
com/datasets/ashikadnan/railway-track-fault-detection-dataset2fastener (accessed on 4 November 2024).
24. Adnan, A.; Hossain, S.; Shihab, R.; Ibne, S. Railway Track Fault Detection. Available online: [Link]
salmaneunus/railway-track-fault-detection (accessed on 4 November 2024).
25. Ibne, S.; Hossain, S.; RIDWAN, A. Railway-Rail-Fault_Detection. Available online: [Link]
salmaneunus/newdatasetridwanrailjanuary/discussion?sort=undefined (accessed on 4 November 2024).
26. Eunus, S.I.; Hossain, S.; Ridwan, A.E.M.; Adnan, A.; Islam, M.S.; Karim, D.Z.; Alam, G.R.; Uddin, J. ECARRNet: An Efficient
LSTM-Based Ensembled Deep Neural Network Architecture for Railway Fault Detection. AI 2024, 5, 482–503. [CrossRef]
27. Team, K. Keras Documentation: Keras Applications—[Link]. Available online: [Link] (accessed
on 30 October 2024).
28. Gibert, V.; Patel, M.; Chellappa, R. Semantic Segmentation of Railway Track Images with Deep Convolutional Neural Networks.
Available online: [Link] (accessed on
1 November 2024).
29. Chandran, P.; Asber, J.; Thiery, F.; Odelius, J.; Rantatalo, M. An investigation of railway fastener detection using image processing
and augmented deep learning. Sustainability 2021, 13, 12051. [CrossRef]
30. Yilmazer, M.; Karakose, M. Fastener and rail surface defects detection with deep learning techniques. Int. J. Adv. Intell. Inform.
2024, 10, 253–264. [CrossRef]
31. Guo, F.; Qian, Y.; Shi, Y. Real-time railroad track components inspection based on the improved YOLOv4 framework. Autom.
Constr. 2021, 125, 103596. [CrossRef]
32. Min, Y.; Guo, J.; Yang, K. Research on real-time detection algorithm of rail-surface defects based on improved YOLOX. J. Appl.
Sci. Eng. 2022, 26, 799–810.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual
author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to
people or property resulting from any ideas, methods, instructions or products referred to in the content.