Research on Road Damage Recognition Based on VanillaNet Neural Network

Authors

  • Han Wang
  • Min Li
  • Xuyang Gu
  • Xiang Liu
  • Boyu Yang
  • Kai Chen

DOI:

https://doi.org/10.54097/j2ex8q47

Keywords:

VanillaNet Neural Network, Nonlinear Activation Functions, Deep Training Techniques

Abstract

This paper presents a highly efficient and accurate model for identifying road conditions, specifically distinguishing between normal roads and potholes in images. Given the challenges posed by potholes, such as their prevalence in shaded areas, cluttered backgrounds, difficulty in annotation, and diverse shapes, we initiate the process by normalizing the dataset. We select the VanillaNet neural network as our foundational training architecture due to its elegant and simplistic design, which maintains exceptional performance in computer vision tasks. VanillaNet commences with several layers incorporating nonlinear activation functions, gradually eliminating these layers as training progresses, facilitating easy integration while preserving inference speed. This network surpasses contemporary models in both efficiency and accuracy, making it an ideal choice for our analysis. The quantitative results demonstrate promising outcomes, laying a solid foundation for future research endeavors. Subsequently, we train the developed VanillaNet model, employing deep training techniques to enhance its performance. The model is comprehensively evaluated based on accuracy, speed, and generalization capabilities to ensure it meets our requirements. In the object detection experiments, the model's performance is assessed using four key metrics.

Downloads

Download data is not yet available.

References

[1] A Study on Enhancing Chip Detection Efficiency Using the Lightweight Van-YOLOv8 Network. Comput. Mater. Contin. 79, 531–547 (2024).

[2] Tao, Z., He, Y., Lin, S., Yi, T. & Li, M. SnakeNet: An adaptive network for small object and complex background for insulator surface defect detection. Comput. Electr. Eng. 117, 109259 (2024).

[3] Zhou, K. et al. An improved YOLOv10 algorithm for automated damage detection of glass curtain-walls in high-rise buildings. J. Build. Eng. 101, 111812 (2025).

[4] Xu, S., Zhang, M., Chen, J. & Zhong, Y. YOLO-HyperVision: A vision transformer backbone-based enhancement of YOLOv5 for detection of dynamic traffic information. Egypt. Inform. J. 27, 100523 (2024).

[5] Liu, J. & Ren, Q. Fixed-time synchronization of coupled neural networks under Hölder continuous nonlinear activation function. Eur. J. Control 81, 101165 (2025).

Downloads

Published

27-04-2025

Issue

Section

Articles