Distant Water Surface Garbage Recognition Method in Complex Scenarios
DOI:
https://doi.org/10.54097/w29n3w46Keywords:
Remote garbage detection on water surfaces, Dynamic feature adjustment, Scale adaptationAbstract
To improve the insufficient identification rate of remote garbage targets by water surface cleaning equipment, a model based on RT-DETR for remote garbage target detection called FRT-DETR was developed. First, CSPCAA-ADown was proposed as a modified backbone network, which better captures feature information of garbage targets beyond 30 meters at different scales while reducing computational burden, thus minimizing the impact of insufficient feature information caused by distance. Second, a CCGAF module was introduced to dynamically adjust the importance of features from different layers. Finally, an adaptive wavelet pooling module, WaveletPool, was employed for upsampling and downsampling feature maps, thereby reducing the impact of target scale variations and achieving good recognition rates even under conditions of blurry distant targets and complex backgrounds. The improved model achieves an mAP0.5 of 91.3% on the FWSG dataset with a reduced parameter count of 11.8M, providing a high-precision, high-efficiency, and low-computational-cost detection solution for water environment monitoring.
Downloads
References
[1] Ren S, He K, Girshick R, et al. Towards real-time object detection with region proposal networks, Adv [J]. Neural Inf. Process, 2015, 28.
[2] Redmon J. You only look once: Unified, real-time object detection [C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2016.
[3] Zeng Z C, Xu Y, Wang J Y, et al. A lightweight water surface object detection algorithm based on SOE-YOLO [J]. Journal of Graphics, 2024, 45(04): 736-744.
[4] Li C, Zhou A, Yao A. Omni-dimensional dynamic convolution [J]. arXiv preprint arXiv:2209.07947, 2022.
[5] Du A Q, Guo F L, Zhang Z L, et al. Surface Garbage Detection Method Based on Improved YOLOv5s [J]. Journal of Wuhan Polytechnic University, 2023, 42(05): 98-105+113.
[6] Zhou H P, Li Y H, Dang A P. Small Target Detection of Floating Garbage on Water Surface Based on YOLOv7 with Edge Enhancement [J]. Journal of Langfang Normal University (Natural Science Edition), 2024, 24(02): 45-51.
[7] Zhu L, Wang X, Ke Z, et al. Biformer: Vision transformer with bi-level routing attention [C]//Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2023: 10323-10333.
[8] Vaswani A. Attention is all you need [J]. Advances in Neural Information Processing Systems, 2017.
[9] Carion N, Massa F, Synnaeve G, et al. End-to-end object detection with transformers [C] //European conference on computer vision, 2020: 213-229.
[10] ZHU X, SU W, LU L, et al. Deformable DETR: Deformable Transformers for End-to-End Object Detection [J]. arXiv preprint arXiv:2010.04159, 2020.
[11] Zhao Y, Lv W, Xu S, et al. Detrs beat yolos on real-time object detection [C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2024: 16965-16974.
[12] Williams T, Li R. Wavelet pooling for convolutional neural networks [C]//International conference on learning representations. 2018.
[13] Lin T Y, Dollár P, Girshick R, et al. Feature pyramid networks for object detection [C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2017: 2117-2125.
[14] Wang C Y, Yeh I H, Mark Liao H Y. Yolov9: Learning what you want to learn using programmable gradient information [C]//European Conference on Computer Vision. Springer, Cham, 2025: 1-21.
[15] Huang Y, Jia W, He X, et al. CAA: Channelized axial attention for semantic segmentation [J]. arXiv preprint arXiv: 2101.07434, 2021.
[16] Chen Z, He Z, Lu Z M. DEA-Net: Single image dehazing based on detail-enhanced convolution and content-guided attention [J]. IEEE Transactions on Image Processing, 2024.
[17] Hu S, Gao F, Zhou X, et al. Hybrid Convolutional and Attention Network for Hyperspectral Image Denoising [J]. IEEE Geoscience and Remote Sensing Letters, 2024.
[18] He K, Zhang X, Ren S, et al. Deep residual learning for image recognition [C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2016: 770-778.
[19] Wang Z, Li C, Xu H, et al. Mamba YOLO: SSMs-Based YOLO For Object Detection [J]. arXiv preprint arXiv: 2406.05835, 2024.
[20] Hu J L, Zhou M, Shen F. Improved RTDETR Detection Algorithm for Small Targets in UAV Applications [J]. Computer Engineering and Applications, 2024, 60(20): 198-206.
[21] Chen L, Zhu J. Water surface garbage detection based on lightweight YOLOv5 [J]. Scientific Reports, 2024, 14(1): 6133.
[22] Redmon J. You only look once: Unified, real-time object detection [C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2016.
[23] Junzhe Z, Fuqiang J, Yupeng C, et al. A water surface garbage recognition method based on transfer learning and image enhancement [J]. Results in Engineering, 2023, 19: 101340.
[24] Girshick R, Donahue J, Darrell T, et al. Rich feature hierarchies for accurate object detection and semantic segmentation [C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2014: 580-58.
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Journal of Computer Science and Artificial Intelligence

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.








