Visible Watermark Detection Based on an Improved YOLOv8 Model

Authors

  • Ying Xiao

DOI:

https://doi.org/10.54097/svd9b569

Keywords:

Visible Watermarks, Deep Learning, Object Detection

Abstract

With the widespread dissemination of digital images on the internet, issues related to image copyright protection and content security have become increasingly prominent, making visible watermarks a common method of copyright marking. To address the frequent inaccuracies in visible watermark detection under complex background conditions, this paper proposes an improved deep learning-based visible watermark detection model that achieves high-precision detection and localization of watermark regions. structural optimization is conducted based on the YOLOv8 object detection framework. Considering the characteristics of visible watermarks, such as large scale variations, uneven transparency, and strong coupling with background textures, an attention mechanism module is introduced into the backbone network to enhance feature representation of critical regions. In addition, a watermark feature enhancement module is designed to strengthen multi-scale feature fusion. Furthermore, the improved YOLOv8 model is fused with the RT-DETR model, combining the local feature extraction capability of convolutional neural networks with the global modeling ability of the Transformer architecture to improve detection accuracy and localization robustness. Experimental results demonstrate that the proposed method achieves an mAP50 of 98.9% and an mAP50:95 of 97.6% on the LVW dataset, outperforming multiple mainstream detection models and effectively balancing detection accuracy with efficiency.

Downloads

Download data is not yet available.

References

[1] Zhang, J. R. (2015). The study of digital image watermark algorithm based on transformation domain. Laser Journal, 36(6), 126–129.

[2] Braudaway, G. W., Magerlein, K. A., & Mintzer, F. C. (1996). Protecting publicly available images with a visible image watermark. In Electronic imaging: Science & technology (pp. 126–133). International Society for Optics and Photonics.

[3] Meng, J., & Chang, S. F. (1998). Embedding visible video watermark in the compressed domain. In 1998 International Conference on Image Processing (ICIP 1998) (Vol. 1, pp. 474–477). IEEE.

[4] Kankanhalli, M. S., & Ramakrishnan, K. (1999). Adaptive visible watermarking of images. In 1999 IEEE International Conference on Multimedia Computing and Systems (pp. 568–573). IEEE.

[5] Ren, S. Q., He, K. M., Girshick, R., & Sun, J. (2016). Faster R-CNN: Towards real-time object detection with region proposal networks. arXiv:1506.01497v3.

[6] Redmon, J., Divvala, S., Girshick, R., & Farhadi, A. (2016). You only look once: Unified, real-time object detection. arXiv:1506.02640v5.

[7] Liu, W., Anguelov, D., Erhan, D., Szegedy, C., & Reed, S. (2016). SSD: Single shot multibox detector. In Proceedings of the European Conference on Computer Vision (ECCV) (pp. 21–37). IEEE.

[8] Duan, K., Bai, S., Xie, L., Qi, H., Huang, Q., & Tian, Q. (2019). Centernet: Keypoint triplets for object detection. In Proceedings of the IEEE/CVF International Conference on Computer Vision (pp. 6569–6578).

[9] Tian, Z., Shen, C., Chen, H., & He, T. (2019). Fcos: Fully convolutional one-stage object detection. In Proceedings of the IEEE/CVF International Conference on Computer Vision (pp. 9627–9636).

[10] Kukartsev, V. V., Ageev, R. A., Borodulin, A. S., & Boyko, A. A. (2024). Deep learning for object detection in images: Development and evaluation of the YOLOv8 model using Ultralytics and Roboflow libraries. In Proceedings of the 13th Computer Science Online Conference (pp. 629–637). Springer.

[11] Cheng, D., Li, X., Li, W. H., & Wang, H. (2018). Large-scale visible watermark detection and removal with deep convolutional networks. In Chinese Conference on Pattern Recognition and Computer Vision (pp. 31–43). Springer.

[12] Everingham, M., Van Gool, L., Williams, C. K. I., Winn, J., & Zisserman, A. (2010). The Pascal Visual Object Classes (VOC) challenge. International Journal of Computer Vision, 88, 303–338. https://doi.org/10.1007/s11263-009-0275-4

[13] Dekel, T., Rubinstein, M., Liu, C., & Freeman, W. T. (2017). On the effectiveness of visible watermarks. In 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (pp. 6864–6872). IEEE. https://doi.org/10.1109/CVPR.2017.726

[14] Zhao, Y., et al. (2024). DETRs beat YOLOs on real-time object detection. In 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (pp. 16965–16974). IEEE. https://doi.org/10.1109/CVPR52733.2024.01605

Downloads

Published

29-05-2026

Issue

Section

Articles