Separable Multi-scale Large Kernel Convolutional Remote Sensing Denoising Network

Gui Luo; Xiangguo Sun

doi:10.54097/d92zgq91

Authors

Gui Luo
Xiangguo Sun

DOI:

https://doi.org/10.54097/d92zgq91

Keywords:

Image Denoising, Multi-scale, Frequency Separation, Large Kernel Convolution, Remote Sensing Images

Abstract

Abstract: The abstract of the study stated that remote sensing images contain abundant details of land objects and terrain, and the denoising process should strive to preserve these critical pieces of information. However, traditional CNN methods performed poorly when dealing with high-resolution, multi-scale, and complex scenes, as they struggled to model the long-range dependencies within images. Methods based on Transformer improved this issue through the self-attention mechanism; however, their high computational cost limited their application in resource-constrained environments. To address this, a Multi-Scale Large Kernel Detail Enhancement Network was proposed, aiming to effectively retain the detailed information in remote sensing images. By utilizing pooling to separate high and low-frequency information, the approach adopted separable multi-scale large kernel convolutions to capture extensive spatial information, enhancing high-frequency features while reducing computational complexity. These innovative techniques effectively expanded the receptive field, improving the denoising effect of remote sensing images. Currently, compared with the best results from other methods, MLKNet achieves an average improvement of approximately 3.1 dB in grayscale remote sensing image denoising across three different noise levels, and an average improvement of about 1.17 dB in color remote sensing image denoising under the same conditions.

Downloads

Download data is not yet available.

References

[1] Wang, H. Y., Yang, H. T., Wang, J. Y., et al. (2024). A review of research on remote sensing image denoising methods [J]. Journal of Computer Engineering & Applications, 60(15)

[2] Li Y, Li X, Dai Y, et al. LSKNet: A Foundation Lightweight Backbone for Remote Sensing [J]. arxiv preprint arxiv: 2403. 11735, 2024.

[3] Dabov K, Foi A, Katkovnik V, et al. Image restoration by sparse 3D transform-domain collaborative filtering[C]//Image processing: algorithms and systems VI. SPIE, 2008, 6812: 62-73.

[4] Aharon M, Elad M, Bruckstein A. K-SVD: An algorithm for designing overcomplete dictionaries for sparse representation [J]. IEEE Transactions on signal processing, 2006, 54(11): 4311-4322.

[5] Gu S, Zhang L, Zuo W, et al. Weighted nuclear norm minimization with application to image denoising[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2014: 2862-2869.

[6] Zhang K, Zuo W, Chen Y, et al. Beyond a gaussian denoiser: Residual learning of deep cnn for image denoising [J]. IEEE transactions on image processing, 2017, 26(7): 3142-3155.

[7] Jin, H. Z., Zhang, X. Y., Ye, Z. W., et al. (2022). Image denoising model based on approximate U-shaped network structure. Journal of Computer Applications, 42(8), 2571-2577.

[8] Zhang K, Zuo W, Zhang L. FFDNet: Toward a fast and flexible solution for CNN-based image denoising [J]. IEEE Transactions on Image Processing, 2018, 27(9): 4608-4622.

[9] Tian C, Xu Y, Fei L, et al. Enhanced CNN for image denoising [J]. CAAI Transactions on Intelligence Technology, 2019, 4(1): 17-23.

[10] Tian C, Xu Y, Li Z, et al. Attention-guided CNN for image denoising [J]. Neural Networks, 2020, 124: 117-129.

[11] Han L, Zhao Y, Lv H, et al. Remote sensing image denoising based on deep and shallow feature fusion and attention mechanism [J]. Remote Sensing, 2022, 14(5): 1243.

[12] Alexey D. An image is worth 16x16 words: Transformers for image recognition at scale [J]. arxiv preprint arxiv: 2010.11929, 2020.

[13] Chen H, Wang Y, Guo T, et al. Pre-trained image processing transformer[C]//Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2021: 12299-12310.

[14] Liang J, Cao J, Sun G, et al. Swinir: Image restoration using swin transformer[C]//Proceedings of the IEEE/CVF international conference on computer vision. 2021: 1833-1844.

[15] Wang Z, Cun X, Bao J, et al. Uformer: A general u-shaped transformer for image restoration[C]//Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2022: 17683-17693.

[16] Tu Z, Talebi H, Zhang H, et al. Maxim: Multi-axis mlp for image processing[C]//Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2022: 5769-5780.

[17] Zamir S W, Arora A, Khan S, et al. Restormer: Efficient transformer for high-resolution image restoration[C]//Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2022: 5728-5739.

[18] Zhang J, Zhang Y, Gu J, et al. Xformer: Hybrid x-shaped transformer for image denoising [J]. arxiv preprint arxiv: 2303. 06440, 2023.

[19] Ding, Y. W., Shi, H. B., Li, J., et al. (2024). Image denoising network based on local and global feature decoupling. Journal of Computer Applications, 44(8), 2571-2579.

[20] Lau K W, Po L M, Rehman Y A U. Large separable kernel attention: Rethinking the large kernel attention design in cnn [J]. Expert Systems with Applications, 2024, 236: 121352.

[21] Hao F, Wu J, Lu H, et al. Large coordinate kernel attention network for lightweight image super-resolution [J]. arxiv preprint arxiv: 2405. 09353, 2024.

[22] Guo M H, Lu C Z, Liu Z N, et al. Visual attention network [J]. Computational Visual Media, 2023, 9(4): 733-752.

[23] Chen L, Chu X, Zhang X, et al. Simple baselines for image restoration[C]//European conference on computer vision. Cham: Springer Nature Switzerland, 2022: 17-33.

[24] Yang K, Hu T, Dai K, et al. CRNet: A Detail-Preserving Network for Unified Image Restoration and Enhancement Task [J]. arxiv preprint arxiv: 2404. 14132, 2024.

[25] Ouyang D, He S, Zhang G, et al. Efficient multi-scale attention module with cross-spatial learning[C]//ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2023: 1-5.

[26] Hou Q, Zhou D, Feng J. Coordinate attention for efficient mobile network design[C]//Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2021: 13713-13722.

[27] Cheng G, Han J, Lu X. Remote sensing image scene classification: Benchmark and state of the art [J]. Proceedings of the IEEE, 2017, 105(10): 1865-1883.

[28] Li, C., Li, X. T., Li, H. X., et al. (2024). A remote sensing image denoising method fused with multi-scale features. Electronics Optics & Control, 31(6), 74-80.

[29] Yang Y, Newsam S. Bag-of-visual-words and spatial extensions for land-use classification[C]//Proceedings of the 18th SIGSPATIAL international conference on advances in geographic information systems. 2010: 270-279.

Separable Multi-scale Large Kernel Convolutional Remote Sensing Denoising Network

Authors

DOI:

Keywords:

Abstract

Downloads

References

Downloads

Published

Issue

Section

License

Cover

Indexing & Abstracting