Innovation in Industrial UAV Applications Based on a Standard Modular Aerial Survey AI Platform

Authors

  • Hong Yang
  • Yanan Zhang
  • Chunxiu Su

DOI:

https://doi.org/10.54097/3cjdwb22

Keywords:

Mechanical engineering, Manufacturing technology, Electrical automation

Abstract

To break through the technical bottlenecks of traditional UAVs in scene reproduction accuracy, situational awareness efficiency, and real-time data processing, and to promote the development of UAV technology towards intelligence, collaboration, and high precision, this paper systematically develops and elaborates on three core technology systems: real-scene multi-modal AI fusion modeling technology, global situational intelligent perception and collaboration technology, and multi-modal data real-time fusion technology based on sparse representation. In terms of technical performance, the real-scene modeling accuracy reaches the centimeter level, and the global perception response speed is increased to the millisecond level. By building a hardware verification platform and a software simulation system, application tests have been completed in scenarios such as smart cities, power inspection, and geological exploration. The results show that this technology system can achieve an equipment defect recognition rate of 97.8% and improve the inspection path planning efficiency by 40%, providing key technical support for the efficient operation of UAVs in complex environments. At present, the relevant technologies have been piloted in a number of power enterprises and municipal units, with potential for large-scale promotion, and are of great significance for promoting the upgrading of the UAV industry and the development of the digital economy.

Downloads

Download data is not yet available.

References

[1] Khullar V, Kansal I, Bhattacharjee B S, et al. Multiple model visual feature embedding and selection method for an efficient pest classification supporting precision agriculture [J]. Scientific Reports, 2025, 15(1):31791-31791.

[2] Kansal I, Khullar V, Sharma P, et al. Correction: Multiple model visual feature embedding and selection method for an efficient ocular disease classification [J]. Scientific reports, 2025, 15(1):17643.

[3] Yang F, Li J, Zhang C, et al. Exploring the impact of built environment on traffic risk perception of school-aged children: A case study of old residential neighborhoods in Changsha City, Hunan Province, China [J]. Accident Analysis and Prevention, 2025, 212107920-107920.

[4] Liu X, Xu X, Abuduwayiti A, et al. Identification and Evaluation of Representative Places in Cities Using Multisource Data: Focusing on Human Perception [J]. Sustainability, 2024, 16(19):8299-8299.

[5] Hong X, Han Y, Wang B. Coupling dynamics of epidemic and multi-source information: The roles of social reinforcement and risk perception [J]. Information Sciences, 2025, 714122203-122203.

[6] Zhao X, Huang H, Lin G, et al. Exploring temporal and spatial patterns and nonlinear driving mechanism of park perceptions: A multi-source big data study [J]. Sustainable Cities and Society, 2025, 119106083-106083.

[7] Shan D, Zhu J, Luo Y. SANTM: A Sparse Access Neural Turing Machine with local multi-head self-attention for long-term memorization [J]. Applied Soft Computing, 2025, 181113448-113448.

[8] Zheng H, Fang G, Duan S, et al. A novel method for noise reduction of blade tip timing signals based on sparse representation and dictionary learning [J]. Journal of Low Frequency Noise, Vibration and Active Control, 2024, 43(1):437-454.

[9] Zhiyu S, Jiatong B, Jingwei L, et al. Haptic Recognition of Texture Surfaces Using Semi-Supervised Feature Learning Based on Sparse Representation [J]. Cognitive Computation, 2023, 15(5):1656-1671.

[10] Mi T, Takács D, Liu H, et al. Capturing the true bounding boxes: vehicle kinematic data extraction using unmanned aerial vehicles [J]. Journal of Intelligent Transportation Systems, 2025, 29(5):566-578.

[11] Technology - Cybernetics; Findings from Guangdong University of Technology in the Area of Cybernetics Reported (Distributed Sliding-mode Tracking Control of Second-order Nonlinear Multiagent Systems: an Event-triggered Approach) [J]. Journal of Engineering, 2020, 549-.

[12] Technology - Cybernetics; New Cybernetics Findings from Northeastern University Described (Distributed Fault Estimation for a Class of Nonlinear Multiagent Systems) [J]. Journal of Engineering, 2020, 14321-.

[13] Cheng Saixian. A Method for Applying Cloud Computing in Distributed Control of Non-Homo Sapiens Systems [J]. Aerodynamic Missile., 2018, (09):84-91. DOI:10.16338/j.issn.1009-1319.20180056.

[14] Wu Zhongbing, Shao Weijian, Hao Guofeng, et al. Implementation of Wireless Update Scheme for Homo sapiens-Free Machine Control Program [J/OL]. Electronics & Packaging, 1-8 [2025-09-09]. https://doi.org/10.16257/j.cnki.1681-1070.2026.0024.

[15] Shen Jiahui, Yang Lekai, Hai Shunhang, et al. Hierarchical Risk-Aware Path Planning for Unmanned Vehicles in Satellite-Denied Environments [J/OL]. Journal of Transportation Science and Engineering, 1-9 [2025-09-09]. https://doi.org/10.16544/j.cnki.cn43-1494/u.20250805001.

[16] Sun Yiming, Zhao Lithocarpus Glaberjia, Wang Shuo, et al. A Survey on Low Visual Datasets for Unmanned Homo Sapiens Machines [J]. Journal of Data Acquisition and Processing, 2025, 40(02):274-302. DOI:10.16337/j.1004-9037.2025.02.002.

[17] Wang Honglei, Shi Jingxuan, Zhang Yifan. Research on Homo Sapiens Body Pose Recognition Algorithm Based on LSTM-KAN [J/OL]. Mining Safety and Environmental Protection,1-13 [2025-09-09]. https://link.cnki.net/urlid/50.1062.TD.20250613.1432.003.

[18] Yang Changzheng. The Influence of Multi-Source Perception Association of Emergencies on the Reversal of Public Opinion Orientation from the Perspective of Block Modal Coupling [J]. Journal of Intelligence., 2024, 43(05):152-158+183.

[19] Xu Fangyi, Zheng Jiabin. AI-Empowered Audiovisual Communication: A Multi-Source Content Perception-Based Support System for Audiovisual Work Creation [J/OL]. Information Communication Research., 1-13 [2025-09-09]. https://doi.org/10.16196/j.cnki.issn.1673-4793.20250613.002.

[20] Qu Yingda. Research on Anchor-Free Mechanism Based SAR Image Target Detection Technology [D]. Beijing University of Posts and Telecommunications., 2023. DOI:10.26969/d.cnki.gbydu.2023.000217.

[21] SAR Image Target Perception, Recognition, and Interpretation Software System. Shaanxi Province, Xidian University, 2009-01-01.

[22] Prunus salicina Sihai, Liu Zhenmin, He Guangshun, et al. Remote Sensing Image Interpretation Application System in Tianjin Coastal Waters [J]. Hydrographic Surveying and Charting., 2005, (04):13-15.

[23] Wang Wei, Wu Zhigang. Research on Multi-source Information Fusion and Control Technology for Bionic Aircraft [J]. Navigation and Control, 2025, 24(Z1):1-12.

[24] Cui Ruixue, Wang Xuzhi, Wan Wanggen, et al. Multi-modal Feature Enhancement Based Visual Question Answering for Scene Text [J]. Industrial Control Computer., 2025, 38(06):12-14.

[25] Prunus salicina Xinran. Research on the Improvement of Multimodal Feature Fusion Algorithms Under Visual-Language Tasks [D]. Dalian Jiaotong University, 2025. DOI:10.26990/d.cnki.gsltc.2025.000676.

Downloads

Published

27-09-2025

Issue

Section

Articles