首页|基于改进YOLOX 的自然环境中火龙果检测方法

基于改进YOLOX 的自然环境中火龙果检测方法

etection Method for Dragon Fruit in Natural Environment Based on Improved YOLOX

来源：

中文摘要

英文摘要

自然环境下果实的精准检测是火龙果采摘机器人执行采摘作业的先决条件。为提高自然环境下果实识别的精确性、鲁棒性和检测效率，本研究对YOLOX（You Only Look Once X）网络进行改进，提出了一种含有注意力模块的目标检测方法。为便于在嵌入式设备上部署，本方法以YOLOX-Nano网络为基准，将卷积注意力模块（Convolutional Block Attention Module，CBAM）添加到YOLOX-Nano的主干特征提取网络中，通过为主干网络提取到不同尺度的特征层分配权重系数来学习不同通道间特征的相关性，加强网络深层信息的传递，降低自然环境背景下对火龙果识别的干扰。对该方法进行性能评估和对比试验，经过训练后，该火龙果目标检测网络在测试集的AP0.5值为98.9%，AP0.5:0.95的值为72.4%。在相同试验条件下对比其它 YOLO网络模型，该方法平均检测精度分别超越YOLOv3、YOLOv4-Tiny和YOLOv5-S模型26.2%、9.8%和 7.9%。最后对不同分辨率的火龙果果园自然环境下采集的视频进行实时测试。试验结果表明，本研究提出的改进YOLOX-Nano目标检测方法，每帧平均检测时间为21.72 ms，F1值为0.99，模型大小仅3.76 MB，检测速度、检测精度和模型大小满足自然环境下火龙果采摘的技术要求。

ragon fruit detection in natural environment is the prerequisite for fruit harvesting robots to perform harvesting. In order to improve the harvesting efficiency, by improving YOLOX (You Only Look Once X) network, a target detection network with an attention module was proposed in this research. As the benchmark, YOLOX-Nano network was chose to facilitate deployment on embedded devices, and the convolutional block attention module (CBAM) was added to the backbone feature extraction network of YOLOX-Nano, which improved the robustness of the model to dragon fruit target detection to a certain extent. The correlation of features between different channels was learned by weight allocation coefficients of features of different scales, which were extracted for the backbone network. Moreover, the transmission of deep information of network structure was strengthened, which aimed at reducing the interference of dragon fruit recognition in the natural environment as well as improving the accuracy and speed of detection significantly. The performance evaluation and comparison test of the method were carried out. The results showed that, after training, the dragon fruit target detection network got an AP0.5 value of 98.9% in the test set, an AP0.5:0.95 value of 72.4% and F1 score was 0.99. Compared with other YOLO network models under the same experimental conditions, on the one hand, the improved YOLOX-Nano network model proposed in this research was more lightweight, on the other hand, the detection accuracy of this method surpassed that of YOLOv3, YOLOv4 and YOLOv5 respectively. The average detection accuracy of the improved YOLOX-Nano target detection network was the highest, reaching 98.9%, 26.2% higher than YOLOv3, 9.8% points higher than YOLOv4-Tiny, and 7.9% points higher than YOLOv5-S. Finally, realtime tests were performed on videos with different input resolutions. The improved YOLOX-Nano target detection network proposed in this research had an average detection time of 21.72 ms for a single image. In terms of the size of the network model was only 3.76 MB, which was convenient for deployment on embedded devices. In conclusion, not only did the improved YOLOX- Nano target detection network model accurately detect dragon fruit under different lighting and occlusion conditions, but the detection speed and detection accuracy showed in this research could able to meet the requirements of dragon fruit harvesting in natural environment requirements at the same time, which could provide some guidance for the design of the dragon fruit harvesting robot.

作者：梁英凯、陈桥、肖明玮、商枫楠、罗陈迪、周学成

作者单位：

DOI：10.12074/202302.00142V1

学科分类：农业科学技术发展自动化技术、自动化技术设备计算技术、计算机技术

中文关键词：水果采摘自然环境火龙果目标检测YOLOX注意力机制深度学习

推荐引用：梁英凯,陈桥,肖明玮,商枫楠,罗陈迪,周学成.基于改进YOLOX 的自然环境中火龙果检测方法[EB/OL].(2023-02-17)[2025-08-03].https://chinaxiv.org/abs/202302.00142.点此复制

基于改进YOLOX 的自然环境中火龙果检测方法

etection Method for Dragon Fruit in Natural Environment Based on Improved YOLOX

评论