Abstract:The entire process of selective harvesting of broccoli requires precise maturity recognition, harvesting posture analysis, and rapid crop row navigation. However, at present, there is a lack of a visual algorithm suitable for the full-process operation. To address these issues, an efficient harvesting visual system, EH-YOLO, was proposed for the selective harvesting robot of broccoli, aiming to achieve accurate maturity recognition and autonomous harvesting in complex agricultural environments. The efficient multi-scale attention module was incorporated into the YOLO v8n backbone network, and a soft spatial pyramid pooling-fast module was designed. Additionally, a lightweight neck structure based on grouped shuffle convolution was employed, and the loss function was improved to enhance feature extraction and fusion capabilities, reduce computational complexity, and improve model performance in complex environments. Furthermore, by analyzing the crop??s agronomic features, the plant location was determined based on the center point of the broccoli head. The harvesting process was adjusted according to the quadrant in which the broccoli was located, and clustering algorithms, combined with the least squares method, were used to extract the crop row navigation lines. The model size was 3. 3 MB, and the detection speed reached 61. 45 f/ s. The accuracy of maturity recognition for mature broccoli was 92. 84% . Additionally, compared with the baseline model, EH-YOLO reduced the computational parameters by 38. 68% , decreased the model size by 46. 77% , and increased the average accuracy by 4. 01 percentage points. In field trials, the system successfully completed the full-process selective harvesting task, with a motion and positioning success rate of 96. 77% and a selective harvesting success rate of 87. 37% . Therefore, EH-YOLO was a practical and efficient visual system for the selective harvesting robot of broccoli, with strong applicability for commercial broccoli fields.