Abstract:Lightweight detection, tracking, and counting passion fruit are crucial technologies for achieving intelligent harvesting in smart agriculture. In real orchard environments, factors such as overlapping occlusion with leaves and branches, along with lighting variations, often result in missed detections, false detections and repeated counting. To address these challenges, a lightweight counting framework was proposed based on YOLO 11n?CLL and BoT?SORT. In object detection, YOLO 11n?CLL was built upon YOLO 11n as its baseline model. Firstly, a lightweight context guided block was introduced to enhance contextual perception capability. Nextly, the large separable kernel attention mechanism was incorporated to improve multi?scale feature modeling. Finally, the lightweight shared detail?enhanced convolutional detection head was employed to effectively reconstruct the texture features of occluded fruits. For object tracking, the BoT?SORT tracker was adopted, which incorporated a camera?motion?compensation module to mitigate camera?shake interference and improve tracking accuracy. In fruit counting task, a region?based counting method was adopted to achieve accurate counting of passion fruits. Results showed that YOLO 11n?CLL achieved mAP@0.5 of 87.0% with a compact 5.0 MB model size. The BoT?SORT tracker achieved a HOTA of 62.4% and a MOTA of 68.4%. The region?based counting method yielded an average counting accuracy of 92.8%, outperforming the line?based counting and ID?based counting method by 8.8 percentage points and 29.9 percentage points. The results demonstrated that the proposed framework enabled detection, tracking, and counting of passion fruits and provided technical support for intelligent harvesting in passion fruit orchards.