基于深度强化学习的茶芽采摘三维路径规划方法
CSTR:
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

基金项目:

国家自然科学区域创新联合基金重点项目(U23A20175)和国家自然科学基金项目(32472009)


Deep Reinforcement Learning-based Three-dimensional Path Planning Algorithm for Tea Bud Picking
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    路径规划是决定自动化茶芽采摘机器人采摘效率的核心决策环节。针对自动茶芽采摘路径规划中三维空间建模困难、实时响应要求高以及对动态节点规模适应性差等问题,将茶芽采摘任务建模为三维旅行商问题,并提出了一种基于深度强化学习的茶芽采摘三维路径规划方法。本研究采用零填充策略构建了参数化的三维茶芽数据集,设计了动态掩码机制以确保路径的有效性,引入了长短期记忆模块来动态融合历史路径信息,开发了约束多头自注意力机制以提高决策准确性,并基于演员-评论家(Actor-Critic)框架稳定模型训练。对比试验表明,所提出的算法与精确的Concorde算法之间的相对最优性差距控制在2%以内,而在处理80个茶叶芽的情况下计算时间仅为Concorde算法的1.7%。该算法优于经典的启发式算法和其他基于深度强化学习的算法,在实际茶园场景中表现稳定,为采茶机器人路径规划提供了高效方案,可迁移至其他精准采摘任务,也为三维路径规划提供了理论基础。

    Abstract:

    Picking path planning is the core decision-making link that determines the harvesting efficiency of automated tea bud picking robots. Aiming at the problems of difficult three-dimensional spatial modeling, high real-time response requirements and poor adaptability to dynamic node scales in automatic tea bud picking path planning, the tea bud picking task was modelled as a three-dimensional traveling salesman problem (3D-TSP), and a deep reinforcement learning-based 3D path planning algorithm was proposed for tea bud picking. A parametric 3D tea bud dataset with a zero-padding strategy was constructed, a dynamic masking mechanism was designed to ensure path validity, a long short-term memory (LSTM) module was introduced to dynamically fuse historical path information, a constrained multi-head self-attention mechanism was developed to improve decision-making accuracy, and the stability model training was done based on the Actor-Critic framework. Comparative experiments showed that the relative optimality gap between the proposed algorithm and the exact Concorde algorithm was controlled within 2% , and its computation time in the scenario with 80 tea buds was only 1.7% of that of Concorde. The algorithm outperformed classical heuristic algorithms and other deep reinforcement learning-based algorithms, maintained stable performance in real tea garden scenarios, and provided an efficient solution for tea picking robot path planning.

    参考文献
    相似文献
    引证文献
引用本文

桂志勇,梅相贞,陈建能,俞焘杰,贾江鸣,贺磊盈,武传宇.基于深度强化学习的茶芽采摘三维路径规划方法[J].农业机械学报,2026,57(13):68-78. Gui Zhiyong, Mei Xiangzhen, Chen Jianneng, Yu Taojie, Jia Jiangming, He Leiying, Wu Chuanyu. Deep Reinforcement Learning-based Three-dimensional Path Planning Algorithm for Tea Bud Picking[J]. Transactions of the Chinese Society for Agricultural Machinery,2026,57(13):68-78.

复制
分享
相关视频

文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2026-03-09
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2026-07-01
  • 出版日期:
文章二维码