Extraction Method of Terrace Operation Area in Loess Plateau Based on Improved TransUNet
Funding:

National Key Research and Development Program of China (2022YFD2001300, 2023YFD1000800) and Key Research and Development Program of Shaanxi Province (2022ZDLNY03-04)

    Abstract:

    Accurate maps of farmland operation areas are an important prerequisite for path planning and navigation of agricultural machinery. Terraced fields on the Loess Plateau vary in size and have complex, irregular shapes, with pits, ditches, ridges, and many hazardous operation boundaries, so commonly used methods such as satellite point surveying struggle to extract terrace operation areas accurately. Using UAV remote sensing images of terraces as the data source, this paper proposes an improved TransUNet model based on multi-scale feature extraction and fused up-sampling. In the encoder, a pyramid squeeze attention (PSA) module is introduced to strengthen, on top of channel attention, the extraction and fusion of terrace features at different scales, and the Transformer layers are optimized with a residual structure. In the decoder, a Dual up-sample module fuses a sub-pixel convolution layer with bilinear interpolation up-sampling, improving terrace boundary segmentation accuracy while preventing checkerboard artifacts; a concurrent spatial and channel squeeze and excitation (SCSE) attention module is added at the end of the decoder to integrate and enhance information along both the spatial and channel dimensions, which helps recover image detail progressively. Experimental results show that, on test sets of three typical terrace types (straight strip, winding strip, and irregular), the improved TransUNet achieved a mean pixel accuracy, F1 score, and mean intersection over union of 96.0%, 96.0%, and 92.3% on average, respectively. The three metrics improved by 1.8 percentage points on average over the original TransUNet, and by 8.3, 6.2, 5.0, and 4.2 percentage points on average over the representative PSPNet, HRNet V2, DeepLab V3+, and U-Net models, respectively. On the test sets of three types of single terraces, the proposed model performed best, with an average intersection over union of 97.0%. The method can serve as a reference for building terrace environment maps on the Loess Plateau and for agricultural machinery navigation in hilly and mountainous areas.
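
The three metrics reported in the abstract (mean pixel accuracy, F1 score, mean intersection over union) can all be derived from a pixel-level confusion matrix. The sketch below is illustrative only and is not the authors' evaluation code. It assumes a binary labelling (0 = background, 1 = terrace operation area) and macro-averaging over classes, neither of which the abstract confirms. The function names `confusion_matrix` and `segmentation_metrics` are hypothetical.

```python
def confusion_matrix(y_true, y_pred, num_classes=2):
    """Count pixels: rows are true classes, columns are predicted classes."""
    cm = [[0] * num_classes for _ in range(num_classes)]
    for t, p in zip(y_true, y_pred):
        cm[t][p] += 1
    return cm


def segmentation_metrics(y_true, y_pred, num_classes=2):
    """Return macro-averaged mPA, F1, and mIoU (assumed averaging scheme)."""
    cm = confusion_matrix(y_true, y_pred, num_classes)
    recalls, f1s, ious = [], [], []
    for c in range(num_classes):
        tp = cm[c][c]
        fn = sum(cm[c]) - tp                                    # class c pixels that were missed
        fp = sum(cm[r][c] for r in range(num_classes)) - tp     # other pixels labelled as c
        recalls.append(tp / (tp + fn) if tp + fn else 0.0)      # per-class pixel accuracy
        f1s.append(2 * tp / (2 * tp + fp + fn) if tp + fp + fn else 0.0)
        ious.append(tp / (tp + fp + fn) if tp + fp + fn else 0.0)
    n = num_classes
    return {"mPA": sum(recalls) / n, "F1": sum(f1s) / n, "mIoU": sum(ious) / n}


# Toy example with flattened masks: 0 = background, 1 = terrace (assumed labels).
truth = [0, 0, 0, 0, 1, 1, 1, 1]
pred = [0, 0, 0, 1, 1, 1, 1, 0]
metrics = segmentation_metrics(truth, pred)
print(metrics)
```

In this toy case each class has 3 correct pixels, 1 false positive, and 1 false negative, so the per-class IoU is 3/5 while the per-class pixel accuracy and F1 are both 3/4. This illustrates why mIoU is typically the lowest of the three figures, as in the reported 96.0%, 96.0%, and 92.3%.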

Cite this article

YANG Fuzeng, YUAN Minxin, XU Xianghu, WANG Wang, YANG Jiangtao, LIU Zhijie. Extraction Method of Terrace Operation Area in Loess Plateau Based on Improved TransUNet[J]. Transactions of the Chinese Society for Agricultural Machinery, 2024, 55(12): 278-286.

History
  • Received: 2024-01-16
  • Published online: 2024-12-10