Abstract:Aiming to address the challenges of extracting water bodies in large-scale environments and clarifying their long-term evolution patterns, the Landsat images of Heihe River Basin (1986—2024) were processed by using Google Earth Engine (GEE), approximately 78000 water and non-water samples were collected, and an annual sample dataset was built. Based on this dataset, a random forest (RF)-based water extraction method was developed by integrating the multi band water index (MBWI), enhanced water index (EWI), and modified normalized difference water index (MNDWI) with spectral bands, both individually and in combination. Through systematic screening, the optimal fusion index was selected, enabling the accurate extraction of surface water bodies across 39 temporal phases. The Mann-Kendall (M-K) test was applied to detect interannual trends in surface water area, while principal component analysis (PCA) and sensitivity analysis were used to identify the dominant driving factors influencing water body evolution. The results demonstrated that the RF method incorporating all three water indices (MBWI, EWI, and MNDWI) achieved the best extraction performance for Landsat images of the Heihe River Basin, with average overall accuracy (OA) of 96.16% and average Kappa coefficient (KC) of 0.9128. The M-K test indicated a fluctuating downward trend in surface water area from 1986 to 2024. Annual precipitation, population, and annual evapotranspiration were identified as the main driving factors for the evolution of surface water bodies in the Heihe River Basin. The research result can provide a theoretical foundation for the rapid and accurate extraction of surface water bodies at the basin scale and support future hydrological and environmental applications.