Abstract:Canola is one of the most important oilseed crops worldwide, and accurate prediction of its yield and total production is of great significance for food and edible oil security assessment, agricultural production management, and bioenergy potential analysis. To address the limited sample size in canola yield and production prediction, as well as the lack of systematic evaluation of production modeling and multi-source information synergy, county-level canola yield and total production data were collected, and multi-source remote sensing and meteorological big data were integrated to develop a collaborative prediction framework for canola yield and total production. On this basis, six tree-based machine learning models were employed to systematically evaluate the performance differences among models and data- source combinations in predicting canola yield and production. The results demonstrated that the joint use of remote sensing and meteorological data consistently outperformed single-source inputs for both yield and production prediction, with this advantage showing strong consistency and robustness across different tree- based model structures. For yield prediction, CatBoost achieved the best performance ( R2 = 0. 679, RMSE = 0. 269 t/ hm2, MAE = 0. 194 t/ hm2), whereas for production prediction, Cubist performed best (R2 = 0. 952, RMSE = 9. 42 × 103 t, MAE = 5. 00 × 103 t). The spatial patterns of the predicted yield and production were generally consistent with the observations. Overall, the results indicated that the integration of multi-source remote sensing and meteorological information with tree-based machine learning methods can effectively improve the accuracy of regional-scale canola yield and production prediction. The research result can provide reliable technical support for canola yield monitoring, precision agricultural management, and harvest scheduling decisions for agricultural machinery.