Arid Zone Research ›› 2026, Vol. 43 ›› Issue (1): 144-155.doi: 10.13866/j.azr.2026.01.13

• Plant Ecology • Previous Articles     Next Articles

Application of machine learning algorithms for estimation of aboveground biomass of Tamarix nebkha

DONG Yaqing1,2,3(), SONG Shaoteng1,2,3, GUO Xiaoqian1,2,3, ZHAO Yuanjie1,2,3()   

  1. 1. School of Geographical Sciences, Hebei Normal University, Shijiazhuang 050024, Hebei, China
    2. Hebei Technology Innovation Center for Remote Sensing Identification of Environmental Change, Shijiazhuang 050024, Hebei, China
    3. Hebei Key Laboratory of Environmental Change and Ecological Construction, Shijiazhuang 050024, Hebei, China
  • Received:2025-09-08 Revised:2025-10-30 Online:2026-01-15 Published:2026-01-28
  • Contact: ZHAO Yuanjie E-mail:13832361024@163.com;ecoenvir@163.com

Abstract:

Tamarix chinensis, a dominant shrub species in arid ecosystems, is characterized by high stress resistance and adaptive growth strategies. It is crucial for stabilizing sand dunes, reducing wind erosion, and facilitating ecological restoration. Consequently, the aboveground biomass (AGB) of Tamarix nebkha is an essential indicator of vegetation status and desertification control in arid regions. The research object of this study was Tamarix nebkha in the lower reaches of the Tarim River. In total, 92 spectral, vegetation, and texture indices were extracted as feature variables from Landsat 8 images. For subsequent variable selection, stepwise regression, least absolute shrinkage and selection operator (LASSO), and extreme gradient boosting (XGBoost) algorithms were employed. Random forest (RF), support vector regression (SVR), and backpropagation neural network (BPNN) models were then constructed to estimate the AGB of Tamarix nebkha. The adaptability and estimation accuracy of different feature selection methods and models were compared, and the applicability of multivariate algorithms for the estimation of Tamarix nebkha AGB were explored. The results indicated that: (1) The variable sets selected by the stepwise regression, LASSO, and XGBoost algorithms were significantly correlated with the AGB of Tamarix nebkha, and the low multicollinearity among variables (VIF<5) confirmed the effectiveness of the algorithms. (2) The models constructed using the LASSO and XGBoost algorithms were significantly more accurate than those constructed using the stepwise method. Among the tested models, the RF model developed using the LASSO algorithm had the best performance (R2=0.73, RMSE=447.63 g·m-2). The incorporation of multivariate combinations substantially enhanced the predictive capability of the models. (3) The LASSO-based RF model estimated the mean AGB of Tamarix nebkha in the lower Tarim River region to be 1733.63 g·m-2 and the total biomass to be approximately 1.71×108 kg. The developed model exhibited high reliability and strong applicability across the regional scale. The findings of this study provide a scientific basis for optimizing the selection of remote sensing inversion methods and enhancing the accuracy of AGB estimation in desert shrub ecosystems.

Key words: Tamarix nebkha, aboveground biomass, LASSO algorithm, machine learning, remote sensing estimation