基于REDCAP法的土壤有机碳高光谱反演模型代表性校正集的构建

Construction of Representative Calibration Set Based on the REDCAP Method in Hyperspectral Inversion Model for Soil Organic Carbon

  • 摘要:
    目的 土壤属性在多元环境变量的影响下,通常呈现显著的空间分异性。现有校正集选择方法鲜少考虑土壤属性自身的空间分异特征,易导致校正集样本全局代表性不足,从而影响模型的预测精度和鲁棒性。本文拟在现有经典校正集选择方法的基础上,进一步考虑空间分异模式对样本代表性的影响,提出一套改进的顾及空间分异模式的校正集选择策略。
    方法 首先采用动态约束的区域聚类与分割算法(Regionalization with dynamically constrained agglomerative clustering and partitioning,REDCAP)挖掘土壤属性的空间分异模式,得到地理连续的区域划分,且在划分的子区域内部具有相似的土壤属性分布特征,子区域之间具有显著差异的土壤属性分布特征;然后,于各分异子区域内,采用经典校正集选择方法,包括浓度梯度(Rank)法、Kennard-Stone(KS)法以及SPXY法,选取具有局部代表性的样本;最后合并各子区域内的代表性样本,构建具有全局地理空间信息代表性及土壤属性代表性的校正集,并将方法记为REDCAP-Rank法、REDCAP-KS法和REDCAP-SPXY法。为验证本文提出方法的有效性,在德国北部区域开展应用,并与传统校正集选择方法建模结果进行对比分析。其中土壤有机碳预测模型采用偏最小二乘模型(PLSR),支持向量机模型(SVM)和随机森林模型(RF)。
    结果 相比传统校正集选择方法,REDCAP-Rank法、REDCAP-KS法和REDCAP-SPXY法选取的校正集建模精度整体得到提升,其中,REDCAP-KS校正集选择方法相较于KS方法,预测模型结果精度均有提升, R_\textp^\text2 最高提升0.11,RPD增长百分比最高达14.47%; REDCAP-SPXY校正集选择方法相较SPXY方法,93.3%的预测模型结果精度得到提升, R_\textp^\text2 最高提升0.09,RPD增长百分比最高达13.04%。KS、REDCAP-KS、SPXY、REDCAP-SPXY、Rank以及REDCAP-Rank六种方法中REDCAP-KS的建模效果最优, R_\textp^\text2 达到0.71,RPD达到1.80。
    结论 基于REDCAP法的校正集选择策略能够选取具有地理空间信息代表性的样本,结合REDCAP-KS方法划分的样本集构建PLSR模型,能够较好的满足高光谱反演土壤有机碳预测需求。

     

    Abstract:
    Objective Soil properties often exhibit remarkable spatial heterogeneity under the influence of multiple environmental variables. Existing calibration set selection methods seldom take into account the spatial heterogeneity of soil properties, which can lead to inadequate global representativeness of the calibration set samples, thereby affecting the predictive accuracy and robustness of the model. This study aimed to further consider the influence of spatial heterogeneity patterns on sample representativeness based on the existing classical calibration set selection methods, to propose an improved calibration set selection strategy that takes spatial heterogeneity patterns into account.
    Method Firstly, the regionalization with dynamically constrained agglomerative clustering and partitioning (REDCAP) method was employed to mine the spatial heterogeneity patterns of soil attributes, and the geographical continuous regional division was obtained, which result in the soil attribute distribution characteristics similar within each subzone ant substantial differences among subzones. Subsequently, classical calibration set selection methods, including concentration gradient (Rank) method, Kennard-Stone (KS) method, and SPXY method, were employed within each subzone to select samples that exhibit local representativeness. Finally, representative samples from each subzone are combined to construct a calibration set that exhibits both global geographical spatial information representativeness and soil attribute representativeness. These methods are referred to as the REDCAP-Rank method, the REDCAP-KS method, and the REDCAP-SPXY method. To validate the effectiveness of the proposed method, applications were conducted in the northern region of Germany, and comparative analyses were performed with traditional calibration set selection methods. Among them, the prediction model of soil organic carbon (SOC) by using Partial Least Square Regression (PLSR), Support Vector Machine (SVM) model and Random Forest (RF).
    Result The results showed that the modeling accuracy of correction sets selected by REDCAP-Rank method, REDCAP-KS method and REDCAP-SPXY method was improved overall compared with the traditional correction set selection methods. Among them, compared with KS method, REDCAP-KS correction set selection method has improved the accuracy of all prediction model results, the highest increase of R_\textp^\text2 was 0.11, and the RPD growth rate was up to 14.47%. Compared with SPXY method, REDCAP-SPXY correction set selection method could improve the accuracy of 93.3% prediction, the highest increase of R_\textp^\text2 was 0.09, and the RPD growth rate was up to 13.04%. Among the six methods of KS, REDCAP-KS, SPXY, REDCAP-SPXY, Rank and REDCAP-Rank, the modeling effect of REDCAP-KS was the best, R_\textp^\text2 reaching 0.71 and RPD was 1.80.
    Conclusion The correction set selection strategy based on REDCAP method can select samples with representative geospatial information, and construct PLSR model combined with sample sets divided by REDCAP-KS method, which can better meet the demand of hyperspectral inversion of SOC prediction.

     

/

返回文章
返回