孟 珊, 李新国, 焦 黎. 基于机器学习算法的湖滨绿洲土壤电导率高光谱估算模型[J]. 土壤通报, 2023, 54(2): 286 − 294. DOI: 10.19336/j.cnki.trtb.2022011003
引用本文: 孟 珊, 李新国, 焦 黎. 基于机器学习算法的湖滨绿洲土壤电导率高光谱估算模型[J]. 土壤通报, 2023, 54(2): 286 − 294. DOI: 10.19336/j.cnki.trtb.2022011003
MENG Shan, LI Xin-guo, JIAO Li. Hyperspectral Estimation Model of Soil Conductivity in the Lakeside Oasis Based on Machine Learning Algorithm[J]. Chinese Journal of Soil Science, 2023, 54(2): 286 − 294. DOI: 10.19336/j.cnki.trtb.2022011003
Citation: MENG Shan, LI Xin-guo, JIAO Li. Hyperspectral Estimation Model of Soil Conductivity in the Lakeside Oasis Based on Machine Learning Algorithm[J]. Chinese Journal of Soil Science, 2023, 54(2): 286 − 294. DOI: 10.19336/j.cnki.trtb.2022011003

基于机器学习算法的湖滨绿洲土壤电导率高光谱估算模型

Hyperspectral Estimation Model of Soil Conductivity in the Lakeside Oasis Based on Machine Learning Algorithm

  • 摘要:
      目的  为湖滨绿洲土壤高光谱估算土壤电导率值提供方法支持,实现区域土壤盐分快速估测。
      方法  利用实测的土壤电导率值与土壤高光谱数据联合分析,采用竞争自适应重加权采样(CARS)、连续投影算法(SPA)、遗传算法(GA)筛选土壤电导率的特征波段,并基于全波段及特征波段构建BP神经网络(BPNN)、支持向量机(SVM)、极限学习机(ELM)三种机器学习算法模型,引入偏最小二乘模型(PLSR)进行对照,比较其模型精度。
      结果  研究区土壤电导率值变化范围0.02~17.22 mS cm−1,平均值为2.61 mS cm−1,变异系数为134.87%,呈现强变异性;CARS、SPA、GA算法筛选的特征波段将建模输入量分别压缩至全波段数量的0.87%、1.68%、0.70%,减少建模输入量,提升建模速率,变量方法的选择CARS > SPA > GA;三种机器学习算法模型均优于PLSR模型,决定系数(R2)平均增加20.57%,相对分析误差(RPD)平均增加17.84%,土壤电导率高光谱估算模型以CARS-SVM最优,训练集与验证集R2分别为0.76和0.75,RMSE分别为1.79 和1.68 mS cm−1,RPD分别为2.04和2.00。土层深度20 ~ 30 cm的土壤电导率高光谱估算模型精度最高,训练集与验证集R2分别为0.83和0.84,RMSE分别1.37和1.77 mS cm−1,RPD分别为2.41和2.50。
      结论  基于CARS-SVM的土壤电导率高光谱估算模型精度高,估算能力最优,可以为湖滨绿洲土壤电导率估算提供科学参考。

     

    Abstract:
      Objective  The paper aims to provide method for estimating the soil conductivity of lakeside oasis soil by hyperspectral, so as to realize the rapid estimation of regional soil salinity.
      Method  Combined analysis of soil conductivity values and soil hyperspectral data, competitive adaptive reweighted sampling (CARS), successive projection algorithm (SPA) and genetic algorithm (GA) were used to screen the characteristic bands of soil conductivity. Based on the full band and characteristic band, three machine learning algorithm models, inlcuding BP neural network (BPNN), support vector machine (SVM) and extreme learning machine (ELM), were constructed, and the partial least squares model (PLSR) was introduced for comparing their accuracy.
      Result  The soil conductivity ranged from 0.20 to 17.22 mS cm−1 in the study area, with an average value of 2.61 mS cm−1 and a coefficient of variation of 134.87%, showing strong variability; The characteristic bands screened by the CARS, SPA, and GA algorithms compress the modeling input to 0.87%, 1.68%, and 0.70% of the total number of bands, respectively, which reduced the amount of modeling input and increased the modeling speed. The choice of variable method CARS > SPA > GA; The three machine learning algorithm models were all better than PLSR model. The coefficient of determination (R2) increased by 20.57% and the relative percent deviation (RPD) increased by 17.84% on average. The CARS-SVM was the best model for soil conductivity hyperspectral estimation, with R2 of 0.76 and 0.75 for training set and validation set, respectively, RMSE of 1.79 mS cm−1 and 1.68 mS cm−1, and RPD of 2.04 and 2.00, respectively; The soil conductivity hyperspectral estimation model with a soil depth of 20 ~ 30 cm has the highest accuracy, with R2 of 0.83 and 0.84 for training set and validation set, respectively, RMSE of 1.37 mS cm−1 and 1.77 mS cm−1, and RPD of 2.41 and 2.50, respectively.
      Conclusion  The soil conductivity hyperspectral estimation model based on CARS-SVM has high accuracy and optimal estimation ability, which can provide a scientific reference for the estimation of soil conductivity in lakeside oasis.

     

/

返回文章
返回