基于深度神经网络与热红外成像高光谱数据的黑土土壤全磷含量预测研究

Prediction of Total Phosphorus Content Using Deep Neural Network and Thermal Infrared Imaging Hyperspectral Data in Black Soil

  • 摘要: 土壤磷素含量是评价土壤养分的重要指标之一。利用热红外发射率数据对土壤全磷含量进行反演的研究较少且多使用常规线性回归方法,本文利用东北海伦地区采集的热红外航空成像光谱仪TASI(Thermal Airborne Hyperspectral Imager)数据,通过机器学习方法探究黑土土壤发射率与全磷含量关系,选出最优模型对研究区土壤全磷含量进行预测研究。结果表明:在8 ~ 11.5 μm范围内,土壤热红外发射率值随着全磷含量的增加而增加;除原始光谱10.792 μm这一波段外,发射率值及其数学变换与全磷含量相关系数均在0.5以下,相关性较弱;在DNN(Deep Neural Networks)训练集和测试集中,从模型精度和PSO(Particle Swarm Optimiazation)算法优化时间来看,弹性传播训练算法(RP)表现最佳,决定系数R2分别为0.51、0.7,均方根误差RMSE分别为0.0443、0.0301;激活函数对本次构建的网络精度影响非常有限,激活函数为Tansig和ReLU的深度神经网络模型训练集精度基本与偏最小二乘及逐步回归模型一致,测试集精度有所提高但稳定性较为欠缺;研究区土壤全磷含量整体较高,水田和旱田含量均大于0.8 g kg−1,城镇及建筑群密集程度与土壤全磷含量呈负相关,与偏最小二乘模型对土壤全磷含量预测结果相比,神经网络模型对研究区土壤全磷含量小于0.6 g kg−1和0.6 ~ 0.8 g kg−1两区间做了更多划分,将更多人为活动密集区附近像元划分到该含量区间内,从而使得预测含量更加符合真实情况分布;总体来看,全磷含量最高值集中分布于研究区西部、西北及西南部,但在东部及其北部区域则呈无规律性分散分布,中部地区及其余地区含量值大多为0.8 ~ 1.0 g kg−1。综上,合适调参的深度神经网络模型在反演土壤元素类问题中的表现与偏最小二乘及逐步回归等方法相比更加具有发展的潜力,在样本数据量足够的前提下,深度神经网络能够得到充分训练从而使得预测结果更加精确、稳定。

     

    Abstract: Soil phosphorus content is one of the important indicators to evaluate soil nutrients. There are few researches on the inversion of soil total phosphorus content using thermal infrared emissivity data, and the conventional linear regression methods are mostly used to establish the model. In this paper, the TASI (Thermal Airborne Hyperspectral Imager) data were collected in the Hailun region of northeast China which were used to explore the relationship between soil emissivity and total phosphorus content in black soil by machine learning, with this result, the best model was selected to predict the total phosphorus content in the soil. The results show that in the range of 8 ~ 11.5 μm, the thermal infrared emissivity of soil increases with the increase of total phosphorus content; Except for the original spectrum 10.792 μm, the correlation coefficients between emissivity and its mathematical transformation and total phosphorus content are all less than 0.5, which shows that the correlation is weak; In the training set and testing set of DNN (Deep Neural Networks), RP performs best in terms of model accuracy and optimization time of PSO(Particle Swarm Optimiazation) algorithm, the determination coefficients R2 are respectively 0.51 and 0.7, the root mean square error RMSE are 0.0443 and 0.0301, respectively; In the further research, it was found that the change of activation function has very limited influence on the accuracy of the network, specifically, the training set accuracy of the DNN with activation function Tansig and ReLU is basically consistent with the partial least squares and stepwise regression model, and the accuracy of the test set is improved but the stability is relatively worse. The total phosphorus content of the soil is high, which is greater than 0.8 g kg−1 both in the paddy field and dry field, it can be conclude that the density of towns is negatively correlated with it. Compared with the prediction results of the partial least square model, the neural network model has made more divisions into two intervals of the content: < 0.6 g kg−1 and 0.6-0.8 g kg−1, more pixels near the active region are divided into the range, which makes the prediction more consistent with the real distribution. In general, the highest total phosphorus content is concentrated in the west, northwest and southwest of the study area, but it is scattered and irregularly distributed in the east and north. The content is mostly 0.8-1.0 g kg−1 in the central and other regions.. In summary, the deep neural network model with appropriate parameters has more potential than methods such as partial least square and stepwise regression in the inversion of soil element problems, it can be fully trained to make the prediction results more accurate and stable with sufficient sample data.

     

/

返回文章
返回