首页 | 官方网站   微博 | 高级检索  
     

基于LightGBM算法的能见度预测模型
引用本文:余东昌,赵文芳,聂凯,张舸.基于LightGBM算法的能见度预测模型[J].计算机应用,2021,41(4):1035-1041.
作者姓名:余东昌  赵文芳  聂凯  张舸
作者单位:1. 北京城市气象研究院, 北京 100089;2. 北京市气象信息中心, 北京 100089;3. 北京市气象探测中心, 北京 100176;4. 信图智行(北京)科技有限公司, 北京 100022
摘    要:为了提高能见度预报的准确率,尤其是低能见度预报的准确率,提出一种基于集成学习随机森林和LightGBM的能见度预测模型。首先,以数值模式系统的气象预报数据为基础,结合地面气象观测数据、PM2.5浓度观测数据,利用随机森林算法构建特征向量;其次,针对不同时间跨度的缺失数据,设计了3种缺失值处理方法对缺失值进行替代,生成用于训练和测试的连续性较好的数据样本集;最后,建立基于LightGBM的能见度预测模型,并用网络搜索法对其进行参数优化。把所提模型与支持向量机(SVM)、多元线性回归(MLR)、人工神经网络(ANN)在性能上进行对比。实验结果表明,对于不同的等级的能见度,应用LightGBM的能见度预测模型获得预兆得分(TS)均较高,而对于<2 km的低能见度,该模型对各观测站点的能见度预测值与各观测站点的能见度实况值的平均相关系数为0.75,平均均方误差为6.49。可见基于LightGBM的预测模型能有效提高能见度预测精度。

关 键 词:能见度预测  集成学习  随机森林算法  LightGBM算法  
收稿时间:2020-10-13
修稿时间:2020-11-01

Visibility forecast model based on LightGBM algorithm
YU Dongchang,ZHAO Wenfang,NIE Kai,ZHANG Ge.Visibility forecast model based on LightGBM algorithm[J].journal of Computer Applications,2021,41(4):1035-1041.
Authors:YU Dongchang  ZHAO Wenfang  NIE Kai  ZHANG Ge
Affiliation:1. Beijing Institute of Urban Meteorology, Beijing 100089, China;2. Beijing Meteorological Information Center, Beijing 100089, China;3. Beijing Meteorological Observation Center, Beijing 100176, China;4. XinTuZhiXing (Beijing) Technology Corporation Limited, Beijing 100022, China
Abstract:In order to improve the accuracy of visibility forecast, especially the accuracy of low-visibility forecast, an ensemble learning model based on random forest and LightGBM for visibility forecast was proposed. Firstly, based on the meteorological forecast data of the numerical modeling system, combined with meteorological observation data and PM2.5 concentration observation data, the random forest method was used to construct the feature vectors. Secondly, for the missing data with different time spans, three missing value processing methods were designed to replace the missing values, and then the data sample set with good continuity for training and testing was created. Finally, a visibility forecast model based on LightGBM was established, and its parameters were optimized by using the network search method. The proposed model was compared to Support Vector Machine(SVM), Multiple Linear Regression(MLR) and Artificial Neural Network(ANN) on performance. Experimental results show that for different levels of visibility, the proposed visibility forecast model based on LightGBM algorithm obtains the highest Threat Score(TS); when the visibility is less than 2 km, the average correlation coefficient between the visibility values of observation stations predicted by the model and the observation values of visibility of observation stations is 0.75, the average mean square error between them is 6.49. It can be seen that the forecast model based on LightGBM can effectively improve the accuracy of visibility forecast.
Keywords:visibility forecast  ensemble learning  random forest algorithm  LightGBM algorithm  
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机应用》浏览原始摘要信息
点击此处可从《计算机应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号