首页 | 官方网站   微博 | 高级检索  
     

基于Python语言的ARIMA模型在天津市结核病发病率预测中的应用
引用本文:张晓卉,姚婷婷,陈阳,张甜甜,马骏.基于Python语言的ARIMA模型在天津市结核病发病率预测中的应用[J].中国感染控制杂志,2020,19(7):634-642.
作者姓名:张晓卉  姚婷婷  陈阳  张甜甜  马骏
作者单位:天津医科大学公共卫生学院流行病与卫生统计学系, 天津 300041
摘    要: 目的 探讨差分自回归移动平均(ARIMA)模型在结核病发病率预测中的可行性。方法 基于Python语言的statsmodels模块,以天津市2004年1月—2015年12月结核病月发病率数据作为训练集建立最优季节性差分自回归移动平均(SARIMA)模型,以2016年1—12月数据对SARIMA模型进行效果评价,并对2017年1月—2019年12月天津市结核病月发病率进行预测。结果 流行病学结果显示,2004年1月—2015年12月天津市结核病月发病率总体呈下降趋势。2005—2008年出现一个发病高峰,2009年后大幅度下降,随后趋于平稳。2017年1月—2019年12月天津市结核病月发病率与往年相比平稳下降。建立的最佳模型为SARIMA(1,1,1)×(3,1,1)12,该模型残差BOX-Ljung统计量P值为0.493,提示残差为白噪声序列,模型拟合良好。预测结果实际值均在预测值的95%置信区间。结论 SARIMA(1,1,1)×(3,1,1)12模型可对天津市结核病月发病率进行较准确的预测。

关 键 词:结核病  ARIMA时间序列  Python语言  发病率  预测  
收稿时间:2019/9/16 0:00:00

Application of ARIMA model in predicting the incidence of tuberculosis in Tianjin City based on Python language
ZHANG Xiao-hui,YAO Ting-ting,CHEN Yang,ZHANG Tian-tian,MA Jun.Application of ARIMA model in predicting the incidence of tuberculosis in Tianjin City based on Python language[J].Chinese Journal of Infection Control,2020,19(7):634-642.
Authors:ZHANG Xiao-hui  YAO Ting-ting  CHEN Yang  ZHANG Tian-tian  MA Jun
Affiliation:Department of Health Statistics, School of Public Health, Tianjin Medical University, Tianjin 300041, China
Abstract:Objective To evaluate feasibility of autoregressive integrated moving average (ARIMA) model in predicting the incidence of tuberculosis (TB). Methods Using statsmodels module-based Python language, incidence of TB in Tianjin City from January 2004 to December 2015 was as training set, the optimal seasonal ARIMA (SARIMA) model was established, data from January to December 2016 were used to evaluate the efficacy of SARIMA model, and monthly incidence of TB in Tianjin City from January 2017 to December 2019 was predicted. Results Epidemiological results showed that monthly incidence of TB in Tianjin showed a overall downward trend from January 2004 to December 2015. There was a of peak disease incidence in 2005-2008, which dropped sharply after 2009 and then stabilized. From January 2017 to December 2019, monthly incidence of TB in Tianjin City declined steadily compared with previous years. The established optimal model was SARIMA(1,1,1)×(3,1,1)12, residual BOX-Ljung statistic of the model was P=0.493, which indicated that the residual was a white noise sequence and the model fitted well. The actual value of predicted results was within 95% confidence interval of predicted value. Conclusion SARIMA (1,1,1)×(3,1,1)12 model can accurately predict the monthly incidence of tuberculosis in Tianjin City.
Keywords:tuberculosis|ARIMA time series|Python language|incidence|prediction
本文献已被 CNKI 等数据库收录!
点击此处可从《中国感染控制杂志》浏览原始摘要信息
点击此处可从《中国感染控制杂志》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号