Using Partial Least Squares Regression to Fit Small Data of H7N9 Incidence Based on the Baidu Index

IEEE Access (2020)

Abstract
Internet search data can help disease control departments estimate disease activity in advance. The H7N9 epidemic in Guangxi Province is used as an example to demonstrate the association between an epidemic and Baidu search data. First, 16 search terms highly correlated with H7N9 were selected through expert judgment and calculation, and the number of disease cases was downloaded from the Guangxi CDC website. Because the number of epidemic case records is far smaller than the volume of Baidu search data, partial least squares regression was chosen for estimation after comparing several regression models. Cross validation and variable importance in projection were applied to filter the independent variables. The results show that: (1) the proposed method is suitable for fitting H7N9 data with few samples, with a very high goodness of fit; (2) cross validation and variable importance in projection help screen out the search indices most closely related to the H7N9 epidemic; (3) compared with PCA methods, the proposed method shows clear advantages on the performance indices, especially with the help of cross validation and variable importance in projection.
Keywords
Partial least squares regression, H7N9, Baidu index, variable importance in projection
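
The workflow described in the abstract (partial least squares regression on few samples, cross validation, and variable importance in projection) can be illustrated with a minimal sketch. This is not the authors' implementation: it is a plain-Python single-response PLS (NIPALS) under stated assumptions. The data are synthetic placeholders (12 samples, 16 predictors standing in for search terms), and the names `fit_pls1`, `predict`, `vip_scores`, and `loo_press` are hypothetical helpers introduced only for this example. Leave-one-out is assumed as the cross-validation scheme, which the abstract does not specify.

```python
import math
import random


def fit_pls1(X, y, n_components):
    """Fit single-response PLS by NIPALS on row lists X and targets y."""
    n, p = len(X), len(X[0])
    x_mean = [sum(row[j] for row in X) / n for j in range(p)]
    y_mean = sum(y) / n
    E = [[row[j] - x_mean[j] for j in range(p)] for row in X]  # centred X
    f = [v - y_mean for v in y]                                 # centred y
    W, P, Q, SS = [], [], [], []
    for _ in range(n_components):
        # Weight vector: direction in X space with maximal covariance with y.
        w = [sum(E[i][j] * f[i] for i in range(n)) for j in range(p)]
        norm = math.sqrt(sum(v * v for v in w))
        if norm < 1e-12:  # nothing left to explain
            break
        w = [v / norm for v in w]
        t = [sum(E[i][j] * w[j] for j in range(p)) for i in range(n)]  # scores
        tt = sum(v * v for v in t)
        p_load = [sum(E[i][j] * t[i] for i in range(n)) / tt for j in range(p)]
        q = sum(f[i] * t[i] for i in range(n)) / tt
        # Deflate X and y before extracting the next component.
        E = [[E[i][j] - t[i] * p_load[j] for j in range(p)] for i in range(n)]
        f = [f[i] - q * t[i] for i in range(n)]
        W.append(w)
        P.append(p_load)
        Q.append(q)
        SS.append(q * q * tt)  # variance of y explained by this component
    return {"x_mean": x_mean, "y_mean": y_mean, "W": W, "P": P, "Q": Q, "SS": SS}


def predict(model, x):
    """Predict one sample by replaying the component-wise deflation."""
    e = [xj - mj for xj, mj in zip(x, model["x_mean"])]
    y_hat = model["y_mean"]
    for w, p_load, q in zip(model["W"], model["P"], model["Q"]):
        t = sum(ej * wj for ej, wj in zip(e, w))
        y_hat += q * t
        e = [ej - t * pj for ej, pj in zip(e, p_load)]
    return y_hat


def vip_scores(model):
    """Variable importance in projection for each predictor."""
    p = len(model["x_mean"])
    total = sum(model["SS"])
    return [
        math.sqrt(p * sum(ss * w[j] ** 2 for ss, w in zip(model["SS"], model["W"])) / total)
        for j in range(p)
    ]


def loo_press(X, y, n_components):
    """Leave-one-out prediction error sum of squares (PRESS)."""
    press = 0.0
    for i in range(len(X)):
        model = fit_pls1(X[:i] + X[i + 1:], y[:i] + y[i + 1:], n_components)
        press += (y[i] - predict(model, X[i])) ** 2
    return press


# Synthetic placeholder data (NOT the paper's data): 12 samples, 16 predictors,
# of which the first 4 track a hidden signal that drives the response.
rng = random.Random(0)
z = [rng.gauss(0, 1) for _ in range(12)]
X = [[zi + rng.gauss(0, 0.1) for _ in range(4)] + [rng.gauss(0, 1) for _ in range(12)]
     for zi in z]
y = [3 * zi + rng.gauss(0, 0.1) for zi in z]

# Pick the number of components with the lowest leave-one-out PRESS.
press_by_k = {k: loo_press(X, y, k) for k in (1, 2, 3)}
best_k = min(press_by_k, key=press_by_k.get)
model = fit_pls1(X, y, best_k)
vip = vip_scores(model)
# A common rule of thumb (an assumption here, not taken from the abstract)
# keeps predictors whose VIP exceeds 1.
selected = [j for j, v in enumerate(vip) if v > 1.0]
print(best_k, selected)
```

PLS is a natural choice in this setting because it builds a small number of latent components from many correlated predictors, so it remains usable when predictors outnumber samples. The squared VIP scores average to 1 by construction, which is why 1 is a conventional cut-off for screening.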