TY - JOUR
T1 - Machine learning forecasts for seasonal epidemic peaks
T2 - Lessons learnt from an atypical respiratory syncytial virus season
AU - Morbey, Roger A.
AU - Todkill, Daniel
AU - Watson, Conall
AU - Elliot, Alex J.
N1 - Publisher Copyright:
© 2023 Morbey et al.
PY - 2023/9
Y1 - 2023/9
N2 - Seasonal peaks in infectious disease incidence put pressures on health services. Therefore, early warning of the timing and magnitude of peak activity during seasonal epidemics can provide information for public health practitioners to take appropriate action. Whilst many infectious diseases have predictable seasonality, newly emerging diseases and the impact of public health interventions can result in unprecedented seasonal activity. We propose a Machine Learning process for generating short-term forecasts, where models are selected based on their ability to correctly forecast peaks in activity, and can be useful during atypical seasons. We have validated our forecasts using typical and atypical seasonal activity, using respiratory syncytial virus (RSV) activity during 2019 - 2021 as an example. During the winter of 2020/21 the usual winter peak in RSV activity in England did not occur but was 'deferred' until the Spring of 2021. We compare a range of Machine Learning regression models, with alternate models including different independent variables, e.g. with or without seasonality or trend variables. We show that the best-fitting model which minimises daily forecast errors is not the best model for forecasting peaks when the selection criterion is based on peak timing and magnitude. Furthermore, we show that best-fitting models for typical seasons contain different variables to those for atypical seasons. Specifically, including seasonality in models improves performance during typical seasons but worsens it for the atypical seasons.
AB - Seasonal peaks in infectious disease incidence put pressures on health services. Therefore, early warning of the timing and magnitude of peak activity during seasonal epidemics can provide information for public health practitioners to take appropriate action. Whilst many infectious diseases have predictable seasonality, newly emerging diseases and the impact of public health interventions can result in unprecedented seasonal activity. We propose a Machine Learning process for generating short-term forecasts, where models are selected based on their ability to correctly forecast peaks in activity, and can be useful during atypical seasons. We have validated our forecasts using typical and atypical seasonal activity, using respiratory syncytial virus (RSV) activity during 2019 - 2021 as an example. During the winter of 2020/21 the usual winter peak in RSV activity in England did not occur but was 'deferred' until the Spring of 2021. We compare a range of Machine Learning regression models, with alternate models including different independent variables, e.g. with or without seasonality or trend variables. We show that the best-fitting model which minimises daily forecast errors is not the best model for forecasting peaks when the selection criterion is based on peak timing and magnitude. Furthermore, we show that best-fitting models for typical seasons contain different variables to those for atypical seasons. Specifically, including seasonality in models improves performance during typical seasons but worsens it for the atypical seasons.
UR - http://www.scopus.com/inward/record.url?scp=85172139852&partnerID=8YFLogxK
U2 - 10.1371/journal.pone.0291932
DO - 10.1371/journal.pone.0291932
M3 - Article
C2 - 37738241
AN - SCOPUS:85172139852
SN - 1932-6203
VL - 18
JO - PLoS ONE
JF - PLoS ONE
IS - 9 September
M1 - e0291932
ER -