Evaluation of FluSight influenza forecasting in the 2021–22 and 2022–23 seasons with a new target laboratory-confirmed influenza hospitalizations

Mathis, Sarabeth M.; Webber, Alexander E.; León, Tomás M.; Murray, Erin L.; Sun, Monica; White, Lauren A.; Brooks, Logan C.; Green, Alden; Hu, Addison J.; Rosenfeld, Roni; Shemetov, Dmitry; Tibshirani, Ryan J.; McDonald, Daniel J.; Kandula, Sasikiran; Pei, Sen; Yaari, Rami; Yamana, Teresa K.; Shaman, Jeffrey; Agarwal, Pulak; Balusu, Srikar; Gururajan, Gautham; Kamarthi, Harshavardhan; Prakash, B. Aditya; Raman, Rishi; Zhao, Zhiyuan; Rodríguez, Alexander; Meiyappan, Akilan; Omar, Shalina; Baccam, Prasith; Gurung, Heidi L.; Suchoski, Brad T.; Stage, Steve A.; Ajelli, Marco; Kummer, Allisandra G.; Litvinova, Maria; Ventura, Paulo C.; Wadsworth, Spencer; Niemi, Jarad; Carcelen, Erica; Hill, Alison L.; Loo, Sara L.; McKee, Clifton D.; Sato, Koji; Smith, Claire; Truelove, Shaun; Jung, Sung-mok; Lemaitre, Joseph C.; Lessler, Justin; McAndrew, Thomas; Ye, Wenxuan; Bosse, Nikos; Hlavacek, William S.; Lin, Yen Ting; Mallela, Abhishek; Gibson, Graham C.; Chen, Ye; Lamm, Shelby M.; Lee, Jaechoul; Posner, Richard G.; Perofsky, Amanda C.; Viboud, Cécile; Clemente, Leonardo; Lu, Fred; Meyer, Austin G.; Santillana, Mauricio; Chinazzi, Matteo; Davis, Jessica T.; Mu, Kunpeng; Pastore y Piontti, Ana; Vespignani, Alessandro; Xiong, Xinyue; Ben-Nun, Michal; Riley, Pete; Turtle, James; Hulme-Lowe, Chis; Jessa, Shakeel; Nagraj, V. P.; Turner, Stephen D.; Williams, Desiree; Basu, Avranil; Drake, John M.; Fox, Spencer J.; Suez, Ehsan; Cojocaru, Monica G.; Thommes, Edward W.; Cramer, Estee Y.; Gerding, Aaron; Stark, Ariane; Ray, Evan L.; Reich, Nicholas G.; Shandross, Li; Wattanachit, Nutcha; Wang, Yijin; Zorn, Martha W.; Aawar, Majd Al; Srivastava, Ajitesh; Meyers, Lauren A.; Adiga, Aniruddha; Hurt, Benjamin; Kaur, Gursharn; Lewis, Bryan L.; Marathe, Madhav; Venkatramanan, Srinivasan; Butler, Patrick; Farabow, Andrew; Ramakrishnan, Naren; Muralidhar, Nikhil; Reed, Carrie; Biggerstaff, Matthew; Borchering, Rebecca K.

doi:10.1038/s41467-024-50601-9

Evaluation of FluSight influenza forecasting in the 2021–22 and 2022–23 seasons with a new target laboratory-confirmed influenza hospitalizations

Article
Open access
Published: 26 July 2024

Volume 15, article number 6289, (2024)
Cite this article

Download PDF

You have full access to this open access article

From

View current issue

Evaluation of FluSight influenza forecasting in the 2021–22 and 2022–23 seasons with a new target laboratory-confirmed influenza hospitalizations

Download PDF

1418 Accesses
1 Citation
15 Altmetric
2 Mentions
Explore all metrics

Abstract

Accurate forecasts can enable more effective public health responses during seasonal influenza epidemics. For the 2021–22 and 2022–23 influenza seasons, 26 forecasting teams provided national and jurisdiction-specific probabilistic predictions of weekly confirmed influenza hospital admissions for one-to-four weeks ahead. Forecast skill is evaluated using the Weighted Interval Score (WIS), relative WIS, and coverage. Six out of 23 models outperform the baseline model across forecast weeks and locations in 2021–22 and 12 out of 18 models in 2022–23. Averaging across all forecast targets, the FluSight ensemble is the 2^nd most accurate model measured by WIS in 2021–22 and the 5^th most accurate in the 2022–23 season. Forecast skill and 95% coverage for the FluSight ensemble and most component models degrade over longer forecast horizons. In this work we demonstrate that while the FluSight ensemble was a robust predictor, even ensembles face challenges during periods of rapid change.

Collaborative efforts to forecast seasonal influenza in the United States, 2015–2016

Article Open access 24 January 2019

Comparative assessment of methods for short-term forecasts of COVID-19 hospital admissions in England at the local level

Article Open access 21 February 2022

Results from the centers for disease control and prevention’s predict the 2013–2014 Influenza Season Challenge

Article Open access 22 July 2016

Introduction

Traditional influenza surveillance systems provide a comprehensive picture of influenza activity in the United States^1,2,3 and are fundamental for situational awareness and risk communication. However, they measure influenza activity after it has occurred, and do not directly anticipate future trends to inform risk assessment and healthcare preparedness. To address these limitations, the Centers for Disease Control and Prevention (CDC) has supported open influenza forecasting challenges since the 2013–14 season⁴. This collaborative process (named FluSight) has ensured that forecasting targets are relevant to public health. Additionally, forecast data are openly available, which enables transparent evaluation of forecast performance^5,6.

Originally the FluSight collaboration focused on short-term forecasts of outpatient influenza-like-illness (ILI) rates from ILINet² and corresponding results have been summarized previously^4,5,6. However, the COVID-19 pandemic resulted in changes in outpatient care-seeking behavior, and the continued co-circulation of SARS-CoV-2 has further complicated the interpretation of ILI data. In the 2021–22 influenza season, the FluSight forecast target shifted to the weekly number of hospital patients admitted with laboratory-confirmed influenza from the Health and Human Services (HHS) Patient Impact and Hospital Capacity Data System⁷. This system was created during the COVID-19 pandemic to gather a complete and unified representation of COVID-19 disease outcomes along with other metrics related to healthcare capacity. Hospitals registered with the Centers for Medicare and Medicaid Services (CMS) are required to report daily COVID-19 and influenza information⁸. Reporting of the influenza data elements, including the previous day’s number of admissions with laboratory-confirmed influenza virus infection, became mandatory on February 2, 2022,⁸. Although influenza activity, has been monitored throughout the US for decades through multiple surveillance systems, this dataset is the first with laboratory-confirmed influenza hospital admissions reported systematically across all 50 states and additional territories^1,2,3,8.

The COVID-19 pandemic disrupted the typical timing, intensity, and duration of seasonal influenza activity in the United States and many parts of the world^9,10. Influenza activity was very low during the 2020–21 season in the U.S., but activity increased during the 2021–22 season, with activity peaking later in April, May, and early June 2022 and remaining at higher levels than had been reported during these months in previous seasons¹⁰. In the 2022-23 influenza season, activity began increasing nationally in early October, earlier than previous seasons^2,3,11, and peaked in early December 2022.

In this analysis, we summarize the accuracy and reliability of ensemble and component 1- to 4-week ahead forecasts of laboratory-confirmed influenza hospital admissions submitted in real-time during the 2021–22 and 2022-23 seasons. Our objective was to consider potential changes in performance of these forecasts in post-COVID influenza seasons, especially given atypical timing and intensity. By evaluating forecast performance for a new forecast target with limited calibration data, we identify specific areas for forecast improvement.

Results

The 2021–22 influenza season was characterized by two distinct waves of activity. The first occurred between November 2021 and January 2022 and the second between February and June 2022, though reporting of influenza hospitalizations was not mandatory in the HHS system until February 2, 2022 (see observed data in Fig. 1a). Reported national weekly influenza hospital admissions exceeded 1000 for 22 out of 25 of the forecast weeks (Fig. 1a). Updates to weekly counts from the forecast evaluation period were generally minimal (Supplementary Figs. 2–4), with 94% of updates during the 2021–22 season resulting in changes of under 10 hospitalizations for subnational jurisdictions. There were infrequent larger updates (10 or greater) to reported admissions.

**Fig. 1: National incident weekly hospital admissions and select forecasts.**

The 2022-23 influenza season was characterized by an early start, reaching 1000 hospital admissions nationally before October 2022. A sharp increase nationally through October and November led to a peak of 26,600 hospital admissions in early December. Hospital admissions decreased rapidly after December, with 3000 weekly hospital admissions by the end of January, and eventually dropped below 1000 confirmed weekly admissions nationally by May 2023. Weekly numbers of admissions exceeded 1000 for 27 out of 34 of the forecast weeks (Fig. 1b, Supplementary Fig. 4). In the 2022–23 season, 83% of updates for weekly admissions resulted in changes of under 10 hospitalizations for subnational jurisdictions. There were infrequent larger updates to reported admissions and often updates occurred within two weeks of initial publication.

Models Included

For both the 2021–22 and 2022–23 influenza seasons, 26 modeling teams submitted forecasts, and 21 and 16, respectively, were eligible for end-of-season evaluation, not including the FluSight baseline and ensemble models. The number and types of models included in the primary analysis (based on the inclusion criteria) varied across weeks with a range of methodological approaches (see Supplementary Table 1). For the 2021–22 season, a median of 20 included models was submitted (range: 15–21), with most having a statistical component, three mechanistic, and six ensembles of component models. In 2022–23 there was a median of 15 included models (range: 10–16) submitted each week, with many having a statistical component, three mechanistic, and four ensemble models. Top performing models in the 2021–22 season included statistical, mechanistic and ensemble models. In 2022–23, top performing models included mechanistic, statistical, ensemble, and one machine learning model. There were also statistical, mechanistic, AI or machine learning, and ensemble models among models with lower performance across seasons. Modeling teams varied across seasons, with 13 modeling groups having submitted eligible forecasts for both seasons. When only national forecasting targets were considered, no additional teams were included for the 2021–22 season, but two teams, NIH-Flu_ARIMA and ISU_NiemiLab-Flu, met the inclusion criteria for 2022–23 (Supplementary Analysis 3). Visualizations of all forecasts as of the date they were submitted are included in an interactive dashboard¹².

Relative WIS

Over the evaluation period, more models outperformed the FluSight baseline model in 2022–23 (12) than in 2021–22 (6) based on relative WIS (Table 1). Within each season, the models that achieved an overall relative WIS less than or equal to one represent a variety of modeling strategies, including a basic quantile autoregression fit, a mechanistic compartmental model with stochastic simulations, an ensemble of time-series baseline models, a random walk model, a random forest ensemble, and the FluSight ensemble (Supplementary Table 1). Similar results were observed when models were evaluated based on absolute error of the median of probabilistic forecasts (see MAE estimates in Table 1).

Table 1 Performance metrics for teams meeting inclusion criteria

Full size table

Few teams outperformed the FluSight Ensemble in relative WIS for both seasons. The CMU-TimeSeries model was the only model that outperformed the ensemble for both the 2021–22 and 2022–23 seasons, while the MOBS-GLEAM_FLUH, PSI-DICE, and MIGHTE-Nsemble models outperformed the ensemble only in the 2022–23 season.

For both seasons, forecasts from the FluSight Ensemble were ranked among the top 50% of all model forecasts for the same location, date, and target, more than three-fourths of the time (79.73% in 2021–22 and 78.83% in 2022–23) (Fig. 2). Three models consistently ranked in the top 25% for 2021–22 and 2022–23 seasons, respectively: CMU-TimeSeries (42.47%, 36.14%), PSI-DICE (39.34%, 39.87%), and MOBS-GLEAM_FLUH (38.97%, 50.33%). Several models, seven in 2021–22 and five in 2022–23, had bimodal rank distributions, with a combined majority of their forecasts falling in either the bottom 25% or top 25% (Fig. 2).

**Fig. 2: Standardized rank by season.**

Log-transformed analysis

For both seasons, the analysis using log-transformed hospitalization counts resulted in the same top five performing teams in terms of absolute and relative WIS. For the 2021–22 season, all teams were ranked the same for the log-transformed and non-transformed analyses. In 2022–23, MIGHTE-Nsemble and PSI-DICE performed better than CMU-TimeSeries for the log-transformed analysis (Table 1 and Supplementary Analysis 2).

Relative WIS and Spatial Variation

Model performance varied by spatial jurisdiction. For individual states, relative WIS values varied across models ranging from 0.46 to 12.58 in 2021–22 and 0.32 to 12.35 in 2022–23 (Fig. 3). More models, including the ensemble, performed better at the state-level than the baseline in 2022–23 compared to 2021–22. The relative WIS of the FluSight Ensemble had the smallest range of values across all locations from 0.58 to 1.08 in 2021–22 to 0.63 to 1 in 2022–23 (Fig. 3 and Supplementary Fig. 1). To further examine forecast performance across jurisdictions, we considered the percent of jurisdictions that the relative WIS value for a given model and location pair was less than the baseline (i.e., lower than 1). The FluSight Ensemble performed as well as or better than the baseline for all forecast jurisdictions for 2022–23 and 47 out of 52 forecast jurisdictions for 2021–22, a larger number of jurisdictions than any submitted model (Fig. 3). In 2022–23, 12 models performed better than the baseline at the jurisdiction-level at least 50% of the time, compared to five models in 2021–22. In general, the models with lower (better) relative WIS values were consistent between the analysis with all spatial jurisdictions and the analysis considering only national forecast targets for both seasons (Supplementary Analysis 3).

**Fig. 3: Relative WIS by state and model. State-level WIS values for each team relative to the FluSight baseline model.**

Absolute WIS

Across forecasted weeks, the FluSight Ensemble’s worst performance in terms of absolute WIS (maximum values) for 1-week ahead targets occurred on March 19, 2022 for 2021–22 and on November 26, 2022 for 2022–23 (Fig. 4). For the 4-week ahead horizon, maximum absolute values, indicating the worst performance, for each season occurred on June 04, 2022, and December 03, 2022, respectively (Fig. 4). Minimum, or best, absolute WIS values for each season occurred on July 16, 2022, and May 13, 2023, respectively, both during periods of low flu activity.

Coverage

Model performance for the FluSight Ensemble dropped during periods of relatively rapid change (see Figs. 1 and 3). The lowest 1-week horizon 95% value occurred for forecasts with target end dates of March 14, 2022, for 2021–22 and on November 21, 2022, for 2022–23 (Fig. 5). Across forecasted weeks in the 2021–22 season, the FluSight Ensemble had a minimum 95% coverage value at the 1-week horizon of 75%. Lower 95% coverage for the 1-week horizon was observed in the 2022–23 season with a minimum of 29%. The maximum coverage rate achieved by the FluSight Ensemble in any individual week was 100% in both seasons. Minimum FluSight Ensemble 95% coverage values for forecasts at the 4-week horizon in any individual week were 62% for 2021–22 and 15% for 2022–23.

Model performance, in terms of coverage, tended to decline at longer time horizons for the FluSight Ensemble, baseline, and individual contributed models (see Table 2). Over the forecast weeks, the 2021–22 FluSight ensemble had slightly higher overall 95% coverage values of 89.32%, 86.11%, 85.15%, and 83.33% for the 1 to 4-week ahead horizons respectively, compared to the 2022–23 season during which the FluSight Ensemble had 95% coverage values of 85.79%, 81.64%, 78.78%, and 77.85% for the 1 to 4-week ahead horizons respectively. A similar proportion of models had higher overall 95% coverage values at the 1-week ahead horizon than at the 4-week ahead horizon for 2022–23 (14 of 18 models) and 2021–22 (18 of 23 models) (Table 2). Out of the forecast targets and across forecast weeks, the FluSight Ensemble’s 95% prediction interval contained at least 90% of the corresponding observed values only 55.56% and 64.52% of the time, for 2021–22 and 2022–23, respectively (Table 2). Ideally 95% prediction intervals are just wide enough to capture 95% of eventually observed values.

Table 2 One-to-four-week coverage and one-to-four-week percent of coverage above 90% for teams meeting inclusion criteria. One-to-four-week is abbreviated with each number and “Wk” indicates week

Full size table

Discussion

The 2021–22 influenza season marked the return of from very low levels of seasonal influenza activity observed in the U.S. following the first years of the COVID-19 pandemic, and many components of the 2021–22 and 2022–23 FluSight Forecasting Challenges were new. One of the most substantial changes was the shift from the original FluSight forecasting targets of weekly influenza-like-illness (ILI) percentages to weekly counts of confirmed influenza hospitalizations. The COVID-19 pandemic resulted in the availability of a new data source, the unified HHS-Protect dataset, which provided information on laboratory-confirmed daily influenza hospitalizations from all 50 states, D.C., and Puerto Rico^7,8. Confirmed influenza hospital admissions may more directly inform influenza preparedness and response efforts. During the time period that these forecasting results cover, data were reported daily, with mandatory reporting for influenza admissions from most hospitals in each state, U.S. territories, and D.C starting February 2, 2022. Despite challenges accompanying the shift to the new target of influenza hospitalizations, such as limited historic data from this system for model training, these forecasts provided substantial utility and reinforced a number of lessons learned over the course of previous forecasting activities, both during the pre-pandemic influenza seasons and the COVID-19 pandemic.

Forecast performance–accuracy

As demonstrated in this analysis, collaborative forecasting hub approaches provide opportunities to systematically evaluate performance across multiple modeling strategies and enable the creation of ensemble models. Since a particular model’s performance often varies within and across seasons¹³, it is helpful to have a unified representation of model inputs that can be used to quickly assess expected upcoming trends. Additionally, this work indicates that ensemble models may also provide more consistently reliable and well-calibrated forecasts across spatial jurisdictions.

Evaluated models cover mechanistic, statistical, ensemble, and AI or machine learning models (see Table 1 and Supplementary Table 1 for additional information). The diversity of model types among the top-performing models was consistent across seasons. In light of this heterogeneity in top-performing model structures and the many dimensions of differences across forecasting model it has not yet been possible to identify particular characteristics of individual models that are most often associated with high performance. Individual models often vary greatly in their performance within and across seasons (Fig. 1c–e). Across the evaluation period for both seasons and all forecast jurisdictions, the FluSight ensemble was among the top 5 performing models in terms of Absolute WIS and Relative WIS. Additionally, when considering forecast performance by rank (Fig. 2), the FluSight ensemble more accurately predicted weekly influenza hospital admissions than most contributed models with the majority of the FluSight ensemble forecasts falling within the top 50% of submitted forecasts (Table 1, Fig. 2). While the PSI-DICE, CMU-TimeSeries, and MOBS-GLEAM_FLUH models have more forecasts in the top 25%, they exhibit higher spatial heterogeneity than the FluSight ensemble in forecast performance (Fig. 3). The generally high accuracy of the FluSight Ensemble relative to that of individual models is consistent with previous findings that ensemble models, that utilize the outputs from multiple teams, generally outperform individual models on average^14,15,16,17. Like most models, ensembles may have decreased performance during periods of rapid change when some individual models may have higher accuracy (Fig. 1c, d); however, identifying these time frames and corresponding high-performing models has been difficult a priori^5,6.

One option to better evaluate forecast performance during periods of change and across multiple magnitudes is to evaluate transformed counts¹⁸. We did not find notable differences in model performance using this approach in either season. We expected that there might be a stronger influence on performance in the 2022–2023 season which saw a sharp increase in hospitalizations in fall 2022, but it is possible that models were not able to capture this initial rise and thus did not accrue additional benefit in the log transform score. The long tail of the season may also have elevated scores across all models.

Forecast model performance tended to decline over longer time horizons. For both the 2021–22 and 2022–23 FluSight seasons, accuracy declined across the 1–4 week ahead horizons. This trend has been observed previously in multiple forecast activities. The U.S. COVID-19 Forecast Hub observed declines in accuracy for forecasted deaths over periods of 1–4 weeks ahead, and German and Polish COVID-19 forecast efforts also showed declines in performance at the 3- and 4-week ahead horizons¹⁹. Accuracy scores were also shown to decline over longer time horizons for influenza-like-illness forecasts¹³.

Across the forecast weeks, individual models often showed larger increases in absolute WIS, while the FluSight ensemble had the smallest range of absolute WIS for each season, demonstrating one aspect of stability for the FluSight ensemble. In terms of state-level performance, the FluSight ensemble tended to be more robust than individual models, as measured by relative WIS scores (Fig. 3). Similarly, the COVID-19 Forecast Hub ensemble performed better across all locations, with the COVID-19 Hub ensemble being the only model to outperform the baseline in each of the forecast locations¹⁴.

Forecast performance – coverage

Our analysis found that, as the forecast horizon moved from 1 to 4-weeks, the FluSight ensemble 95% prediction interval coverage declined from 89.61% to 83.74% in 2021–22 and from 85.69% to 77.85% in 2022–23. These results highlight room for improvement in model calibration, as almost all models (with the exception of the UMass trends ensemble) were overconfident in their predictions (Table 2). The lack of comparable historical data for model fitting may have contributed to poor calibration of 95% prediction intervals.

Consistent with past forecasting efforts, forecasting remains difficult in periods of rapid change and epidemic turning points (e.g., during initial increases or periods of peaking activity). This analysis highlights declines in forecast accuracy and coverage during periods of rapid change in influenza hospitalizations during both the 2021–22 and 2022–23 seasons. For example, the only model that had 95% coverage greater than 80% from October to January 2023 when hospitalizations were rapidly increasing and then peaking was LUCompUncertLab-humanjudgment, which did not end up meeting the inclusion criteria for the full season analysis. Analogous declines were also observed for COVID-19 case forecasts²⁰ and mortality forecasts across different waves of the COVID-19 pandemic¹⁴, where forecasts systematically underpredicted during periods of increase and overpredicted during periods of decrease.

Times of changing dynamics are the most important periods for public health response and communication. While forecasting the magnitude at these times may be less tractable, it is possible that we may be able to provide more reliable information during these difficult forecasting periods so that forecasts are better able to inform critical planning. In general, most ensembles tend to predict less activity than observed when trends are steeply increasing and predict more activity than observed when trends are steeply decreasing, especially when there is between- or within-model uncertainty in the timing of peaks in cases, hospitalizations, or deaths. Thus, it may be possible that an ensemble of forecasts for categorical increases or decreases in activity²¹ may have additional utility in terms of preserving valuable information while also maintaining the benefits of the use of ensembles over individual models. As such, the FluSight Forecasting Hub added an experimental target in the 2022–23 season for forecasting categorical rate changes in influenza hospitalizations (e.g., probabilities of increase or decrease)²². Assessing the utility of this additional forecast target will be an important area of investigation moving forward. Aside from soliciting a separate forecasting target, it may be possible to determine which forecasting models perform better during different phases of epidemics and then use this information to weight models accordingly when their forecasts are aggregated into an ensemble²³.

Influenza forecasting in the COVID-19 era: challenges and opportunities

Several challenges for forecasting existed during the 2021–22 and 2022–23 influenza seasons. First, as noted earlier, the change in the forecasting target from outpatient ILI percentages to counts of influenza-associated hospitalizations from a data collection system established during the COVID-19 pandemic meant that there was little data for forecast calibration and training. This shift also required changes in data processing for teams that had produced ILI forecasts previously. While previous data on influenza-associated hospitalizations was available through the FluSurv-NET system, differences in reporting and the spatial resolution, of the FluSurv-NET system may have complicated the process of utilizing this dataset for the purpose of forecasting model calibration. In addition, reporting within the unified HHS-Protect hospitalization dataset changed throughout this forecasting endeavor. For example, the confirmed influenza hospital admissions field only became mandatory for the 2021–22 season on February 2, 2022, leading to an increase in the number of reported hospitalizations and a change in hospital reporting practices during a period of increasing influenza activity.

In addition to changing reporting patterns, the COVID-19 pandemic brought other challenges for forecasting influenza, including changing human behavior. The quantity and types of interactions between people likely changed in tandem with perceptions of the risk of illness with COVID-19. In addition, the use of nonpharmaceutical interventions (NPIs) aimed at preventing SARS-CoV-2 transmission (e.g., stay-at-home orders, mask-wearing) reduced transmission of other respiratory pathogens⁹, including influenza. These changes in behavior may be related to the minimal influenza activity observed in the U.S. in the 2020–21 season and the low severity but atypically late influenza season observed in the 2021–22 season. Population-level behavior is difficult to predict, especially in the context of changing public health recommendations and emerging SARS-CoV-2 variants, which complicates the process of forecasting. Despite these challenges, FluSight forecasting teams provided forecasts of confirmed influenza hospitalizations throughout each season, which helped public health officials anticipate trends during the unusually prolonged influenza season in 2021–22, with forecasting efforts extending into June, and then again for the atypically early 2022–23 season.

While the shift to forecasting for a new target presented a modeling challenge, the utility of the corresponding new data source should be recognized²⁴. The HHS-Protect dataset⁷provided, in addition to the state-level timeseries, facility-level data, which is at a higher spatial resolution than other indicators of influenza activity. During the forecasting time frame analyzed here, the data were also reported daily with previous-day admission data published as soon as the day after their occurrence, providing a timely source of information. As our data update analysis (Supplementary Figs. 2–4) shows, these data demonstrated remarkably stable reporting behavior, particularly during the 2021–22 season, with 94% of updates resulting in changes of under 10 hospitalizations for subnational jurisdictions. Stability of reporting decreased slightly during the 2022–23 season, with 83% of updates resulting in changes of under 10 hospitalizations for subnational jurisdictions. Degraded forecast performance has been associated with large revisions to initially observed values⁶, and consistency in reporting is an important component of a reliable forecasting target. Additionally, this dataset provided national and jurisdictional-level data for confirmed influenza hospital admissions. In contrast with ILI, this indicator eliminated the need to model outpatient visits associated with co-circulating non-influenza pathogens that can cause ILI. The continued availability of rapid, disease-specific indicators of hospitalization, such as those provided by these data, will facilitate improved forecasting utility and possibly improvements in accuracy²⁵, particularly when forecasts are informed by mechanistic transmission models.

The FluSight forecasting collaboration adapted quickly in 2021 to utilize a novel laboratory-confirmed influenza hospital admission dataset. Even with limited calibration data and atypical influenza seasonality in the 2021–22 and 2022–23 seasons, the FluSight ensemble forecast provided more robust forecasts than individual component models across spatial jurisdictions and time horizons. This result mirrors those of other forecasting hubs. Collaborative hubs also offer the ability for frequent feedback and interaction between modeling teams, providing opportunities for rapidly sharing observations about underlying data and insights for forecast development²⁶. We observed poor coverage and general performance, especially at the beginning of the 2022–23 season and during other periods of rapid change. Collective insights from these challenges can also inform when forecasts should be interpreted with extra caution. Ongoing availability of the confirmed influenza hospitalization dataset, which covers all states, could improve model calibration and ultimately contribute to the improvement of influenza forecast performance and utility, as well as continued exploration and improvement of forecasting and ensembling methodologies. These improvements are needed, particularly to more accurately capture trends and appropriate levels of uncertainty during times of rapid change.

Methods

Forecasts of weekly influenza hospital admissions were openly solicited from existing COVID-19 and influenza forecasting networks every Monday from January 10, 2022, through June 20, 2022, for the 2021–22 season. For the 2022–23 season, forecasts were solicited every Monday from October 17, 2022, through January 9, 2023, and then every Tuesday from January 17, 2023, through May 17, 2023. Weeks were defined in terms of MMWR Epiweeks (EW) spanning Sunday to Saturday²⁷. Forecasted jurisdictions included the U.S. national level, all fifty states, Washington D.C., and Puerto Rico. Forecasts for the Virgin Islands, while requested, were not included in this evaluation due to low reported hospitalization counts and irregular data submission. Each week, forecasting teams were asked to provide jurisdiction-specific point estimates and probabilistic predictions for 1, 2, 3, and 4-week ahead weekly counts of confirmed influenza hospital admissions. A total of 23 quantiles were requested for the probabilistic forecasts: 0.010, 0.025, 0.050, 0.100, 0.150, …, 0.950, 0.975, and 0.990. Teams were not required to submit forecasts for all four weeks ahead or for all locations. Additional details of the forecast submission process (e.g., file formatting, submission procedures, and required metadata) are provided in the FluSight-forecast-data GitHub Repository²².

The FluSight Ensemble model was generated for all forecasted jurisdictions each week using the unweighted median of each quantile among eligible forecasts. Forecasts were considered eligible for inclusion in the ensemble if they were submitted by 11:59 PM ET on the due date and if all requested quantiles were provided. Modeling teams could further designate whether a particular model’s forecasts should be included in the ensemble. If a forecast was designated as “other”, it was not included in the FluSight ensemble and not evaluated in this manuscript.

Baseline forecasts and their prediction intervals were generated each week using the ‘quantile baseline’ method in the simplets R package²⁸ based on the incident hospitalizations reported in the previous week, with underlying methodology described as follows. The median prediction of the baseline forecasts is the corresponding target value observed in the previous week, and noise around the median prediction is generated using positive and negative 1-week differences (i.e., differences between consecutive reports) for all prior observations, separately for each jurisdiction. Sampling distributions were truncated to prevent negative values. The same median prediction is used for the 1-through 4-week ahead forecasts. The baseline model’s prediction intervals are generated from a smoothed version of this distribution of differences^14,29.

For inclusion in this analysis, forecasting teams must have submitted greater than or equal to 75% of the requested targets, for subnational jurisdictions, between the forecast evaluation period of February 21, 2022, to June 20, 2022 (total of 18 weeks) for 2021–22 or October 17, 2022, to May 15, 2023 (total of 30 weeks) for 2022–23. These periods translate to 4-week ahead forecast target end dates of March 19, 2022–July 16, 2022 for the 2021–22 season and November 11, 2022–June 10, 2023 for the 2022–23 season. The start date of the evaluation period for the 2021–22 season was chosen to be the first forecast date following two weeks of mandatory reporting of confirmed influenza hospitalizations⁸ to minimize the potential effects of reporting changes on forecasts. For 2021-22 and 2022-23, three and 12 models were excluded from the primary analysis, respectively, for not meeting the inclusion criteria.

Forecasts were evaluated against the reported number of the previous day’s laboratory-confirmed influenza admissions (Field #34) from the COVID-19 Reported Patient Impact and Hospital Capacity by State Timeseries^7,8, with data shifted one day earlier to align with admission date and then aggregated to the weekly scale (from Sunday to Saturday)²², using data as of September 12, 2022, for 2021–22 and June 13, 2023, for 2022–23. This dataset is subject to revision by submitting facilities; therefore, we analyzed backfill and revision for each season (Supplementary Analysis 1). For each of the contributed forecasts included in the analysis, values were rounded to more closely relate the values of prediction intervals of forecasts to the reported numbers of hospital admissions. In particular, forecast values for quantiles less than 0.5 were rounded down, values for quantiles greater than 0.5 were rounded up, and values for the 0.5 quantiles were rounded normally. This rounding procedure ensured that teams were not penalized for missing the prediction interval by less than one hospital admission.

To evaluate forecast performance across all states, D.C., and Puerto Rico, we primarily used the Weighted Interval Score (WIS). The WIS is a proper score that generates interval scores for probabilistic forecasts provided in the quantile format^14,19. Briefly, interval scores are used to account for dispersion, underprediction, and overprediction. Forecasts with lower absolute WIS values are considered more accurate than forecasts with higher absolute WIS values. The relative WIS computes the ratio of average WIS values for each pair of models on the subset of forecasts that both models provided and then normalizes by the mean pairwise WIS ratio for the baseline model (See “Supplementary Methods”). Relative WIS values were calculated using the scoringutils package³⁰. Simple means were calculated for absolute and relative WIS to get a score for each model, location, and season. Median absolute error (MAE) values are also considered for characterizing differences between forecasted and reported weekly hospitalizations¹⁴. Unless otherwise specified, forecasts of national hospitalizations were not included in summary metrics for accuracy (e.g., absolute WIS) since these forecasts can have a disproportionate impact on the overall score. To address concerns related to assessing measures of absolute error on a natural scale when forecasts span multiple orders of magnitude¹⁸, we performed an analogous analysis on log-transformed hospitalization counts after adding one to all counts to account for zero counts (Supplementary Analysis 2). We also performed a separate analysis including only national forecasts (Supplementary Analysis 3).

In addition, we considered coverage values of the quantile-based prediction intervals to assess each model’s ability to appropriately capture uncertainty in forecasts. Coverage values are defined as the percent of observed values that fall within the 50% or 95% prediction intervals for the corresponding date. Ideally, the percent coverage values will be equal to the corresponding prediction interval, e.g., 95% percent prediction intervals should contain the reported value 95% of the time.

Comparing model forecasts is complicated by the fact that not all models submit forecasts for each of the forecast targets and for each forecast week in the evaluation period. To partially account for this, we consider the percentage of forecasts submitted as an indicator of how often and how many different types of forecasts were submitted by each team. Following Cramer et al.¹⁴, we also consider a standardized rank score that uses the number of models forecasting a particular location and target and then ranks these forecasts. Ranks were determined by relative WIS performance, with the best-performing model for each observation being assigned a rank of 1 and the worst-performing model receiving a rank equal to the number of models submitting a forecast for the observation. These ranks were standardized by rescaling so that 0 corresponds to the worst rank and 1 corresponds to the best rank.

All analyses were conducted using the R language for statistical computing (version 4.0.3)³¹ with scoringutils (version 1.2.2) to generate scores³⁰.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

The forecast data for each model are available from the FluSight Forecast Hub GitHub repository (https://github.com/cdcepi/Flusight-forecast-data; https://doi.org/10.5281/zenodo.12686773)²² and the Zoltar forecast archive¹² (https://zoltardata.com/project/299 /viz). These are both publicly accessible. The target data are also available as daily counts for each jurisdiction from HHS⁷.

Code availability

The code used to generate all figures and tables in the manuscript are available in a public repository (https://github.com/cdcepi/FluSight-manuscripts, https://doi.org/10.5281/zenodo.12625724)³².

References

CDC. Weekly U.S. Influenza Surveillance Report. https://www.cdc.gov/flu/weekly/index.htm (2023).
CDC. U.S. Outpatient Influenza-like Illness Surveillance Network (ILINet). https://wwwn.cdc.gov/ILINet/ (2023).
CDC. Influenza Hospitalization Surveillance Network (FluSurv-NET). https://www.cdc.gov/flu/weekly/influenza-hospitalization-surveillance.htm (2023).
Lutz, C. S. et al. Applying infectious disease forecasting to public health: a path forward using influenza forecasting examples. BMC Public Health 19, 1659 (2019).
Article PubMed PubMed Central Google Scholar
McGowan, C. J. et al. Collaborative efforts to forecast seasonal influenza in the United States, 2015-2016. Sci. Rep. 9, 683 (2019).
Article ADS PubMed PubMed Central Google Scholar
Reich, N. G. et al. A collaborative multiyear, multimodel assessment of seasonal influenza forecasting in the United States. Proc. Natl Acad. Sci. USA 116, 3146–3154 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
COVID-19 Reported patient impact and hospital capacity by state timeseries (RAW). https://healthdata.gov/Hospital/COVID-19-Reported-Patient-Impact-and-Hospital-Capa/g62h-sye (2023).
Guidance for hospitals and acute care facilities reporting of respiratory pathogen, bed capacity, and supply data to CDC’s National Healthcare Safety Network (NHSN). https://www.hhs.gov/sites/default/files/covid-19-faqs-hospitals-hospital-laboratory-acute-care-facility-data-reporting.pdf (2023).
Olsen, S. J. et al. Changes in influenza and other respiratory virus activity during the COVID-19 pandemic-United States, 2020-2021. Am. J. Transpl. 21, 3481–3486 (2021).
Article CAS Google Scholar
Merced-Morales, A. et al. Influenza activity and composition of the 2022-23 influenza vaccine - United States, 2021-22 season. MMWR Morb. Mortal. Wkly Rep. 71, 913–919 (2022).
Article CAS PubMed PubMed Central Google Scholar
CDC. Influenza activity in the United States during the 2022–23 season and composition of the 2023–24 influenza vaccine. https://www.cdc.gov/flu/spotlights/2023-2024/22-23-summary-technical-report.htm (2023).
ReichLab. Zoltar Forecast Archive. https://zoltardata.com/project/299/viz (2023).
Reich, N. G. et al. Accuracy of real-time multi-model ensemble forecasts for seasonal influenza in the U.S. PLoS Comput Biol. 15, e1007486 (2019).
Article PubMed PubMed Central Google Scholar
Cramer, E. Y. et al. Evaluation of individual and ensemble probabilistic forecasts of COVID-19 mortality in the United States. Proc. Natl Acad. Sci. USA 119, e2113561119 (2022).
Article CAS PubMed PubMed Central Google Scholar
Biggerstaff, M. et al. Improving pandemic response: employing mathematical modeling to confront coronavirus disease 2019. Clin. Infect. Dis. 74, 913–917 (2022).
Article CAS PubMed Google Scholar
Howerton, E. et al. Evaluation of the US COVID-19 scenario modeling hub for informing pandemic response under uncertainty. Nat. Commun. 14, 7260 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
Reich, N. G. et al. Collaborative hubs: making the most of predictive epidemic modeling. Am. J. Public Health 112, 839–842 (2022).
Article PubMed PubMed Central Google Scholar
Bosse, N. I. et al. Scoring epidemiological forecasts on transformed scales. PLoS Comput Biol. 19, e1011393 (2023).
Article CAS PubMed PubMed Central Google Scholar
Bracher, J. et al. Evaluating epidemic forecasts in an interval format. PLoS Comput Biol. 17, e1008618 (2021).
Article CAS PubMed PubMed Central Google Scholar
Lopez, V. K. et al. Challenges of COVID-19 case forecasting in the US, 2020–2021. PLoS Comput Biol. 20, e1011200 (2024).
Article CAS PubMed PubMed Central Google Scholar
Srivastava, A., Singh, S. & Lee, F. Shape-based evaluation of epidemic forecasts. In 2022 IEEE International Conference on Big Data (Big Data) 1701–1710 (IEEE, 2022).
FluSight Forecasting Consortium. FluSight Forecast Data 2023 https://github.com/cdcepi/Flusight-forecast-data (2023).
Adiga, A. et al. Phase-informed bayesian ensemble models improve performance of COVID-19 forecasts. Proc. AAAI Conf. Artif. Intell. 37, 15647–15653 (2023).
Google Scholar
Borchering, R. K. et al. Responding to the return of influenza in the united states by applying centers for disease control and prevention surveillance, analysis, and modeling to inform understanding of seasonal influenza. JMIR Public Health Surveill. 10, e54340 (2024).
Article PubMed PubMed Central Google Scholar
Fox, S. J. et al. Real-time pandemic surveillance using hospital admissions and mobility data. Proc. Natl. Acad. Sci. USA 119, e2111870119 (2022).
Johansson, M. A. et al. An open challenge to advance probabilistic forecasting for dengue epidemics. Proc. Natl Acad. Sci. USA 116, 24268–24274 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
CDC. MMWR Weeks Fact Sheet. https://ndc.services.cdc.gov/wp-content/uploads/MMWR_Week_overview.pdf (2023).
Ray, E. L. et al. Simplets: simple models for time series forecasting (2022).
Cramer, E. Y. et al. The United States COVID-19 forecast hub dataset. Sci. Data 9, 462 (2022).
Article CAS PubMed PubMed Central Google Scholar
Bosse, N. I. et al. Evaluating Forecasts with scoringutils in R. Preprint at https://doi.org/10.48550/arXiv.2205.07090 (2022).
R Development Core Team, R: A language and environment for statistical computing. (R foundation for statistical computing: Vienna, Austria, 2023).
FluSight manuscript repository. https://github.com/cdcepi/FluSight-manuscripts (2023).

Download references

Acknowledgements

The authors would like to acknowledge Michael A. Johansson and Nicole Samay for their contributions to this work. M.B.N., P.R., J.T., S.V., A.A., G.K., B.H., B.L.L., M.V.M., M.A.A, A.Srivastava disclose support for the research of this work from the Centers for Disease Control and Prevention (CDC) and Council of State and Territorial Epidemiologists (CSTE), [Cooperative Agreement number NU380T000297] M.C., J.D, K.M., X.X, A.P.P., AV, P.C.V., A.G.K. M.L., M.A. disclose support for the research of this work from the HHS/CDC 6U01IP001137 and HHS/CDC 5U01IP0001137. B.A.P., A.R., H.P.K., Z.Z., G.G., P.A., S.S.B., R.Raman, disclose support for the research of this work from NSF (Expeditions CCF-1918770, CAREER IIS-2028586, RAPID IIS-2027862, Medium IIS-1955883, Medium IIS-2106961, PIPP CCF-2200269), CDC MInD program, faculty gifts from Facebook/Meta, and funds/computing resources from Georgia Tech and GTRI. M.Santillana, L.C. F.L and A.G.M. disclose support for the research of this work from the Centers for Disease Control and Prevention (CDC) and Council of State and Territorial Epidemiologists (CSTE), [Cooperative Agreement number NU380T000297] M.Santillana discloses support for the research of this work from the National Institutes of Health (grant number R01GM130668) and (in part) by contract 200-2016-91779 and cooperative agreement CDC-RFA-FT-23-0069 with the Centers for Disease Control and Prevention. S.V., A.A., G.K., B.H., B.L.L., M.V.M., also disclose support for the research of this work from NSF Expeditions CCF-1918656, VDH Grant VDH-21-501-0135, University of Virginia Strategic Investment Fund Award SIF160. L.C.B., A.Green, A.J.H., D.J.M., R.Rosenfeld, D.S., R.J.T. disclose support for the research of this work from the Centers for Disease Control and Prevention U011P001121 and Centers for Disease Control and Prevention 75D30123C15907. B.T.S., S.A.S., H.L.G., and P.Baccam disclose support for the research of this work from the Centers for Disease Control and Prevention (CDC) and Council of State and Territorial Epidemiologists (CSTE), [Cooperative Agreement number NU38OT000297] N.R. discloses support from National Science Foundation grants CCF-1918770, NRT DGE-1545362, and OAC-1835660. S.T., C.P.S., A.H. disclose support for the research of this work from the National Science Foundation [2127976]. S.T., C.P.S., A.H., J.Lessler, J.C.L., S.L.L., C.D.M., K.S., S-m.J. disclose support from the Centers for Disease Control and Prevention [200–2016]. J.Lessler and J.C.L. disclose support from the National Institutes of Health (NIH 5R01AI102939). A.Mallela, Y.T.L., and W.S.H. disclose support for the research of this work from Laboratory Directed Research and Development Program at Los Alamos National Laboratory [20220268ER]. W.S.H., R. G. P., S. L., Y. C. disclose support for the research of this work from the National Institute of Health [R01GM111510]. Any use of trade, firm, or product names is for descriptive purposes only and does not imply endorsement by the U.S. Government. The findings and conclusions in this report are those of the authors and do not necessarily represent the views of the Centers for Disease Control and Prevention or the National Institutes of Health.

Author information

These authors contributed equally: Sarabeth M. Mathis, Alexander E. Webber.

Authors and Affiliations

Centers for Disease Control and Prevention, Atlanta, GA, USA
Sarabeth M. Mathis, Alexander E. Webber, Carrie Reed, Matthew Biggerstaff & Rebecca K. Borchering
California Department of Public Health, Richmond, CA, USA
Tomás M. León, Erin L. Murray, Monica Sun & Lauren A. White
Carnegie Mellon University, Pittsburgh, PA, USA
Logan C. Brooks, Alden Green, Addison J. Hu, Roni Rosenfeld, Dmitry Shemetov & Ryan J. Tibshirani
University of California, Berkeley, Berkeley, CA, USA
Logan C. Brooks & Ryan J. Tibshirani
University of British Columbia, Vancouver, BC, Canada
Daniel J. McDonald
Norwegian Institute of Public Health, Oslo, Norway
Sasikiran Kandula
Columbia University, New York, NY, USA
Sen Pei, Rami Yaari, Teresa K. Yamana & Jeffrey Shaman
Columbia University School of Climate, New York, NY, USA
Jeffrey Shaman
Georgia Institute of Technology, Atlanta, GA, USA
Pulak Agarwal, Srikar Balusu, Gautham Gururajan, Harshavardhan Kamarthi, B. Aditya Prakash, Rishi Raman & Zhiyuan Zhao
University of Michigan, Ann Arbor, MI, USA
Alexander Rodríguez
Guidehouse Advisory and Consulting Services, McClean, VA, USA
Akilan Meiyappan & Shalina Omar
IEM, Bel Air, MD, USA
Prasith Baccam, Heidi L. Gurung & Brad T. Suchoski
IEM, Baton Rouge, LA, USA
Steve A. Stage
Indiana University School of Public Health, Bloomington, IN, USA
Marco Ajelli, Allisandra G. Kummer, Maria Litvinova & Paulo C. Ventura
Iowa State University, Ames, IA, USA
Spencer Wadsworth & Jarad Niemi
Johns Hopkins University, Baltimore, MD, USA
Erica Carcelen, Alison L. Hill, Sara L. Loo, Clifton D. McKee, Koji Sato, Claire Smith & Shaun Truelove
University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
Sung-mok Jung, Joseph C. Lemaitre & Justin Lessler
Lehigh University, Bethlehem, PA, USA
Thomas McAndrew & Wenxuan Ye
London School of Health and Tropical Medicine, London, UK
Nikos Bosse
Los Alamos National Laboratory, Los Alamos, NM, USA
William S. Hlavacek, Yen Ting Lin, Abhishek Mallela & Graham C. Gibson
Northern Arizona University, Flagstaff, AZ, USA
Ye Chen, Shelby M. Lamm, Jaechoul Lee & Richard G. Posner
Fogarty International Center, National Institutes of Health, Bethesda, MD, USA
Amanda C. Perofsky & Cécile Viboud
Northeastern University, Boston, MA, USA
Leonardo Clemente, Fred Lu, Austin G. Meyer, Mauricio Santillana, Matteo Chinazzi, Jessica T. Davis, Kunpeng Mu, Ana Pastore y Piontti, Alessandro Vespignani & Xinyue Xiong
Predictive Science Inc, San Diego, CA, USA
Michal Ben-Nun, Pete Riley & James Turtle
Signature Science, LLC, Austin, TX, USA
Chis Hulme-Lowe & Shakeel Jessa
Signature Science, LLC, Charlottesville, VA, USA
V. P. Nagraj, Stephen D. Turner & Desiree Williams
University of Georgia, Athens, GA, USA
Avranil Basu, John M. Drake, Spencer J. Fox & Ehsan Suez
University of Guelph, Guelph, ON, Canada
Monica G. Cojocaru & Edward W. Thommes
Sanofi, Toronto, ON, USA
Edward W. Thommes
University of Massachusetts Amherst, Amherst, MA, USA
Estee Y. Cramer, Aaron Gerding, Ariane Stark, Evan L. Ray, Nicholas G. Reich, Li Shandross, Nutcha Wattanachit, Yijin Wang & Martha W. Zorn
University of Southern California, Los Angeles, CA, USA
Majd Al Aawar & Ajitesh Srivastava
University of Texas Austin, Austin, TX, USA
Lauren A. Meyers
University of Virginia, Charlottesville, VA, USA
Aniruddha Adiga, Benjamin Hurt, Gursharn Kaur, Bryan L. Lewis, Madhav Marathe & Srinivasan Venkatramanan
Virginia Tech, Arlington, VA, USA
Patrick Butler, Andrew Farabow & Naren Ramakrishnan
Stevens Institute of Technology, Hoboken, NJ, USA
Nikhil Muralidhar

Authors

Sarabeth M. Mathis
View author publications
You can also search for this author in PubMed Google Scholar
Alexander E. Webber
View author publications
You can also search for this author in PubMed Google Scholar
Tomás M. León
View author publications
You can also search for this author in PubMed Google Scholar
Erin L. Murray
View author publications
You can also search for this author in PubMed Google Scholar
Monica Sun
View author publications
You can also search for this author in PubMed Google Scholar
Lauren A. White
View author publications
You can also search for this author in PubMed Google Scholar
Logan C. Brooks
View author publications
You can also search for this author in PubMed Google Scholar
Alden Green
View author publications
You can also search for this author in PubMed Google Scholar
Addison J. Hu
View author publications
You can also search for this author in PubMed Google Scholar
Roni Rosenfeld
View author publications
You can also search for this author in PubMed Google Scholar
Dmitry Shemetov
View author publications
You can also search for this author in PubMed Google Scholar
Ryan J. Tibshirani
View author publications
You can also search for this author in PubMed Google Scholar
Daniel J. McDonald
View author publications
You can also search for this author in PubMed Google Scholar
Sasikiran Kandula
View author publications
You can also search for this author in PubMed Google Scholar
Sen Pei
View author publications
You can also search for this author in PubMed Google Scholar
Rami Yaari
View author publications
You can also search for this author in PubMed Google Scholar
Teresa K. Yamana
View author publications
You can also search for this author in PubMed Google Scholar
Jeffrey Shaman
View author publications
You can also search for this author in PubMed Google Scholar
Pulak Agarwal
View author publications
You can also search for this author in PubMed Google Scholar
Srikar Balusu
View author publications
You can also search for this author in PubMed Google Scholar
Gautham Gururajan
View author publications
You can also search for this author in PubMed Google Scholar
Harshavardhan Kamarthi
View author publications
You can also search for this author in PubMed Google Scholar
B. Aditya Prakash
View author publications
You can also search for this author in PubMed Google Scholar
Rishi Raman
View author publications
You can also search for this author in PubMed Google Scholar
Zhiyuan Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Alexander Rodríguez
View author publications
You can also search for this author in PubMed Google Scholar
Akilan Meiyappan
View author publications
You can also search for this author in PubMed Google Scholar
Shalina Omar
View author publications
You can also search for this author in PubMed Google Scholar
Prasith Baccam
View author publications
You can also search for this author in PubMed Google Scholar
Heidi L. Gurung
View author publications
You can also search for this author in PubMed Google Scholar
Brad T. Suchoski
View author publications
You can also search for this author in PubMed Google Scholar
Steve A. Stage
View author publications
You can also search for this author in PubMed Google Scholar
Marco Ajelli
View author publications
You can also search for this author in PubMed Google Scholar
Allisandra G. Kummer
View author publications
You can also search for this author in PubMed Google Scholar
Maria Litvinova
View author publications
You can also search for this author in PubMed Google Scholar
Paulo C. Ventura
View author publications
You can also search for this author in PubMed Google Scholar
Spencer Wadsworth
View author publications
You can also search for this author in PubMed Google Scholar
Jarad Niemi
View author publications
You can also search for this author in PubMed Google Scholar
Erica Carcelen
View author publications
You can also search for this author in PubMed Google Scholar
Alison L. Hill
View author publications
You can also search for this author in PubMed Google Scholar
Sara L. Loo
View author publications
You can also search for this author in PubMed Google Scholar
Clifton D. McKee
View author publications
You can also search for this author in PubMed Google Scholar
Koji Sato
View author publications
You can also search for this author in PubMed Google Scholar
Claire Smith
View author publications
You can also search for this author in PubMed Google Scholar
Shaun Truelove
View author publications
You can also search for this author in PubMed Google Scholar
Sung-mok Jung
View author publications
You can also search for this author in PubMed Google Scholar
Joseph C. Lemaitre
View author publications
You can also search for this author in PubMed Google Scholar
Justin Lessler
View author publications
You can also search for this author in PubMed Google Scholar
Thomas McAndrew
View author publications
You can also search for this author in PubMed Google Scholar
Wenxuan Ye
View author publications
You can also search for this author in PubMed Google Scholar
Nikos Bosse
View author publications
You can also search for this author in PubMed Google Scholar
William S. Hlavacek
View author publications
You can also search for this author in PubMed Google Scholar
Yen Ting Lin
View author publications
You can also search for this author in PubMed Google Scholar
Abhishek Mallela
View author publications
You can also search for this author in PubMed Google Scholar
Graham C. Gibson
View author publications
You can also search for this author in PubMed Google Scholar
Ye Chen
View author publications
You can also search for this author in PubMed Google Scholar
Shelby M. Lamm
View author publications
You can also search for this author in PubMed Google Scholar
Jaechoul Lee
View author publications
You can also search for this author in PubMed Google Scholar
Richard G. Posner
View author publications
You can also search for this author in PubMed Google Scholar
Amanda C. Perofsky
View author publications
You can also search for this author in PubMed Google Scholar
Cécile Viboud
View author publications
You can also search for this author in PubMed Google Scholar
Leonardo Clemente
View author publications
You can also search for this author in PubMed Google Scholar
Fred Lu
View author publications
You can also search for this author in PubMed Google Scholar
Austin G. Meyer
View author publications
You can also search for this author in PubMed Google Scholar
Mauricio Santillana
View author publications
You can also search for this author in PubMed Google Scholar
Matteo Chinazzi
View author publications
You can also search for this author in PubMed Google Scholar
Jessica T. Davis
View author publications
You can also search for this author in PubMed Google Scholar
Kunpeng Mu
View author publications
You can also search for this author in PubMed Google Scholar
Ana Pastore y Piontti
View author publications
You can also search for this author in PubMed Google Scholar
Alessandro Vespignani
View author publications
You can also search for this author in PubMed Google Scholar
Xinyue Xiong
View author publications
You can also search for this author in PubMed Google Scholar
Michal Ben-Nun
View author publications
You can also search for this author in PubMed Google Scholar
Pete Riley
View author publications
You can also search for this author in PubMed Google Scholar
James Turtle
View author publications
You can also search for this author in PubMed Google Scholar
Chis Hulme-Lowe
View author publications
You can also search for this author in PubMed Google Scholar
Shakeel Jessa
View author publications
You can also search for this author in PubMed Google Scholar
V. P. Nagraj
View author publications
You can also search for this author in PubMed Google Scholar
Stephen D. Turner
View author publications
You can also search for this author in PubMed Google Scholar
Desiree Williams
View author publications
You can also search for this author in PubMed Google Scholar
Avranil Basu
View author publications
You can also search for this author in PubMed Google Scholar
John M. Drake
View author publications
You can also search for this author in PubMed Google Scholar
Spencer J. Fox
View author publications
You can also search for this author in PubMed Google Scholar
Ehsan Suez
View author publications
You can also search for this author in PubMed Google Scholar
Monica G. Cojocaru
View author publications
You can also search for this author in PubMed Google Scholar
Edward W. Thommes
View author publications
You can also search for this author in PubMed Google Scholar
Estee Y. Cramer
View author publications
You can also search for this author in PubMed Google Scholar
Aaron Gerding
View author publications
You can also search for this author in PubMed Google Scholar
Ariane Stark
View author publications
You can also search for this author in PubMed Google Scholar
Evan L. Ray
View author publications
You can also search for this author in PubMed Google Scholar
Nicholas G. Reich
View author publications
You can also search for this author in PubMed Google Scholar
Li Shandross
View author publications
You can also search for this author in PubMed Google Scholar
Nutcha Wattanachit
View author publications
You can also search for this author in PubMed Google Scholar
Yijin Wang
View author publications
You can also search for this author in PubMed Google Scholar
Martha W. Zorn
View author publications
You can also search for this author in PubMed Google Scholar
Majd Al Aawar
View author publications
You can also search for this author in PubMed Google Scholar
Ajitesh Srivastava
View author publications
You can also search for this author in PubMed Google Scholar
Lauren A. Meyers
View author publications
You can also search for this author in PubMed Google Scholar
Aniruddha Adiga
View author publications
You can also search for this author in PubMed Google Scholar
Benjamin Hurt
View author publications
You can also search for this author in PubMed Google Scholar
Gursharn Kaur
View author publications
You can also search for this author in PubMed Google Scholar
Bryan L. Lewis
View author publications
You can also search for this author in PubMed Google Scholar
Madhav Marathe
View author publications
You can also search for this author in PubMed Google Scholar
Srinivasan Venkatramanan
View author publications
You can also search for this author in PubMed Google Scholar
Patrick Butler
View author publications
You can also search for this author in PubMed Google Scholar
Andrew Farabow
View author publications
You can also search for this author in PubMed Google Scholar
Naren Ramakrishnan
View author publications
You can also search for this author in PubMed Google Scholar
Nikhil Muralidhar
View author publications
You can also search for this author in PubMed Google Scholar
Carrie Reed
View author publications
You can also search for this author in PubMed Google Scholar
Matthew Biggerstaff
View author publications
You can also search for this author in PubMed Google Scholar
Rebecca K. Borchering
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

S.M.M., A.E.W, M.B, and R.K.B. contributed to conceptualization. S.M.M. and R.K.B. wrote the original draft of the manuscript. S.M.M. and A.E.W. performed the formal analysis. M.B. and C.R. performed supervision and project administration. All authors contributed modeling data and T.M.L., E.L.M., M.Sun, L.A.W., L.C.B., A.Green, A.J.H., D.J.M., R.Rosenfeld, D.S., R.J.T., S.K., S.P., J.S., R.Y., T.K.Y., P.A., S.B., G.G., H.K., B.A.P., R.Raman, A.R., Z.Z., A.Meiyappan, S.O., P.Baccam, H.L.G., S.A.S., B.T.S., M.A., A.G.K., M.L., P.C.V., S.W., J.N., E.C., A.L.H., S.J., J.C.L., J.Lessler S.L.L., C.D.M., K.S., C.S., S.T., T.M., W.Y., N.B., W.S.H., Y.T.L., A.Mallela, Y.C., S.M.L., J.Lee, R.G.P., A.C.P., C.V., L.C., F.L., A.G.M., M.Santillana, M.C., J.T.D., K.M., A.P.P., A.V., X.X., M.B.N., P.R., J.T., C.H.L., S.J., V.P.N., S.D.T., D.W., A.B., J.M.D., S.J.F., G.C.G., E.S., E.W.T., M.G.C., E.Y.C., A.Gerding, A.Stark, E.L.R., N.G.R., L.S., N.W., Y.W., M.W.Z., M.A.A., A.Srivastava, L.A.M., A.A., B.H., G.K., B.L.L., M.M., S.V., P.Butler, A.F., N.M., and N.R. submitted forecast data for the analysis. All authors contributed to the review and editing of the manuscript.

Corresponding authors

Correspondence to Sarabeth M. Mathis or Rebecca K. Borchering.

Ethics declarations

Competing interests

E.W.T. is an employee of Sanofi, which manufactures influenza vaccines. J.S. and Columbia University disclose partial ownership of SK Analytics. J.S. discloses consulting for BNI. The remaining authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Martin Bicher and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplmentary Information

Peer Review File

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Mathis, S.M., Webber, A.E., León, T.M. et al. Evaluation of FluSight influenza forecasting in the 2021–22 and 2022–23 seasons with a new target laboratory-confirmed influenza hospitalizations. Nat Commun 15, 6289 (2024). https://doi.org/10.1038/s41467-024-50601-9

Download citation

Received: 08 December 2023
Accepted: 16 July 2024
Published: 26 July 2024
DOI: https://doi.org/10.1038/s41467-024-50601-9
Springer Nature Limited

Evaluation of FluSight influenza forecasting in the 2021–22 and 2022–23 seasons with a new target laboratory-confirmed influenza hospitalizations

From

Abstract

Similar content being viewed by others

Collaborative efforts to forecast seasonal influenza in the United States, 2015–2016

Comparative assessment of methods for short-term forecasts of COVID-19 hospital admissions in England at the local level

Results from the centers for disease control and prevention’s predict the 2013–2014 Influenza Season Challenge

Introduction

Results

Models Included

Relative WIS

Log-transformed analysis

Relative WIS and Spatial Variation

Absolute WIS

Coverage

Discussion

Forecast performance–accuracy

Forecast performance – coverage

Influenza forecasting in the COVID-19 era: challenges and opportunities

Methods

Reporting summary

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Supplementary information

Supplmentary Information

Peer Review File

Reporting Summary

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation