Git Product home page Git Product logo

adaptively_stacking_ensembles_for_infleunza_forecasting_with_incomplete_data's Introduction

adaptively_stacking_ensembles_for_infleunza_forecasting_with_incomplete_data

Running the command make build will generate all needed figures and datasets needed for publication.

Description of Folders

_0_mixtureModelAlgorithms -> Ensemble algorithms for static and adaptive ensembles.

_1_downloadRawData -> Code to download Raw influenza data from the CDC, through Epicast.

_2_processRawILIdata -> Aggregate all raw influenza data into one file.

_3_collect_and_process_individual_forecasts -> Combine component model forecasts together.

_4_score_component_model_forecasts -> Score component model forecasts.

_5_compute_and_score__ensembles -> Score ensemble model forecasts.

_6_TLGs -> Create tables and figures for publication.

_7_manuscript -> Code for generating manuscript.

_data_dictionary -> Data dictionary for datasets published on Harvard dataverse.

abstract

Seasonal influenza infects between 10 and 50 million people in the United States every year, overburdening hospitals during weeks of peak incidence. Named by the CDC as an important tool to fight the damaging effects of these epidemics, accurate forecasts of influenza and influenza-like illness (ILI) forewarn public health officials about when, and where, seasonal influenza outbreaks will hit hardest.

Multi-model ensemble forecasts---weighted combinations of component models---have shown positive results in forecasting. Ensemble forecasts of influenza outbreaks have been static, training on all past ILI data at the beginning of a season, generating a set of optimal weights for each model in the ensemble, and keeping the weights constant. We propose an adaptive ensemble forecast that (i) changes model weights week-by-week throughout the influenza season, (ii) only needs the current influenza season's data to make predictions, and (iii) by introducing a prior distribution, shrinks weights toward the reference equal weighting approach and adjusts for observed ILI percentages that are subject to future revisions.

We investigate the prior's ability to impact adaptive ensemble performance and, after finding an optimal prior via a cross-validation approach, compare our adaptive ensemble’s performance to equal-weighted and static ensembles. Applied to forecasts of short-term ILI incidence at the regional and national level in the US, our adaptive model outperforms a na"ive equal-weighted ensemble, and has similar or better performance to the static ensemble, which requires multiple years of training data.

Adaptive ensembles are able to quickly train and forecast during epidemics, and provide a practical tool to public health officials looking for forecasts that can conform to unique features of a specific season.

additional data needed

Data needed for this project can be found at the Harvard Dataverse.

allFSNLogScores.csv.gz (https://doi.org/10.7910/DVN/YUYFUF) can be downloaded wget https://dataverse.harvard.edu/api/access/datafile/:persistentId?persistentId=doi:10.7910/DVN/YUYFUF and placed in the folder _4_score_component_model_forecasts/analysisData/

analysisData/allEnsembleScores.csv (https://doi.org/10.7910/DVN/RJE9PT) can be downloaded wget https://dataverse.harvard.edu/api/access/datafile/:persistentId?persistentId=doi:10.7910/DVN/RJE9PT -O allEnsembleScores.csv and placed in the folder _5_compute_and_score__ensembles/analysisData

analysisData/allEnsembleWeights.csv (https://doi.org/10.7910/DVN/A9YZOV) can be downloaded wget https://dataverse.harvard.edu/api/access/datafile/:persistentId?persistentId=doi:10.7910/DVN/A9YZOV -O allEnsembleWeights.csv and placed in the folder _5_compute_and_score__ensembles/analysisData

analysisData_preSeason/allEnsembleScores.csv (https://doi.org/10.7910/DVN/GH3C1U) can be downloaded wget https://dataverse.harvard.edu/api/access/datafile/:persistentId?persistentId=doi:10.7910/DVN/GH3C1U -O allEnsembleScores.csv and placed in the folder _5_compute_and_score__ensembles/analysisData

_2_processRawILIdata/analysisData/allFluData__releaseDate_location_EW_lag_ili_wili_year_week_modelWeek_calendarEW_Season.csv (https://doi.org/10.7910/DVN/BNVXN6) can be downloaded wget https://dataverse.harvard.edu/api/access/datafile/:persistentId?persistentId=doi:10.7910/DVN/BNVXN6/Q56PMZ -O allFluData__releaseDate_location_EW_lag_ili_wili_year_week_modelWeek_calendarEW_Season.csv and placed in the folder _2_processRawILIdata/analysisData/

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.