Part of the VIEWS Platform ecosystem for large-scale conflict forecasting.
- Overview
- Role in the VIEWS Pipeline
- Features
- Installation
- Usage
- Architecture
- Project Structure
- Contributing
- License
- Acknowledgements
Stepshifter is a machine learning model designed for time-series forecasting using Darts. It handles both regression and classification tasks.
Key Capabilities:
- Probabilistic Outputs: Binary outputs and point predictions.
- Learning Approach: LinearRegressionModel, RandomForest, LightGBMModel, XGBModel, HurdleModel.
- Integration-Ready: Built to seamlessly integrate into the larger VIEWS Pipeline.
Stepshifter serves as part of the Violence & Impacts Early Warning System (VIEWS) pipeline. It complements the following repositories:
- views-pipeline-core: For data ingestion and preprocessing.
- views-models: Handles training, testing, and deployment.
- views-evaluation: Evaluates and calibrates model outputs.
- docs: Organization/pipeline level documentation.
Stepshifter fits into the pipeline as follows:
- Data Input: Processes preprocessed data from views-pipeline-core.
- Model Execution: This modeling approach involves shifting all independent variables in time, in order to train models that can predict future values of the dependent variable.
- Post-Processing: Outputs are sent to views-evaluation for further analysis.
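The shifting idea in the Model Execution step can be sketched in plain Python. This is only an illustration under stated assumptions, not the package's implementation: the function `make_shifted_pairs`, the tuple layout `(entity_id, month_id, features, target)`, and the toy values are all hypothetical. For a given step, features observed at month t are paired with the target observed at month t + step, and one such training set is built per step.

```python
from collections import defaultdict


def make_shifted_pairs(rows, step):
    """Pair features at month t with the target at month t + step.

    `rows` is an iterable of (entity_id, month_id, features, target) tuples.
    This layout is an assumption made for the sketch, not the VIEWS data format.
    """
    by_entity = defaultdict(dict)
    for entity_id, month_id, features, target in rows:
        by_entity[entity_id][month_id] = (features, target)

    pairs = []
    for months in by_entity.values():
        for month_id in sorted(months):
            future = months.get(month_id + step)
            if future is None:
                continue  # no observed target `step` months ahead
            features, _ = months[month_id]
            pairs.append((features, future[1]))
    return pairs


# Toy data: one entity, four consecutive months, a single feature.
rows = [
    (1, 100, [0.1], 0),
    (1, 101, [0.2], 3),
    (1, 102, [0.3], 0),
    (1, 103, [0.4], 5),
]

# One training set per forecasting step.
pairs_by_step = {step: make_shifted_pairs(rows, step) for step in (1, 2)}
```

Each entry of `pairs_by_step` could then be used to fit a separate regression model for that step.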
- Darts models: The Stepshifter model class supports multiple Darts forecasting models, including LinearRegressionModel, RandomForest, LightGBMModel, and XGBModel.
- Automated Data Cleanup: The Stepshifter model class automatically processes missing data and ensures consistent multi-index formatting for time-series data.
- Hurdle model: Hurdle model class inherits from StepshifterModel.
A hurdle model consists of two stages:
- Binary stage: Predicts whether the target variable is 0 or > 0.
- Positive stage: Predicts the value of the target variable when it is > 0.
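As a rough illustration of the two stages, the sketch below derives the training targets each stage would see in a textbook hurdle model. The helper `hurdle_targets` and the sample values are hypothetical and only show the split between "is the target above zero?" and "how large is it when it is?"; they are not taken from the package.

```python
def hurdle_targets(target_values):
    """Split a target into binary-stage labels and positive-stage values."""
    # Binary stage: 1 where the target is > 0, otherwise 0.
    binary_labels = [1 if y > 0 else 0 for y in target_values]
    # Positive stage: only the strictly positive target values.
    positive_values = [y for y in target_values if y > 0]
    return binary_labels, positive_values


binary_labels, positive_values = hurdle_targets([0, 3, 0, 0, 12, 1])
```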
- Python >= 3.11
- Access to views-pipeline-core.
See the organization/pipeline level docs for installation and usage instructions.
Stepshifter integrates seamlessly with the VIEWS pipeline. After processing, outputs can be passed to views-evaluation for further calibration or ensembling.
This modeling approach involves shifting all independent variables in time, in order to train models that can predict future values of the dependent variable. More details can be found in Appendix A of Hegre et al. (2020).
The hurdle model in this package differs from a traditional implementation in three aspects:
- In the first stage, since Darts doesn't support classification models, a regression model is used instead. These estimates are not strictly bounded between 0 and 1, but this is acceptable for the purpose of this step.
- To determine whether an observation is classified as "positive," we apply a threshold. The default threshold is 1, meaning that predictions above this value are considered positive outcomes. It is not set as 0 because most predictions won't be exactly 0. This threshold can be adjusted as a tunable hyperparameter to better suit specific requirements.
- In the second stage, a regression model is used to predict for the selected time series. Because Darts time series require continuous timestamps, we cannot drop the individual timestamps that received a negative prediction in the first stage, as a traditional implementation would. Instead, we include the entire time series for every country or PRIO grid cell where the first stage yielded at least one positive prediction.
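The thresholding and series selection described above can be sketched as follows. This is a minimal illustration, not the package's code: the function `select_positive_series`, the dictionary layout, and the sample predictions are assumptions. A series is kept in full for the second stage if any of its first-stage predictions lies above the threshold (default 1).

```python
def select_positive_series(stage_one_predictions, threshold=1.0):
    """Return the IDs of series with at least one prediction above `threshold`.

    `stage_one_predictions` maps a series ID (e.g. a country or PRIO grid cell)
    to its list of first-stage regression outputs. Hypothetical layout.
    """
    return sorted(
        series_id
        for series_id, predictions in stage_one_predictions.items()
        if any(p > threshold for p in predictions)
    )


# Toy first-stage outputs for three series.
stage_one_predictions = {
    "A": [0.2, 0.4, 0.9],  # never above the threshold: dropped
    "B": [0.1, 1.7, 0.3],  # one value above the threshold: whole series kept
    "C": [1.0, 1.0, 1.0],  # equal to the threshold is not "above": dropped
}

selected = select_positive_series(stage_one_predictions)
```

Lowering `threshold` keeps more series for the second stage, which is why it is worth treating as a tunable hyperparameter.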
- Input: VIEWS historical conflict data.
- Processing: Converting to Darts time series data.
- Prediction: Regression predictions.
Refer to Appendix A of Hegre et al. (2020) for an in-depth explanation.
For more detailed information about the VIEWS Stepshifter models themselves, refer to the VIEWS models catalog.
views-stepshifter/
├── README.md # Documentation
├── tests # Unit and integration tests
├── views-stepshifter # Main source code
│ ├── manager # Management of stepshifter model lifecycle
│ ├── models # Model algorithms
│ ├── src # Folder template
│ ├── __init__.py # Package initialization
├── .gitignore # Git ignore rules
├── pyproject.toml # Poetry project file
├── poetry.lock # Dependency lock file
We welcome contributions to this project! Please follow the guidelines in the VIEWS Documentation.
This project is licensed under the terms described in the LICENSE file.
Special thanks to the VIEWS MD&D Team for their collaboration and support.

