Workflow Overview

This page provides a comprehensive overview of the OSeMOSYS-RDM workflow and its components.

What is OSeMOSYS-RDM?

OSeMOSYS-RDM is a reproducible workflow tool designed for:

Preprocessing OSeMOSYS model data
Solving energy system optimization problems
Postprocessing results for analysis
Uncertainty Analysis using Robust Decision Making (RDM) methodology
Scenario Discovery using PRIM (Patient Rule Induction Method)

Architecture Overview

Pipeline Stages

Stage 1: Base Future (Future 0)

The base future represents your baseline scenario without uncertainties.

Purpose:

Establish the reference case for comparison
Validate model structure and data
Generate baseline projections

Process:

Extract model structure from scenario file
Preprocess data into solver-ready format (via preprocess_data.py: adds commodity-technology-mode sets, computes CapitalRecoveryFactor and PvAnnuity)
Execute optimization with selected solver
Process outputs into standardized CSV format

Key Files:

Input: src/workflow/0_Scenarios/*.txt
Output: src/workflow/1_Experiment/Executables/

Stage 2: RDM Experiment

The RDM experiment generates multiple futures by systematically varying uncertain parameters.

Purpose:

Explore uncertainty space using Latin Hypercube Sampling
Generate ensemble of possible futures
Understand sensitivity to key parameters

Process:

Read uncertainty definitions from Interface_RDM.xlsx
Generate parameter samples using LHS
Apply perturbations and correct EV UDC coefficient signs (if configured)
Create modified scenario files for each future (with automatic data preprocessing)
Execute all futures (parallelized)
Store results in Parquet format
Generate consolidated Energy_Input.csv in src/Results/

Key Concepts:

Concept	Description
Future	One realization of uncertain parameters
LHS	Latin Hypercube Sampling for efficient coverage
Multiplier	Relative change from baseline value
Time Series	Trajectory of values over time

Stage 3: Postprocessing

Consolidates output results from all futures into analysis-ready datasets.

Purpose:

Aggregate output results from parallel runs
Create unified output dataset
Generate the consolidated Energy_Output.csv

Process:

Collect output results from all futures
Standardize column formats
Concatenate into master output dataset
Export as CSV

Note

The Energy_Input.csv is generated during the rdm_experiment stage (immediately after all futures are solved), while the Energy_Output.csv is generated during this postprocess stage.

Output Files:

src/Results/
├── OSEMOSYS_{Region}_Energy_Output.csv   (generated in postprocess)
├── OSEMOSYS_{Region}_Energy_Input.csv    (generated in rdm_experiment)
└── (additional analysis files)

Stage 4: PRIM Files Creator

Prepares data for PRIM scenario discovery analysis.

Purpose:

Transform RDM results into PRIM-compatible format
Calculate metrics for each future
Aggregate by analysis periods

Process:

Read postprocessed results
Calculate outcome metrics (costs, emissions, etc.)
Match drivers to outcomes
Create PRIM input tables

Stage 5: PRIM Analysis

Executes the Patient Rule Induction Method for scenario discovery.

Purpose:

Identify parameter combinations leading to outcomes of interest
Find “boxes” in uncertainty space
Characterize successful/risky scenarios

Process:

Define outcome thresholds (high/low cases)
Execute PRIM peeling algorithm
Identify predominant parameter ranges
Export discoveries to Excel

Execution Modes

Full Automated Execution

# Run everything
python run.py all

Selective Execution

# RDM only (stages 1-3)
python run.py rdm

# PRIM only (stages 4-5, requires RDM results)
python run.py prim

Manual Stage Execution

# Individual DVC stages
conda run -n AFR-RDM-env dvc repro base_future
conda run -n AFR-RDM-env dvc repro rdm_experiment
conda run -n AFR-RDM-env dvc repro postprocess

Data Flow

Input Data

src/workflow/0_Scenarios/Scenario1.txt

Contains:

Set definitions (YEAR, TECHNOLOGY, FUEL, etc.)
Parameter values (costs, capacities, demands, etc.)
Constraints and bounds

Intermediate Data

src/workflow/1_Experiment/
├── Executables/           # Base future results
│   └── Scenario1_0/
└── Experimental_Platform/
    └── Futures/           # RDM futures
        └── Scenario1/
            ├── Scenario1_1/
            ├── Scenario1_2/
            └── ...

Output Data

Results/
└── solution_status.txt                    # Per-future solver status (optimal/infeasible)

src/Results/
├── OSEMOSYS_{Region}_Energy_Output.csv
├── OSEMOSYS_{Region}_Energy_Input.csv
└── *.parquet files

OSeMOSYS Compatibility

OSeMOSYS-RDM is designed for the GNU MathProg implementation of OSeMOSYS.

Tested Formulations

The workflow has been tested with MUIO v5.3 and v5.4:

Reference formulations: model.v.5.3.txt and model.v.5.4.txt
Standard OSeMOSYS sets and parameters
Support for storage and user-defined constraints
v5.4 requires the automatic data preprocessing step (handled by preprocess_data.py)

Compatible Model Features

Feature	Support
Multi-region	✅ Yes
Storage	✅ Yes
User-defined constraints	✅ Yes
Time slicing	✅ Yes
Trade flows	✅ Yes
Emissions	✅ Yes