Generalize the benchmarking framework at country level by tgilon · Pull Request #543 · open-energy-transition/open-tyndp

tgilon · 2026-03-13T17:28:02Z

Closes #540.

Changes proposed in this Pull Request

This PR extends the benchmarking framework with new bus-level and country-level overviews. Where data is available, benchmarks are produced both at EU27 level and at a spatially resolved level for each technology in a table.

The carrier-level tables now report on missing buses/countries (i.e. where a carrier is not defined for a given bus/country). The table-level tables report on missing carriers (i.e. where Open-TYNDP and reference carriers do not match) and the maximum number of missing buses/countries from the underlying carrier data.
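The counting described above can be sketched as follows. This is only an illustration, not the framework's code: the function names, the toy data and the use of a symmetric difference to decide what counts as "missing" are all assumptions.

```python
def missing_regions(model, reference):
    """Per shared carrier, list the regions defined in only one dataset."""
    missing = {}
    for carrier in sorted(set(model) & set(reference)):
        # Assumption: a region is "missing" if only one side defines the carrier there
        missing[carrier] = set(model[carrier]) ^ set(reference[carrier])
    return missing


def table_summary(model, reference):
    """Aggregate the carrier-level results into the two table-level counters."""
    per_carrier = missing_regions(model, reference)
    return {
        # Carriers that do not match between the two datasets
        "missing_carriers": len(set(model) ^ set(reference)),
        # Largest number of missing regions over the shared carriers
        "missing_regions": max((len(v) for v in per_carrier.values()), default=0),
    }


# Hypothetical toy data: carrier -> set of countries where it is defined
model = {"solar": {"BE", "DK", "ES"}, "wind offshore": {"BE", "DK"}, "OCGT": {"ES"}}
reference = {"solar": {"BE", "DK", "ES"}, "wind offshore": {"BE", "DK", "NL"}}

summary = table_summary(model, reference)
print(summary)
```

Taking the maximum rather than the sum matches the description of the table-level counter: one carrier with many missing buses sets the table-level value on its own.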

Tasks

Workflow

(new configuration) benchmarking.spatial.by_bus and benchmarking.spatial.by_country, which control whether benchmarking is performed at each of these levels
clean_tyndp_output_benchmark produces bus-level data where possible
clean_tyndp_report_benchmark has been refactored for clarity
make_benchmark and plot_benchmark always generate by_bus and by_country outputs. These outputs are empty if the configuration is disabled.
General improvements to the mappings and to the code structure
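A minimal sketch of how the two switches could gate the outputs while still producing both files. The keys `benchmarking.spatial.by_bus` and `benchmarking.spatial.by_country` come from the list above; the helper names, the default of `False` and the use of a plain list as the "empty output" are assumptions made for illustration.

```python
def spatial_levels(config):
    """Read the two spatial switches from a nested config dictionary."""
    spatial = config.get("benchmarking", {}).get("spatial", {})
    # Assumption: a missing key means the level is disabled
    return {level: bool(spatial.get(level, False)) for level in ("by_bus", "by_country")}


def build_outputs(config, compute):
    """Always return both outputs; a disabled level yields an empty result."""
    levels = spatial_levels(config)
    return {level: compute(level) if enabled else [] for level, enabled in levels.items()}


# Hypothetical configuration with only the country level enabled
config = {"benchmarking": {"spatial": {"by_bus": False, "by_country": True}}}
outputs = build_outputs(config, compute=lambda level: [f"{level} benchmark rows"])
print(outputs)
```

Always emitting both outputs keeps the set of files a workflow rule declares fixed, whatever the configuration.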

Open issues

Power capacities

Belgium: Open-TYNDP reports an extra 3 GW of wind offshore compared to the Market Model (MM) in both 2030 and 2040. However, the MM places this capacity in Denmark. This is a known issue with the naming conventions for the offshore topology ([SPIKE] investigate differences in offshore hub modelling #513).
Denmark: Apart from the wind offshore issue described in the Belgium section, Open-TYNDP reports an extra 1.4 GW of wind onshore compared to the MM in 2040 (Fix wrong onwind capacity in DK for NT 2040 #558).
Spain: The PEMMDB for ES00 has only 20.5 GW of Gas CCGT present 1, whereas the MM reports 24.49 GW. Adding the Other Non-RES value for ES00 to the PEMMDB value reproduces the MM value for Gas CCGT present 1: 20,517.56 MW + 3,980.011 MW = 24,497.571 MW. This suggests that the Other Non-RES gas power plants may be double counted.
Malta: Open-TYNDP reports an extra 0.1 GW of solar compared to the MM in 2040 (Fix wrong solar capacity in MT for NT 2040 #559).
Netherlands: Open-TYNDP reports 1 GW less wind offshore than the MM in both 2030 and 2040. This is a rounding issue: Open-TYNDP reports 50 300 MW (rounded to 50 GW) and the MM reports 50 542.5 MW (rounded to 51 GW).
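The arithmetic behind the Spain and Netherlands items can be checked with a few lines. The figures are taken from the list above; the two Spanish addends are read as MW, since they correspond to the 20.5 GW quoted for the PEMMDB. Variable names are illustrative.

```python
# Spain: PEMMDB Gas CCGT present 1 plus Other Non-RES, both read as MW
pemmdb_ccgt_mw = 20_517.56
other_non_res_mw = 3_980.011
combined_mw = pemmdb_ccgt_mw + other_non_res_mw
print(f"ES00 combined: {combined_mw:,.3f} MW")

# Netherlands: both sources rounded to whole GW, as in the benchmark figures
nl_open_tyndp_mw = 50_300.0
nl_mm_mw = 50_542.5
nl_open_tyndp_gw = round(nl_open_tyndp_mw / 1000)
nl_mm_gw = round(nl_mm_mw / 1000)
print(f"NL rounded: {nl_open_tyndp_gw} GW vs {nl_mm_gw} GW")

# The underlying gap is much smaller than the 1 GW shown after rounding
nl_gap_mw = nl_mm_mw - nl_open_tyndp_mw
print(f"NL actual gap: {nl_gap_mw} MW")
```

The Dutch gap before rounding is 242.5 MW, so the 1 GW difference appears only because the two values fall on either side of the 50.5 GW rounding boundary.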

Hydrogen demand

Hydrogen demand for power generation is still too low, especially in 2040.

Notes

Using the latest produced results (20260311), we get the following figures and tables. A set of figures has been made available here to make the review easier.

Overall overview by countries

	sMPE	sMAPE	sMdAPE	RMSLE	Growth Error	Missing carriers	Missing countries	reference	version
biomass_supply	0.21	0.63	0.32	1.24	0	0		TYNDP 2024 Scenarios Report	v0.5.1+g0cfc7814b
elec_demand	0	0	0	0	0	0		TYNDP 2024 Scenarios Report	v0.5.1+g0cfc7814b
energy_imports	0.58	0.58	0.4	1.11	0.02	1	0	TYNDP 2024 Scenarios Report	v0.5.1+g0cfc7814b
final_energy_demand	-0.1	0.27	0.09	0.56	0.01	0		TYNDP 2024 Scenarios Report	v0.5.1+g0cfc7814b
generation_profiles						NA	NA		v0.5.1+g0cfc7814b
hydrogen_demand	-0.27	0.35	0	2.99	0	0		TYNDP 2024 Market Model Outputs	v0.5.1+g0cfc7814b
hydrogen_supply	-0.24	0.64	0.4	1.64	-0.41	3	0	TYNDP 2024 Market Model Outputs	v0.5.1+g0cfc7814b
methane_demand	0.04	0.11	0.09	0.14	0	0		TYNDP 2024 Scenarios Report	v0.5.1+g0cfc7814b
methane_supply	0.15	0.15	0.12	0.16	0.01	4	0	TYNDP 2024 Scenarios Report	v0.5.1+g0cfc7814b
power_capacity	0.06	0.16	0	3.61	0	3	0	TYNDP 2024 Market Model Outputs	v0.5.1+g0cfc7814b
power_generation	-0.07	0.29	0.01	3.98	0.02	3	0	TYNDP 2024 Market Model Outputs	v0.5.1+g0cfc7814b
Total (excl. time series)	0.04	0.38	0.05	1.56	0	11			v0.5.1+g0cfc7814b
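The PR text does not define the indicators in the table header. The sketch below uses common definitions of the symmetric percentage errors (bounded by 2 in absolute value, which is consistent with sMAPE values above 1 in the tables) and of RMSLE. The sign convention (model minus reference), the handling of zero pairs and the function names are assumptions, so the framework's own formulas may differ.

```python
import math
from statistics import mean, median


def symmetric_errors(model, reference):
    """Signed symmetric relative errors, each within [-2, 2]."""
    errors = []
    for m, r in zip(model, reference):
        denom = abs(m) + abs(r)
        # Assumption: a pair of zeros counts as a perfect match
        errors.append(0.0 if denom == 0 else 2 * (m - r) / denom)
    return errors


def indicators(model, reference):
    """Compute sMPE, sMAPE, sMdAPE and RMSLE for non-negative paired values."""
    errs = symmetric_errors(model, reference)
    abs_errs = [abs(e) for e in errs]
    log_diffs = [math.log1p(m) - math.log1p(r) for m, r in zip(model, reference)]
    return {
        "sMPE": mean(errs),          # signed bias
        "sMAPE": mean(abs_errs),     # average magnitude
        "sMdAPE": median(abs_errs),  # typical magnitude, robust to outliers
        "RMSLE": math.sqrt(mean([d * d for d in log_diffs])),
    }


# Hypothetical toy values: one exact match, one underestimate, one zero pair
model = [100.0, 50.0, 0.0]
reference = [100.0, 150.0, 0.0]
result = indicators(model, reference)
print(result)
```

Under these definitions a single large mismatch raises sMAPE and RMSLE while leaving sMdAPE at zero, the same pattern as the rows above where sMdAPE is 0 but RMSLE is large.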

Overall overview by bus

	sMPE	sMAPE	sMdAPE	RMSLE	Growth Error	Missing carriers	Missing buses	reference	version
biomass_supply	0.21	0.63	0.32	1.24	0	0	0	TYNDP 2024 Scenarios Report	v0.5.1+g0cfc7814b
elec_demand	0	0	0	0	0	0	0	TYNDP 2024 Scenarios Report	v0.5.1+g0cfc7814b
energy_imports	0.58	0.58	0.4	1.11	0.02	1	0	TYNDP 2024 Scenarios Report	v0.5.1+g0cfc7814b
final_energy_demand	-0.1	0.27	0.09	0.56	0.01	0	0	TYNDP 2024 Scenarios Report	v0.5.1+g0cfc7814b
generation_profiles						NA	NA		v0.5.1+g0cfc7814b
hydrogen_demand	-0.27	0.35	0	2.99	0	0	0	TYNDP 2024 Market Model Outputs	v0.5.1+g0cfc7814b
hydrogen_supply	-0.24	0.64	0.4	1.64	-0.41	3	0	TYNDP 2024 Market Model Outputs	v0.5.1+g0cfc7814b
methane_demand	0.04	0.11	0.09	0.14	0	0	0	TYNDP 2024 Scenarios Report	v0.5.1+g0cfc7814b
methane_supply	0.15	0.15	0.12	0.16	0.01	4	0	TYNDP 2024 Scenarios Report	v0.5.1+g0cfc7814b
power_capacity	0.06	0.19	0	3.91	0	3	32	TYNDP 2024 Market Model Outputs	v0.5.1+g0cfc7814b
power_generation	-0.08	0.27	0	4.1	0.02	3	32	TYNDP 2024 Market Model Outputs	v0.5.1+g0cfc7814b
Total (excl. time series)	-0.1	0.21	0	1.93	0	16			v0.5.1+g0cfc7814b

Power capacities at country-level

benchmark_power_capacity_EU27_cy2009_2040

	sMPE	sMAPE	sMdAPE	RMSLE	Growth Error	Missing buses	Missing countries	reference	version
battery	0.47	1.25	1.41	10.34	-1.75	0		TYNDP 2024 Market Model Outputs	v0.5.1+g0cfc7814b
chp and small thermal	0	0	0	0	0	0		TYNDP 2024 Market Model Outputs	v0.5.1+g0cfc7814b
coal + other fossil (incl. biofuels)	-0.02	0.02	0	0.06	-0.02	0		TYNDP 2024 Market Model Outputs	v0.5.1+g0cfc7814b
hydro and pumped storage	0	0	0	0	0	0		TYNDP 2024 Market Model Outputs	v0.5.1+g0cfc7814b
hydrogen	0	0	0	0	0	0		TYNDP 2024 Market Model Outputs	v0.5.1+g0cfc7814b
methane	0	0.01	0	0.04	0.01	0		TYNDP 2024 Market Model Outputs	v0.5.1+g0cfc7814b
nuclear	0	0	0	0	0	0		TYNDP 2024 Market Model Outputs	v0.5.1+g0cfc7814b
oil (incl. biofuels)	0	0	0	0	0	0		TYNDP 2024 Market Model Outputs	v0.5.1+g0cfc7814b
small scale res	0	0	0	0	0	0		TYNDP 2024 Market Model Outputs	v0.5.1+g0cfc7814b
solar	0.01	0.01	0	0.06	0	0		TYNDP 2024 Market Model Outputs	v0.5.1+g0cfc7814b
wind offshore	0	0.03	0	0.11	-0.04	0		TYNDP 2024 Market Model Outputs	v0.5.1+g0cfc7814b
wind onshore	0	0	0	0.02	0	0		TYNDP 2024 Market Model Outputs	v0.5.1+g0cfc7814b
solar thermal							0	TYNDP 2024 Market Model Outputs	v0.5.1+g0cfc7814b
demand shedding							0	TYNDP 2024 Market Model Outputs	v0.5.1+g0cfc7814b
OCGT							0	TYNDP 2024 Market Model Outputs	v0.5.1+g0cfc7814b

Power capacities at bus-level

benchmark_power_capacity_ITCA_cy2009_2040

	sMPE	sMAPE	sMdAPE	RMSLE	Growth Error	Missing buses	reference	version
battery	0.52	1.21	1.33	10.53	-1.75	2	TYNDP 2024 Market Model Outputs	v0.5.1+g0cfc7814b
chp and small thermal	0	0	0	0.01	0	0	TYNDP 2024 Market Model Outputs	v0.5.1+g0cfc7814b
coal + other fossil (incl. biofuels)	-0.02	0.02	0	0.06	-0.02	0	TYNDP 2024 Market Model Outputs	v0.5.1+g0cfc7814b
hydro and pumped storage	0	0	0	0	0	0	TYNDP 2024 Market Model Outputs	v0.5.1+g0cfc7814b
hydrogen	0	0	0	0	0	0	TYNDP 2024 Market Model Outputs	v0.5.1+g0cfc7814b
methane	0	0.01	0	0.04	0.01	0	TYNDP 2024 Market Model Outputs	v0.5.1+g0cfc7814b
nuclear	0	0	0	0	0	0	TYNDP 2024 Market Model Outputs	v0.5.1+g0cfc7814b
oil (incl. biofuels)	0	0	0	0	0	0	TYNDP 2024 Market Model Outputs	v0.5.1+g0cfc7814b
small scale res	0	0	0	0	0	0	TYNDP 2024 Market Model Outputs	v0.5.1+g0cfc7814b
solar	0	0	0	0.05	0	0	TYNDP 2024 Market Model Outputs	v0.5.1+g0cfc7814b
wind offshore	-0.17	0.22	0	0.5	0.03	32	TYNDP 2024 Market Model Outputs	v0.5.1+g0cfc7814b
wind onshore	0	0	0	0.03	0	0	TYNDP 2024 Market Model Outputs	v0.5.1+g0cfc7814b
solar thermal						0	TYNDP 2024 Market Model Outputs	v0.5.1+g0cfc7814b
demand shedding						0	TYNDP 2024 Market Model Outputs	v0.5.1+g0cfc7814b
OCGT						0	TYNDP 2024 Market Model Outputs	v0.5.1+g0cfc7814b

Checklist

lisazeyen

Thanks a lot @tgilon! This is a massive improvement for the benchmarking and helps us a lot in analysing the results.

lisazeyen · 2026-03-24T07:31:03Z

-        ax.bar_label(c, fmt="%.0f", padding=3, fontsize=8)
+        ax.bar_label(
+            c,
+            fmt=lambda x: f"{x:.1f}" if 0 < abs(x) < 10 else f"{x:.0f}",


It is not good practice to use different rounding within the same plot because it reduces comparability. It is better to show all numbers in the same format.

Suggested change

fmt=lambda x: f"{x:.1f}" if 0 < abs(x) < 10 else f"{x:.0f}",

fmt=lambda x: f"{x:.1f}",

I get your point. Some of the numbers are already overlapping. Adding more numbers will make it even worse.

I agree with both sentiments. Maybe you could increase the width of the plot so that the bars are wider giving more room to the figure labels?

We could also reduce the font or tilt the text

Good ideas, we can make the figure wider, rotate the text, reduce font size, have different label positions

for i, (bar, val) in enumerate(zip(bars, values)):
    offset = 0.05 if i % 2 == 0 else 0.15
    ax.text(
        bar.get_x() + bar.get_width() / 2,
        bar.get_height() + offset,
        f'{val:.1f}',
        ha='center',
        va='bottom',
    )

I would favour that instead of having different roundings.

Here is my suggestion: when we have large numbers and / or many sources, the bar labels are rotated.

Any further thoughts? @lisazeyen @daniel-rdt

To me, this looks much better! Thank you @tgilon! We could even have it always rotated by 90° so it is consistent. But no strong opinion on that.
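The outcome of this thread (one number format for every label, with rotation on dense plots) could be sketched as a small helper. The helper name and both thresholds are hypothetical and are not taken from the merged code. The returned keys are keyword arguments that matplotlib's `Axes.bar_label` accepts, either directly (`fmt`, `padding`) or as text properties (`rotation`, `fontsize`).

```python
def bar_label_style(values, n_sources, max_abs=100, max_sources=4):
    """Return bar_label keyword arguments with a single format for all labels."""
    # Hypothetical rule: rotate when numbers are large and/or there are many sources
    dense = n_sources > max_sources or any(abs(v) >= max_abs for v in values)
    return {
        "fmt": "%.1f",                  # same rounding for every label
        "rotation": 90 if dense else 0,
        "padding": 3,
        "fontsize": 8,
    }


sparse = bar_label_style([1.2, 9.7, 24.5], n_sources=2)
dense = bar_label_style([120.4, 350.0], n_sources=6)
print(sparse["rotation"], dense["rotation"])

# Usage with matplotlib, for each bar container c of the plot:
#     ax.bar_label(c, **bar_label_style(values, n_sources))
```

Fixing the format and varying only the rotation keeps the labels comparable, which was the concern raised in the review.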

lisazeyen

@tgilon thanks for the changes! I think it is nearly ready to be merged. Only the merge conflicts need to be resolved, and it would be good to change the layout of the figure so that the bar plots can show one more decimal.

tgilon and others added 30 commits March 4, 2026 11:14

feat: process h2 demand time series

01b0d5c

feat: add mm h2 demand time series as optional inputs of prepare_sect…

8826cb7

…or_network

feat: ignore last day of the year

8789aad

feat: add an option to define the reference source for benchmarking

1b040c5

feat: benchmark h2 demand against market outputs

6befe46

feat: make benchmarking source visible on plots

cecb8a9

doc: update the doc accordingly

5435c5a

doc: add release note

8da9276

Merge branch 'master' into feat/480-mm-h2-demand

fdb4836

chore: remove lost config

68dd899

doc: improve documentation

864fcfd

fix: format h2 demand snapshots correctly for incomplete years

1346c4e

fix: make sure to use mm data only for 2030 and 2040

7121b6e

fix: restrict benchmarking using market outputs to NT for now

9a88723

fix: fix return type in prepare_sector_network

84ac688

Merge remote-tracking branch 'origin/master' into feat/480-mm-h2-demand

428fc3b

Conflicts: doc/release_notes.rst

fix: fix conditions for h2_demand in prepare_sector_network

54bf35e

Apply suggestions from code review

e5af0f3

Co-authored-by: Daniel Rüdt <[email protected]>

refactor: use align_demand_to_snapshots

79322f4

refactor: improve naming convention for sources

b7afcc3

refactor: improve robustness of SOURCES_MAP access

fc7aa2c

Merge branch 'master' into feat/480-mm-h2-demand

fa1baa3

Apply suggestions from code review

8e858ec

feat: make reference sources list configurable

09b5837

Merge branch 'master' into feat/480-mm-h2-demand

40496ac

feat: generalize build_statistics

e50316f

Merge branch 'feat/480-mm-h2-demand' into feat/540-bench-by-ct

84a2616

Conflicts: scripts/_helpers.py

feat: generalise clean_tyndp_output_benchmark at ct level

19c2221

Merge branch 'master' into feat/480-mm-h2-demand

7fd08b6

Conflicts: scripts/_helpers.py

Merge remote-tracking branch 'origin/feat/480-mm-h2-demand' into feat…

e01eca8

…/540-bench-by-ct

tgilon added 8 commits March 19, 2026 14:58

refactor: create a clean_data_for_benchmarking function in clean_tynd…

151d889

…p_report_benchmakr

chore: improve plotting

a0ee4e3

Merge branch 'master' into feat/540-bench-by-ct

d13ab31

Conflicts: config/benchmarking.default.yaml rules/sb.smk scripts/sb/clean_tyndp_output_benchmark.py scripts/sb/clean_tyndp_report_benchmark.py scripts/sb/make_benchmark.py scripts/sb/plot_benchmark.py

refactor: create a grouping function in clean_tyndp_report_benchmark

06ecfb9

feat: implement other indicators

68bd59f

fix: create accurate countries when reading data

0b45402

doc: add release note and update doc

b0a9a14

fix: adjust bus names in market model outputs for hydrogen

bc9207f

This was referenced Mar 20, 2026

[SUB] Add a load-weighted average indicator for each table #556

Open

[SUB] Add prices to the benchmarking framework #557

Closed

Fix wrong onwind capacity in DK for NT 2040 #558

Closed

Fix wrong solar capacity in MT for NT 2040 #559

Closed

tgilon added 3 commits March 20, 2026 11:26

feat: include biofuels to methane

9d0c09b

Apply suggestions from code review

a9ae79f

fix: apply biofuels grouping to generation

8095fc6

tgilon marked this pull request as ready for review March 20, 2026 10:57

tgilon requested review from daniel-rdt and lisazeyen March 20, 2026 10:57

fix: make make_benchmark rebost to empty dataframe

3251cb2

tgilon mentioned this pull request Mar 20, 2026

feat: Add common data assumptions from PEMMDB #541

Merged

6 tasks

lisazeyen requested changes Mar 24, 2026

tgilon and others added 2 commits March 24, 2026 14:06

Apply suggestions from code review

7af4f1f

Co-authored-by: lisazeyen <[email protected]>

fix: add more improvements after suggestions

f0ffdd8

tgilon requested a review from lisazeyen March 24, 2026 13:51

lisazeyen approved these changes Mar 25, 2026

tgilon added 2 commits March 25, 2026 22:37

Merge branch 'master' into feat/540-bench-by-ct

92121a1

Conflicts: README.md config/benchmarking.default.yaml doc/release_notes.rst

feat: improve dense figure rendering

c827456

lisazeyen removed this from the Release v0.6 milestone Mar 27, 2026

tgilon merged commit 625e84a into master Mar 27, 2026
6 checks passed

tgilon deleted the feat/540-bench-by-ct branch March 27, 2026 16:59

Conversation

lisazeyen Mar 24, 2026

tgilon Mar 24, 2026

daniel-rdt Mar 24, 2026

tgilon Mar 25, 2026

lisazeyen Mar 25, 2026

tgilon Mar 25, 2026

daniel-rdt Mar 26, 2026

3 participants