Input data¶

Datamodel¶

The oedatamodel-parameter (docs) has to be used to provide input data in SEDOS. This choice was made to allow ontological annotation of data. The data model consists of two table types: "scalar" and "timeseries". Please choose the table type accordingly, depending on the data you want to upload:

Use scalar tables to provide parameters with no relation to a timeindex.
To properly fill the table, read the scalar model column description.
Use the example scalar package metadata to write your own metadata (see section How to contribute data - 2. Create metadata
Use a timeseries table to provide parameters with relation to a timeindex.
Use the timeseries model column description to make yourself familiar with the fields and add metadata to your tables using the example timeseries package metadata as a reference.

Datamodel features

type - use to specify multiple processes in one csv table (see example)
bandwidth_type - specify how the data bandwidths are interpreted

Input and output energy vectors¶

The input and output energy vectors of processes in SEDOS' reference energy system are defined in an external table on the BW Sync&Share, in the sheet Parameter_Input-Output.

For processes with multiple input and/or output energy vectors it might not clear to which energy vector a parameter column refers. Thus, the information has to be specified if needed.

By default, all parameters of a process as assigned to all inputs and outputs of a process from the BW Sync&Share table - sheet: Process_Set (the default does not appear in the input_output sheet, but is used in the backend of the data pipeline).

If needed, the default can be overwritten, simply by assigning other input(s) and output(s) to a specific parameter of the process in the Parameter_Input-Output sheet.

input_output insertion conventions

Enclose substituting inputs or outputs in squared brackets [ ]. E.g.
separate inputs or outputs with , (comma)

Example

Parameter_Input-Output sheet

parameter	process	input	output
energy_conversion_efficiency_diesel	mob_road_mcar_ice_pass_diesel		[pkm_road_mcar_short_exo, pkm_road_mcar_long_ex]
energy_conversion_efficiency_syndiesel	mob_road_mcar_ice_pass_diesel		[pkm_road_mcar_short_exo, pkm_road_mcar_long_ex]
emission_factor_diesel	mob_road_mcar_ice_pass_diesel	diesel	CO2
emission_factor_syndiesel	mob_road_mcar_ice_pass_diesel	syndiesel	CO2
parameter1	process 1	[cane, honeymelon], heat, power	[sugar, cinnamon], waste

In the Process_Set sheet, the process is assigned to:

Input	Process	Output
[diesel, syndiesel, biodiesel]	mob_road_mcar_ice_pass_diesel	[pkm_road_mcar_short_exo, pkm_road_mcar_long_exo], CO2

In the example it is assumed that the mob_road_mcar_ice_pass_diesel emits the same amount of CO2 when using diesel or biodiesel as fuel. However, the process has different efficiencies depending on the fuel.

All other process parameters of mob_road_mcar_ice_pass_diesel, such as: investment_cost, operational_life_time, mileage, occupancy_rate, market_share are assigned to the default inputs (diesel, syndiesel_ren, syndiesel_conv, biodiesel) and outputs (pkm_road_mcar_short_exo, pkm_road_mcar_long_exo, CO2) in the backend from the table sheet Process_Set.

It is for the data providers (WP4-8) to assess whether this is correct for each process with respect to the modelling.
If the default is incorrect, the Parameter_Input-Output sheet should be used to specify process parameters' inputs and outputs accordingly.

Data tables¶

Naming conventions for data tables and parameters¶

Note

Parameter name and table name conventions are in place due to technical reasons of the underlying relational postgre-sql database on the OEP.

The following conventions will be automatically checked when uploading a table on the OEP, and error messages will be raised in case of violation. Users need to correct them and their compliance is mandatory.

Do's¶

use ASCII characters only
use lower case only
use the singular instead of the plural.
use underscores

Don'ts¶

table name must not exceed maximal character limit = 50
no points
no commas
no spaces
no special characters
avoid dates
no hyphens. If dates are used, then without -! E.g.: 2022-10-21 will through an error. Use underscore instead!
don't start parameter name with a number

Delimiter¶

Use semicolon ; as the column delimiter.

Decimal separator¶

Use point . as decimal separator.

Versioning convention¶

Versioning pattern

For SEDOS Reference Dataset (SRD)

v + number
Version your data with lowercase letter v and arabic number, e.g.: v1, v2, v3, v4, ..., v12, v13

For SEDOS Scenarios

When the data for the SRD is final, specific scenarios will be uploaded to showcase the application of the SRD in practice. It's done by selecting specific values from SRD's bandwidths and appending them with a scenario version to respective tables.

s + number
Version your data with lowercase letter s and arabic number, e.g.: s1, s2, s3, s4, ..., s12, s13

Increase the version when you want to add or update data to a table that has been already uploaded to the OEP. The oedatamodel-API will append new data versions to an existing OEP table.

Reasoning: By following the versioning convention the end-user only needs to know the latest data version of a given process. Thus, querying the latest process data version will return a full set of coherent input data.
Conversely, users only need to know one version number when querying older data versions to work with a full set of coherent input data for a process.

v1 : Initial data

id	region	year	type	capital_costs	lifetime	bandwidth_type	version	method	source	comment
1	DE	2020		1	5		v1
2	DE	2025		1.5	6		v1

v2 : Adding data to your csv table requires a new version v2 for new datapoints (capital_cost and lifetime in 2030, row=5), including a copy the existing data from v1 as version v2 (reasoning see above)

id	region	year	capital_costs	lifetime	version
1	DE	2020	1	5	v2
2	DE	2025	1.5	6	v2
3	DE	2030	2	8	v2

v3 : Updating a datapoint (lifetime, row=8) requires a new version v3, including a copy the existing data from v2 as version v3

id	region	year	capital_costs	lifetime	version
1	DE	2020	1	5	v3
2	DE	2025	1.5	6	v3
3	DE	2030	2	15	v3

v4 : Updating a datapoint (capital_costs, row=11) requires a new version v4, including a copy the existing data from v3 as version v4

id	region	year	capital_costs	lifetime	version
1	DE	2020	1	5	v4
2	DE	2025	1.5	6	v4
3	DE	2030	4	15	v4

id column numbering

If you check Automatically increase IDs, if IDs are already present in table in the OEDatamodel-api when uploading the data, the id's will be updated automatically - as shown in the Example result on OEP below.

Example result on OEP:

id	region	year	capital_costs	lifetime	version
1	DE	2020	1	5	v1
2	DE	2025	1.5	6	v1
3	DE	2020	1	5	v2
4	DE	2025	1.5	6	v2
5	DE	2030	2	8	v2
6	DE	2020	1	5	v3
7	DE	2025	1.5	6	v3
8	DE	2030	2	15	v3
9	DE	2020	1	5	v4
10	DE	2025	1.5	6	v4
11	DE	2030	4	15	v4

Versioning note

Uploading a row which exists already in: "region, type, year, version" results in an error.
Otherwise single or multiple rows can be added to already existing versions

Datatypes¶

Available datatypes and corresponding formatting examples:

dtype	example	comment
json	"{""key1"": ""text"", ""key2"": ""text""}"	{"key1": "text", ...} might work as well. If you're using an table edititor make sure to use ASCII characters only
float array	[0.02, 279.5]	arrays with strings should be formatted as strings ("['string1', 'string2']"
text	"text"	String data should have text as datatype (text serves as "string")
integer	27
text array	"[""TH"", ""BW""]"	["TH","BW"] might work as well. If you're using an table edititor make sure to use ASCII characters only

Table naming¶

Follow the nomenclature for table and process naming in sheet: Nomenclature_Processes.

Note

Create a new data package for each process, e.g. wind_onshore, chp, ...
Or use the type column to put multiple processes in one data package
You can use a single datapackage for both demand data and other constraint data (tech-independent parameters),e.g.:
- emission limit,
- natural domestic limit,
- WACC

Parameter naming¶

Follow the nomenclature for parameter naming in sheet: Parameter_Set.

Note

Parameter names must be linked to a concept in an ontology

It is importance that every parameter name is linked to a suitable ontological concept via the metadata to enable its clear interpretation.
In case you cannot find a suitable ontology concept (e.g. because it's not in the ontology yet), please make sure your chosen parameter name is clear and common in the domain. In this case avoid acronyms.

Ontological annotations

In the nomenclature for parameter naming in sheet: Parameter_Set, we made suggestions for suitable ontology concepts for usage across the SEDOS APs.

Please use them or make suggestions of better suitable concepts.

For more background information regarding for linking parameter names and ontology concepts, using OEM, see here.

For a practical manual linking parameter names to a suitable ontology concepts, using oemetadata builder, see here.

Parameter naming in case of missing suitable ontology concept¶

It is likely that not every parameter concept is already covered by an ontology. In this case, linking your parameter name to a suitable ontology concept is not directly possible.

It might, however, be indirectly possible by linking it to a distinct selection of related ontology concepts. In this case you would link your parameter name to multiple related concepts.

For a practical manual linking parameter names to multiple related ontology concepts, using oemetadata builder, see here.