How to Contribute

This document describes how researchers and users can contribute their own benchmarks (resources) to the WaterBenchmarkHub.

To add a new benchmark (resource), clone the WaterBenchmarkHub Git repository and submit a pull request with all necessary changes (see the following sections). The WaterBenchmarkHub team reviews each pull request before it is accepted and integrated into the WaterBenchmarkHub.

General

When adding a new benchmark (resource), there are a few aspects that must be considered:

Naming

The unique identifier of a new benchmark (resource) must consist of two parts:

Category or user/organization
Name of the benchmark (resource)

Both parts (lowercase!) are joined by a hyphen. For example, if we add a network called Net2, the final identifier is network-net2 (the category network denotes a network resource).
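
The naming rule can be sketched as a small helper. This is only an illustration: make_benchmark_id is a hypothetical name and is not part of the WaterBenchmarkHub code base.

```python
def make_benchmark_id(prefix: str, name: str) -> str:
    """Build a unique ID of the form '<prefix>-<name>' in lowercase.

    `prefix` is the category or the user/organization; `name` is the
    name of the benchmark (resource).
    """
    prefix, name = prefix.strip(), name.strip()
    if not prefix or not name:
        raise ValueError("both parts of the identifier must be non-empty")
    return f"{prefix}-{name}".lower()


print(make_benchmark_id("network", "Net2"))  # network-net2
print(make_benchmark_id("KIOS", "LeakDB"))   # kios-leakdb
```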

Meta data

For each benchmark (resource), a new entry (dictionary) must be created in the database docs/static/database.json with the following fields. If a field is not available, it should be dropped from the database entry:

name: Name of the benchmark (resource).
desc: Brief description of the benchmark (resource).
year: Year in which the benchmark (resource) was published.
doi: The DOI of the benchmark (resource) or of the corresponding publication.
license: The license of the benchmark (resource).
tags: List of (existing) tags associated with the benchmark (resource). Use existing tags and avoid inventing new ones!
keywords: Additional keywords associated with the benchmark (resource). These keywords are never displayed but are used when the user searches for benchmarks (resources).
download_url: External URL where the benchmark (resource) can be downloaded. This should be a direct link to the resource file(s).
external_url: External URL of the project page of the benchmark (resource).
permalink: Usually given as benchmarks/benchmark-ID.html, where benchmark-ID refers to the unique ID of the benchmark (resource).

Note: The dictionary key (lowercase!) in the database must be the unique identifier of the benchmark (resource).

A complete example of the KIOS LeakDB benchmark is given below:

"kios-leakdb": {
    "name": "LeakDB",
    "desc": "LeakDB (Leakage Diagnosis Benchmark) is a realistic leakage dataset for water distribution networks.",
    "year": 2018,
    "tags": ["leakage detection", "leakage localization", "leakage", "event detection"],
    "keywords": ["Net1", "Hanoi"],
    "doi": "10.5281/zenodo.1313116",
    "license": "EUPL-1.2",
    "download_url": "https://ucy-my.sharepoint.com/:f:/g/personal/mkiria01_ucy_ac_cy/Eiyah0-TL4dGqt9K4Ln5TN0BRlroASbX35p53bS7or4j5A",
    "external_url": "https://github.com/KIOS-Research/LeakDB",
    "permalink": "benchmarks/KIOS-LeakDB.html"
}
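
Before opening a pull request, it may help to sanity-check a new entry. The following is a minimal sketch, not a tool shipped with the repository: check_entry and KNOWN_FIELDS are hypothetical names, the field list is copied from the section above, and the hyphen check is an assumption derived from the naming rule.

```python
import json

# Field names as listed in the "Meta data" section.
KNOWN_FIELDS = {
    "name", "desc", "year", "doi", "license", "tags",
    "keywords", "download_url", "external_url", "permalink",
}


def check_entry(benchmark_id: str, entry: dict) -> list[str]:
    """Return a list of problems found in one database entry (empty if none)."""
    problems = []
    if benchmark_id != benchmark_id.lower():
        problems.append(f"key '{benchmark_id}' must be lowercase")
    if "-" not in benchmark_id:
        problems.append(f"key '{benchmark_id}' must look like '<category>-<name>'")
    for field in sorted(set(entry) - KNOWN_FIELDS):
        problems.append(f"unknown field '{field}'")
    return problems


# A shortened entry, parsed from JSON text for illustration.
entry = json.loads("""
{
    "name": "LeakDB",
    "year": 2018,
    "tags": ["leakage detection"],
    "permalink": "benchmarks/KIOS-LeakDB.html"
}
""")

print(check_entry("kios-leakdb", entry))  # []
print(check_entry("KIOS-LeakDB", entry))  # ["key 'KIOS-LeakDB' must be lowercase"]
```

To check the real database, the same function could be applied to every key/value pair obtained from json.load() on docs/static/database.json.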

Description

The complete description of the benchmark resource must be placed in a Markdown file in the docs/_benchmarks folder. The name of the Markdown file must be the same as specified in the permalink field in the database.

The Markdown file must start with the following header:

---
title: "BenchmarkName"
id: "benchmark-ID"
permalink: benchmark-permalink
collection: benchmarks
layout: benchmark
---

where BenchmarkName refers to the name, benchmark-ID to the unique ID (see above), and benchmark-permalink to the permalink of the benchmark (resource) as specified in the database. All other parts of the header must not be changed!
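
As an illustration, the header can be generated from the three values that vary. This is a sketch only; front_matter is a hypothetical helper, not part of the repository, and the example values are taken from the LeakDB entry above.

```python
def front_matter(title: str, benchmark_id: str, permalink: str) -> str:
    """Return the header block expected at the top of the Markdown file."""
    return "\n".join([
        "---",
        f'title: "{title}"',
        f'id: "{benchmark_id}"',
        f"permalink: {permalink}",
        "collection: benchmarks",  # fixed part of the header
        "layout: benchmark",       # fixed part of the header
        "---",
    ]) + "\n"


print(front_matter("LeakDB", "kios-leakdb", "benchmarks/KIOS-LeakDB.html"))
```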

Regarding the description itself, there are no strict rules on how to structure the Markdown file or what to put into it. However, we recommend the following layout:

## Description: General (detailed) description of the benchmark (resource).
## How to Use: Description of how to load and use the benchmark (resource).
### Usage in Python: If applicable, a description of how to load the benchmark (resource) in Python using the water-benchmark-hub Python package.
## Reference: Reference (citation) of the benchmark (resource).

Images or other resources used in the Markdown file must be placed in the static/benchmarks/benchmark-ID/ folder, where benchmark-ID (lowercase!) refers to the benchmark ID as given in the permalink field.

Python package

In addition to the previous steps, the benchmark (resource) should, if possible, also be made available in the water-benchmark-hub Python package. For this, the following steps are necessary:

If the new benchmark (resource) is a network, a new class has to be derived from the WaterDistributionNetwork class and put into src/water_benchmark_hub/networks/networks.py (if justified, a new .py file can also be created).

For all other types of benchmarks (resources), a new directory should be created and the benchmark itself must be implemented by deriving a new class from the BenchmarkResource class and importing this new class in src/water_benchmark_hub/__init__.py.
In all cases, the new benchmark (resource) must be registered by calling the register() function right after the class declaration. The argument is a key that is used to load the benchmark (resource); it is usually similar to the unique ID of the benchmark (resource) as used in the database, but nicely formatted.
Also, in all cases, the new class must be decorated with the @meta_data() decorator. The argument is the unique ID of the benchmark (resource) as used in the database.