Reusability (Objectives)#

Data publication and quality assurance of scientific data is an issue that has to be done in other scientific disciplines as well. In the course of this project we always have other sciences in mind.

Standard Procedure#

A draft of the std-doi publication process has to be created. This should be done by flowcharting technique. The flowcharting technique gives the opportunity the describe the process without specification of the objects. The network of the graphical objects are reusable. The description of the objects with the specification of the activities and flow controls depends on the individual technical systems. The four main objects are: Permission, SQA, TQA and Publication.

Permission#

The permission is an individual process and not reusable.

SQA - Scientific Quality Assurance#

In this project a quality assurance toolkit will be developed to test meteorological observation data. The used language will be the R scripting language (http://www.r-project.org/), which is freely available. The implemented software, a R-Package, will be published at the R reposity for extensions CRAN (http://cran.r-project.org/).

TQA - Technical Quality Assurance#

The TQA check list is a proposal for the TQA procedure. It is very general and reusable. The corresponding software depends on the individual system and is not reusable.

TQA check list#

  1. Number of data sets is correct and not equal 0
  2. Size of every data set is not equal 0
  3. The data sets and corresponding metadata are all accessable via internet
  4. The data size is controlled and correct
  5. The time description (metadata) and existency of data are consistent, complete, start-, stop- date are consistent, continuous time steps are correct
  6. The format is correct
  7. Variable description and data are consisten

Publication#

The interface of the publication process are standardized and reusable. The first 3 main steps of the publication process are reusable:
  1. Creation of std-doi metadata and DOI/URN - A xml output of the assistant system will support the creation of the std-doi metadata. Individual metadata has to be added for the publisher, the DOI/URN suffix and data size.
  2. std-doi metadata integration into library catalogue - service available contact: http://www.tib-hannover.de/en/the-tib/doi-registration-agency/
  3. DOI/URN integration into resolver system - service available contact: http://www.tib-hannover.de/en/the-tib/doi-registration-agency/
  4. Link and access via internet to primary data and metadata - This is an individual process and not reusable.

Web based software system#

While developing the web-based assistant system we aim at creating an open architecture that can be easily adapted with respect to
  • data source databases
  • metadata fields
  • workflows
  • entry forms
  • user authentification
The WDC-Climate intends to use the based software system for CMIP5/IPCC (http://cmip-pcmdi.llnl.gov/cmip5/) model output. The WDC-Climate will be responsible for publication: assigning and registering persistent identifiers using the DOI/URN system to those data available for the IPCC fifth assessment report produced at every modelling center for each CMIP5 experiment.
«