Data Assessment, Harmonisation, and Certification Facilities

From Gcube Wiki
Jump to: navigation, search

Overview

gCube is a software suite equipped with a rich array of services capable to interface with data sources having different characteristics both in terms of data types these sources offers (e.g. from document data, to statistical, biodiversity, and semantic data - see Data Access and Storage Facilities) and the heterogeneity of data belonging to the same type.

The goal of the Data Assessment, Harmonisation, and Certification Facilities is to deal with the above heterogeneity and provide unified views over diverse data items through a number of dedicated services. To meet this goal a number of components have been designed.

This page outlines the design rationale and high-level architecture of such components.

Key Features

The components part of the subsystem provide the following main key features:

workflow-oriented tabular data manipulation
user-defined definition and execution of workflows of data manipulation steps
rich array of data manipulation facilities offered 'as-a-Service'
rich array of data mining facilities offered 'as-a-Service'
rich array of data visualisation facilities offered 'as-a-Service'
reference-data management support
uniform model for reference-data representation including versioning and provenance
data curation and enrichment support
species occurrence data enrichment with environmental data dynamically acquired by data providers
data provenance recording
standard-based data presentation
OGC standard-based Geospatial data presentation

Main Components

Tabular Data
this family of components provides:
Time Series
this family of components provides:
  • Time Series: a service for performing assessment and harmonization on time series.
  • Codelist Manager: a library for performing import, harmonization and curation on code lists.
Biodiversity Data
this family of components provides: