Difference between revisions of "Data Assessment, Harmonisation, and Certification Facilities"

Latest revision as of 21:05, 16 December 2013

Overview

gCube is a software suite equipped with a rich array of services capable of interfacing with data sources that differ both in the types of data they offer (e.g. document, statistical, biodiversity, and semantic data; see Data Access and Storage Facilities) and in the heterogeneity of data belonging to the same type.

The goal of the Data Assessment, Harmonisation, and Certification Facilities is to deal with the above heterogeneity and provide unified views over diverse data items through a number of dedicated services. To meet this goal a number of components have been designed.

This page outlines the design rationale and high-level architecture of such components.

Key Features

The components of this subsystem provide the following key features:

workflow-oriented tabular data manipulation: definition and execution of user-defined workflows of data manipulation steps; rich array of data manipulation facilities offered 'as-a-Service'; rich array of data mining facilities offered 'as-a-Service'; rich array of data visualisation facilities offered 'as-a-Service'

reference-data management support: uniform model for reference-data representation including versioning and provenance

data curation and enrichment support: enrichment of species occurrence data with environmental data dynamically acquired from data providers; data provenance recording

standards-based data presentation: geospatial data presentation based on OGC standards
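To make the idea of a workflow of data manipulation steps more concrete, here is a minimal sketch in Python. It is purely illustrative and does not use or reproduce any gCube API: the step functions (rename_column, drop_rows_missing, normalise_case), the run_workflow helper, and the sample table are all hypothetical names and data invented for this example.

```python
from typing import Callable, Dict, List

# Hypothetical in-memory representation of a table: a list of rows,
# each row mapping a column name to a string value.
Row = Dict[str, str]
Table = List[Row]
Step = Callable[[Table], Table]


def rename_column(old: str, new: str) -> Step:
    """Build a step that renames one column in every row."""
    def step(table: Table) -> Table:
        return [{(new if k == old else k): v for k, v in row.items()}
                for row in table]
    return step


def drop_rows_missing(column: str) -> Step:
    """Build a step that drops rows whose value in `column` is blank."""
    def step(table: Table) -> Table:
        return [row for row in table if row.get(column, "").strip()]
    return step


def normalise_case(column: str) -> Step:
    """Build a step that trims and upper-cases the values of `column`."""
    def step(table: Table) -> Table:
        return [{**row, column: row[column].strip().upper()} for row in table]
    return step


def run_workflow(table: Table, steps: List[Step]) -> Table:
    """Apply the steps in order; each step receives the previous output."""
    for step in steps:
        table = step(table)
    return table


# Invented sample data, for illustration only.
raw = [
    {"ctry": " it ", "catch": "120"},
    {"ctry": "", "catch": "75"},
    {"ctry": "fr", "catch": "98"},
]

# A user-defined workflow is simply an ordered list of steps.
workflow = [
    rename_column("ctry", "country"),
    drop_rows_missing("country"),
    normalise_case("country"),
]

result = run_workflow(raw, workflow)
print(result)
```

Modelling each step as a function from table to table keeps the steps independent of one another, so a workflow can be reordered or extended without changing the steps themselves.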

Main Components

Tabular Data

This family of components provides:

Tabular Data Service: a service supporting tabular data flow management.

Time Series

This family of components provides:

Time Series: a service for performing assessment and harmonization on time series.
Codelist Manager: a library for performing import, harmonization and curation on code lists.

Biodiversity Data

This family of components provides:

Occurrence Data Reconciliation: a service for performing assessment and harmonization on occurrence points of species.
Occurrence Data Enrichment Service: a service for enriching the information associated with occurrence points of species.
Taxon Names Reconciliation Service: a service for performing assessment and harmonization on taxa.
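
As a rough illustration of what reconciling occurrence points from different sources can involve, the Python sketch below merges records that share a species name and fall at approximately the same coordinates, and records which sources contributed to each merged point. It is an assumption-laden toy, not the algorithm used by the services listed above: the reconcile function, the record fields (species, lat, lon, source), the provider names, and the rounding-based matching rule are all hypothetical.

```python
from typing import Dict, List, Tuple


def reconcile(points: List[dict], decimals: int = 2) -> List[dict]:
    """Merge occurrence points with the same (normalised) species name and
    the same coordinates after rounding to `decimals` decimal places.

    Rounding is a deliberate simplification: two nearby points that fall on
    opposite sides of a rounding boundary are not merged.
    """
    merged: Dict[Tuple[str, float, float], dict] = {}
    for p in points:
        key = (
            str(p["species"]).strip().lower(),
            round(float(p["lat"]), decimals),
            round(float(p["lon"]), decimals),
        )
        if key not in merged:
            merged[key] = {
                "species": key[0],
                "lat": key[1],
                "lon": key[2],
                "sources": [],  # simple provenance: who reported this point
            }
        if p["source"] not in merged[key]["sources"]:
            merged[key]["sources"].append(p["source"])
    return list(merged.values())


# Invented sample records from two hypothetical providers.
points = [
    {"species": "Thunnus albacares", "lat": 12.3401, "lon": 45.1202,
     "source": "providerA"},
    {"species": "thunnus albacares ", "lat": 12.3399, "lon": 45.1198,
     "source": "providerB"},
    {"species": "Gadus morhua", "lat": 60.0, "lon": 5.0,
     "source": "providerA"},
]

reconciled = reconcile(points)
for record in reconciled:
    print(record)
```

Keeping the list of contributing sources on each merged record is one simple way to preserve provenance after points from different providers have been combined.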

@@ Line 1: / Line 1: @@
+[[Category:gCube Features]]
 == Overview ==
-The iMarine EA Community of Practice works on a wide range of data types with different sources and targets; our main objective is to let the user consume this data in an uniform and useful way. Data retrieved using different format and protocol is harmonizated and assessed using the service offered by this area and by the one already present in the system.
+gCube is a software suite equipped with a rich array of services capable to interface with data sources having different characteristics both in terms of data types these sources offers (e.g. from document data, to statistical, biodiversity, and semantic data - see [[Data Access and Storage Facilities]]) and the heterogeneity of data belonging to the same type.
+The goal of the ''Data Assessment, Harmonisation, and Certification Facilities'' is to deal with the above heterogeneity and provide unified views over diverse data items through a number of dedicated services. To meet this goal a number of components have been designed.
-In order to meet the requirements a number of components have been designed.
+This page outlines the design rationale and high-level architecture of such components.
-This document outlines the design rationale and high-level architecture of such components.
 == Key Features ==
@@ Line 11: / Line 12: @@
 The components part of the subsystem provide the following main key features:
-;Pluglable data consumption services
+;workflow-oriented tabular data manipulation
-:the data consumption service follow a plugin architecture where the plugin provide access to a type of datasource
+:user-defined definition and execution of workflows of data manipulation steps
-;Component reuse
+:rich array of data manipulation facilities offered 'as-a-Service'
-:Components are designed to be reused in different services
+:rich array of data mining facilities offered 'as-a-Service'
-;Geospatial data production in [http://www.opengeospatial.org/ OGC standard format]
+:rich array of data visualisation facilities offered 'as-a-Service'
-:Geospatial data can be returned to invoking clients in a OGC standard format
-;Species Occurrence points enrichment harmonization and reconciliation
+;reference-data management support
-:species occurrence points can be associated to environmental data. Moreover they can be merged when coming from different data sources.
+:uniform model for reference-data representation including versioning and provenance
-;Single access point for geospatial data retrieval
-:A single access point service can be used to retrieve geospatial data.
+;data curation and enrichment support
-;General purpose tabular data processing
+:species occurrence data enrichment with environmental data dynamically acquired by data providers
-:A set of services provide a tabular data flow mechanism and a set of components for tabular data visualization.
+:data provenance recording
-;Data mining operation on species data
-:Occurrence data can be processed, clustered and hidden information can be extracted by means of data mining operations.
+;standard-based data presentation
+:[http://www.opengeospatial.org/ OGC standard]-based Geospatial data presentation
 == Main Components ==
@@ Line 30: / Line 32: @@
 ; Tabular Data
 :this family of components provides:
-:* [[Tabular Data Flow Manager]]: a tabular data flow mechanism
+<!--:* [[Tabular Data Flow Manager]]: a service providing tabular data flow management.
-:* [[Tabular Data Manager]]: tabular data visualization and elaboration facilities.
+:* [[Tabular Data Manager]]: a set of libraries for tabular data visualization and management.-->
+:* [[Tabular Data Service]]: a service supporting tabular data flow management;
 ; Time Series
-:* [[Tabular Data Flow Manager]]: a tabular data flow mechanism
+:this family of components provides:
-:* [[Tabular Data Manager]]: tabular data visualization and elaboration facilities.
+:* [[TimeSeries|Time Series]]: a service for performing assessment and harmonization on time series.
+:* [[Codelist Manager]]: a library for performing import, harmonization and curation on code lists.
 ; Biodiversity Data
-:* [[Occurrence Data Reconciliation]]: this family of components provides a service for performing assessment and harmonization on occurrence points of species.
+:this family of components provides:
-:* [[Taxon names reconciliation service|Taxon Names Reconciliation Service]]:this family of components provides a service for performing assessment and harmonization on taxa
+:* [[Occurrence Data Reconciliation]]: a service for performing assessment and harmonization on occurrence points of species.
+:* [[Occurrence Data Enrichment Service]]: a service for performing enrichment of information associated to occurrence points of species.
+:* [[Taxon Names Reconciliation Service]]: a service for performing assessment and harmonization on taxa.
