Difference between revisions of "Data Assessment, Harmonisation, and Certification Facilities"

From Gcube Wiki
Jump to: navigation, search
(Main Components)
(Main Components)
Line 30: Line 30:
 
; Tabular Data
 
; Tabular Data
 
:this family of components provides:  
 
:this family of components provides:  
:* [[Tabular Data Flow Manager]]: a tabular data flow mechanism
+
:* [[Tabular Data Flow Manager]]: a service providing tabular data flow management.
:* [[Tabular Data Manager]]: tabular data visualization and elaboration facilities.
+
:* [[Tabular Data Manager]]: a set of libraries for tabular data visualization and management.
 
; Time Series
 
; Time Series
 
:this family of components provides:  
 
:this family of components provides:  
Line 38: Line 38:
 
; Biodiversity Data
 
; Biodiversity Data
 
:this family of components provides:  
 
:this family of components provides:  
:* [[Occurrence Data Reconciliation]]: this family of components provides a service for performing assessment and harmonization on occurrence points of species.
+
:* [[Occurrence Data Reconciliation]]: a service for performing assessment and harmonization on occurrence points of species.
:* [[Occurrence Data Enrichment Service]]: this family of components provides a service for performing enrichment of information associated to occurrence points of species.
+
:* [[Occurrence Data Enrichment Service]]: a service for performing enrichment of information associated to occurrence points of species.
:* [[Taxon names reconciliation service|Taxon Names Reconciliation Service]]:this family of components provides a service for performing assessment and harmonization on taxa
+
:* [[Taxon names reconciliation service|Taxon Names Reconciliation Service]]: a service for performing assessment and harmonization on taxa.

Revision as of 18:49, 14 May 2012

Overview

The iMarine EA Community of Practice works on a wide range of data types with different sources and targets; our main objective is to let the user consume this data in an uniform and useful way. Data retrieved using different format and protocol is harmonizated and assessed using the service offered by this area and by the one already present in the system.

In order to meet the requirements a number of components have been designed.

This document outlines the design rationale and high-level architecture of such components.

Key Features

The components part of the subsystem provide the following main key features:

Pluglable data consumption services
the data consumption service follow a plugin architecture where the plugin provide access to a type of datasource
Component reuse
Components are designed to be reused in different services
Geospatial data production in OGC standard format
Geospatial data can be returned to invoking clients in a OGC standard format
Species Occurrence points enrichment harmonization and reconciliation
species occurrence points can be associated to environmental data. Moreover they can be merged when coming from different data sources.
Single access point for geospatial data retrieval
A single access point service can be used to retrieve geospatial data.
General purpose tabular data processing
A set of services provide a tabular data flow mechanism and a set of components for tabular data visualization.
Data mining operation on species data
Occurrence data can be processed, clustered and hidden information can be extracted by means of data mining operations.

Main Components

Tabular Data
this family of components provides:
Time Series
this family of components provides:
  • TimeSeries: a service for performing assessment and harmonization on time series.
  • Codelist Manager: a library for performing import, harmonization and curation on code lists.
Biodiversity Data
this family of components provides: