Difference between revisions of "Spatial Data Processing"

From Gcube Wiki
Jump to: navigation, search
m (Subsystems)
(Subsystems)
 
(4 intermediate revisions by the same user not shown)
Line 6: Line 6:
 
||__TOC__
 
||__TOC__
 
|}
 
|}
 +
 +
gCube Spatial Data Processing offers a rich array of data analytics methods via OGC Web Processing Service (WPS).
  
 
== Overview ==
 
== Overview ==
Geospatial Data Processing takes advantage of the OGC Web Processing Service (WPS) as web interface to allow for the dynamic deployment of user processes. In this case, [http://www.terradue.com Terradue] selected the [http://52north.org/communities/geoprocessing/wps/index.html 52° North WPS] open source component, as the interface to a Cloud service that we developed to allow the deployment of user “algorithms”. We demonstrate that such user algorithms can be developed to exploit the powerful and distributed framework offered by [http://hadoop.apache.org/mapreduce/ Apache™ Hadoop™ MapReduce]. The [http://svn.research-infrastructures.eu/public/d4science/gcube/trunk/data-analysis/wps-hadoop/ source code] for wrappers and examples developed by Terradue are contributed via the [http://i-marine.eu/ iMarine EC FP7] project to the [http://www.d4science.eu/ D4Science] gCube infrastructure components.
+
Geospatial Data Processing takes advantage of the OGC Web Processing Service (WPS) as web interface.  
 +
It is implemented by relying on the [[Data Mining Facilities | gCube platform for data analytics]].  
  
 
[[File:Spatial_Data_Processing.png|400px|Overall Architecture]]
 
[[File:Spatial_Data_Processing.png|400px|Overall Architecture]]
Line 14: Line 17:
 
== Key Features ==
 
== Key Features ==
  
DataMiner offers a web interface to access the algorithms from external HTTP clients through three different kind of requests, made available through the Web Processing Standard interface:
+
gCube Spatial Data Processing distinguishing features include:
  
* The '''GetCapabilities''' operation provides access to general information about a live WPS implementation, and lists the operations and access methods supported by that implementation. 52N WPS supports the GetCapabilities operation via HTTP GET and POST.
+
; WPS-based access to an open and extensible set of processes
 +
: all the processes hosted by the system are exposed via RESTful protocol enacting clients to be informed on the list of available processes (GetCapabilities), to get the specification of every process (DescribeProcess) and to execute a selected process (Execute); 
  
* The '''DescribeProcess''' operation allows WPS clients to request a full description of one or more processes that can be executed by the service. This description includes the input and output parameters and formats and can be used to automatically build a user interface to capture the parameter values to be used to execute a process.
+
; relying on a Hybrid and Distributed Computing Infrastructure;
 +
: every process can be designed to be executed on diverse and many 'computing nodes' (e.g. R engines, Java);
  
* The '''Execute''' operation allows WPS clients to run a specified process implemented by the server, using the input parameter values provided and returning the output values produced. Inputs can be included directly in the Execute request, or reference web accessible resources.
+
; easy integration of user-defined processes;
 +
: the system enact users to easily add their own algorithms to the set of those offered by the system, e.g. by [[Statistical Algorithms Importer]];
 +
 
 +
; rich array of ready to use processes;
 +
: the system is equipped with a [[Statistical Manager Algorithms | large set of ready to use algorithms]];
 +
 
 +
; open science support
 +
: the system automatically provide for process repeatability and provenance by recording on the [[Workspace]] a comprehensive research object;
  
 
== Subsystems ==
 
== Subsystems ==
Line 28: Line 40:
  
 
;[[Ecological Modeling]]
 
;[[Ecological Modeling]]
 +
: ...
  
 
;[[Signal Processing]]
 
;[[Signal Processing]]
 +
: ...
 +
 +
; [[Geospatial Data Mining]]
 +
: ...

Latest revision as of 18:17, 6 July 2016


gCube Spatial Data Processing offers a rich array of data analytics methods via OGC Web Processing Service (WPS).

Overview

Geospatial Data Processing takes advantage of the OGC Web Processing Service (WPS) as web interface. It is implemented by relying on the gCube platform for data analytics.

Overall Architecture

Key Features

gCube Spatial Data Processing distinguishing features include:

WPS-based access to an open and extensible set of processes
all the processes hosted by the system are exposed via RESTful protocol enacting clients to be informed on the list of available processes (GetCapabilities), to get the specification of every process (DescribeProcess) and to execute a selected process (Execute);
relying on a Hybrid and Distributed Computing Infrastructure;
every process can be designed to be executed on diverse and many 'computing nodes' (e.g. R engines, Java);
easy integration of user-defined processes;
the system enact users to easily add their own algorithms to the set of those offered by the system, e.g. by Statistical Algorithms Importer;
rich array of ready to use processes;
the system is equipped with a large set of ready to use algorithms;
open science support
the system automatically provide for process repeatability and provenance by recording on the Workspace a comprehensive research object;

Subsystems

DataMiner / Statistical Manager
...
Ecological Modeling
...
Signal Processing
...
Geospatial Data Mining
...