Data Retrieval Facilities

From Gcube Wiki
Revision as of 19:53, 2 March 2012 by Vassilis.verroios (Talk | contribs) (Key Features)

Jump to: navigation, search

Overview

gCube provides Information Retrieval facilities over large heterogeneous environments. Sources of information that use different technologies, data representation and semantics can be integrated and exploited by gCube's Data Retrieval framework. The architecture and mechanisms provided by the framework ensure flexibility, scalability, high performance and availability.

The gCube Data Retrieval Framework aims at hiding the complexity of the underlying environment by:

  •  providing a declarative approach for querying the hosted information
  •  scaling to the number of hosted information sources
  •  Integrating dynamically external sources of information

Key Features

Declarative Query Language over a heterogeneous environment

gCube Data Retrieval framework unifies Data Sources that use different data representation and semantics through the http://www.loc.gov/standards/sru/specs/cql.html standard.

On the fly integration of Data Sources
Context ...
Scalability in the number of Data Sources
Context ...
Direct Integration of External Information Providers
Context ...
Indexing Capabilities for Replication and High Availability
Context ...
Distributed Execution Environment offering High Performance and Flexibility
Context ...

Subsystems

Data Retrieval framework comprises the following two subsystems:

Search Planning and Execution Specification

Data Sources Specification