Common-accounting-model ABANDONED

Scope

This library contains the definition of the resource accounting record.

Data-model

The structure of a generic accounting record (Usage Record, UR) will be composed of a set of common fields for all resource types, in particular:

id : a unique identifier for the UR
consumerId : the user actually consuming the resource (optional, for future purposes)
createTime : when the UR was created
startTime, endTime : the time window the UR refers to
resourceType : the type of resource the UR tracks
scope : the scope of the resource
resourceOwner : who owns the resource and/or who creates the UR

Furthermore, for each UR there will be a section to be filled with the specific properties per resource type (key-value pairs).
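
The record layout described above can be sketched as a small data structure. This is only an illustration, not the library's actual definition: the class name UsageRecord, the snake_case attribute names, the UUID-based id, the UTC createTime and the startTime/endTime ordering check are all assumptions made for the example.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone
from typing import Dict, Optional
import uuid


@dataclass
class UsageRecord:
    """Generic UR: common fields plus a key-value section per resource type."""
    resource_type: str                  # resourceType, e.g. "Storage"
    scope: str                          # scope of the resource
    resource_owner: str                 # resourceOwner
    start_time: datetime                # startTime of the window the UR refers to
    end_time: datetime                  # endTime of the window the UR refers to
    consumer_id: Optional[str] = None   # consumerId (optional)
    # Assumption: a random UUID is used as the unique identifier.
    id: str = field(default_factory=lambda: str(uuid.uuid4()))
    # Assumption: createTime is taken in UTC when the object is built.
    create_time: datetime = field(
        default_factory=lambda: datetime.now(timezone.utc)
    )
    # Resource-type-specific properties (key-value pairs).
    properties: Dict[str, str] = field(default_factory=dict)

    def __post_init__(self) -> None:
        # Assumption: a time window that ends before it starts is rejected.
        if self.end_time < self.start_time:
            raise ValueError("endTime precedes startTime")


# Hypothetical values, for illustration only.
record = UsageRecord(
    resource_type="Storage",
    scope="/example/scope",
    resource_owner="example-owner",
    start_time=datetime(2013, 1, 1, 10, 0, tzinfo=timezone.utc),
    end_time=datetime(2013, 1, 1, 10, 5, tzinfo=timezone.utc),
)
record.properties["operationType"] = "PUT"
```

Keeping the type-specific values in a plain dictionary mirrors the key-value section mentioned above, so a new resource type does not require a change to the common fields.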

Resource Types

The resource types we've identified are: Execution, Service, Data-access and Storage.

Execution

The Execution resource type has two sub-types, reflecting the structure of PE2ng, which is composed of two main layers. The Workflow layer is the more abstract one: it constructs workflow plans, supports various adaptors and is aware of jobs as a whole. The Execution layer, which is also a Service, is where the actual execution takes place, and it is aware of finer-grained details.

The two layers differ in what they are aware of:

Workflow layer is aware of:

Number of jobs submitted and adaptors that were used
Execution nodes that will be used (scale out) per job

Execution layer is aware of:

Statuses of execution jobs (success/fail/pending)

In addition, gHN hosting node information for every execution node is available to the Workflow layer, harvested through the Registry. It contains info such as location, CPU load (week, day, hour, ...), memory and disk space.

Plan

Specific Plan properties:

cores : the number of VM cores occupied, which depends on whether the process is multithreaded.
inputFilesNumber : this info could be extracted at the workflow layer.
inputFilesSize : not known at the workflow layer before execution starts, as files are transferred from different sources. Available at the execution layer.
jobId, jobName, jobStart, jobEnd, jobStatus : this info could be extracted from the progress report of a job, or directly from every execution engine at the execution layer.
outputFilesNumber, outputFilesSize : same as for the input files.
overallNetworkIn, overallNetworkOut : depends on process demands.
processors : number of processors used per job.
wallDuration : duration between the instant the job started running and the instant the job ended its execution.
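
As an illustration of how wallDuration relates to jobStart and jobEnd, the sketch below derives it from the two timestamps and fills the key-value section of a Plan UR. The helper names, the use of seconds as the unit and the ISO 8601 timestamp strings are assumptions; the page does not specify any of them.

```python
from datetime import datetime
from typing import Dict


def wall_duration_seconds(job_start: datetime, job_end: datetime) -> float:
    """Time between the instant the job started and the instant it ended.

    Assumption: the duration is expressed in seconds.
    """
    if job_end < job_start:
        raise ValueError("jobEnd precedes jobStart")
    return (job_end - job_start).total_seconds()


def plan_properties(job_id: str, job_name: str, job_start: datetime,
                    job_end: datetime, job_status: str) -> Dict[str, str]:
    """Build part of the key-value section of a Plan UR (hypothetical helper)."""
    return {
        "jobId": job_id,
        "jobName": job_name,
        "jobStart": job_start.isoformat(),  # assumption: ISO 8601 strings
        "jobEnd": job_end.isoformat(),
        "jobStatus": job_status,
        "wallDuration": str(wall_duration_seconds(job_start, job_end)),
    }


# Hypothetical job that ran for two and a half minutes.
props = plan_properties(
    "job-1", "example-job",
    datetime(2013, 1, 1, 10, 0, 0),
    datetime(2013, 1, 1, 10, 2, 30),
    "success",
)
print(props["wallDuration"])  # 150.0
```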

Execution Engine

Specific Execution Engine properties:

refHost : hostname of the vm
refVM : Execution Engine resource id or gHN id
usageStart : the earliest usage time of the Execution Engine
usageEnd : the latest usage time of the Execution Engine
usagePhase : Completed/Ready/Paused/Running/Cancel

Service

Specific service attributes

callerIP :
invocationCount :
averageInvocationTime :
serviceClass :
serviceName :

Data-access

to do

Storage

Specific storage attributes

operationType : GET, PUT (update or new file), DELETE
targetFile : remote full path of the storage resource
fileDimension : size of the storage resource
serviceClass: service class used by the client of the storage library at the initialization time of the library
serviceName: service name used by the client of the storage library at the initialization time of the library
hostname: hostname of the host where the storage library is invoked
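
The storage attributes listed above can be collected into the key-value section of a UR along the following lines. This is a minimal sketch under stated assumptions: the helper name storage_properties is hypothetical, fileDimension is assumed to be a non-negative number of bytes, and operation names are assumed to be case-insensitive on input.

```python
from typing import Dict

# The three operation types named in the attribute list.
ALLOWED_OPERATIONS = {"GET", "PUT", "DELETE"}


def storage_properties(operation_type: str, target_file: str,
                       file_dimension: int, service_class: str,
                       service_name: str, hostname: str) -> Dict[str, str]:
    """Build the key-value section of a Storage UR (hypothetical helper)."""
    operation = operation_type.upper()
    if operation not in ALLOWED_OPERATIONS:
        raise ValueError(f"unsupported operationType: {operation_type}")
    # Assumption: fileDimension is a size in bytes and cannot be negative.
    if file_dimension < 0:
        raise ValueError("fileDimension must not be negative")
    return {
        "operationType": operation,
        "targetFile": target_file,
        "fileDimension": str(file_dimension),
        "serviceClass": service_class,
        "serviceName": service_name,
        "hostname": hostname,
    }


# Hypothetical values, for illustration only.
example = storage_properties(
    "put", "/example/path/file.txt", 2048,
    "ExampleClass", "ExampleService", "node1.example.org",
)
```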
