PROVE as a Web Service - Indico

P-GRADE Portal:
Towards a User-friendly
Grid Environment
Tamas Kiss
Centre for Parallel Computing
University of Westminster
[email protected]
Gergely Sipos
MTA SZTAKI, Hungary
[email protected]
www.lpds.sztaki.hu/pgportal
[email protected]
1
Technology concerns of Grid
systems
• Fast evolution of Grid systems and
middleware:
– GT1, GT2, OGSA, GT3 (OGSI), GT4 (WSRF),
LCG-2, gLite, …
• Many Grid systems are built based on these
different technologies
– EGEE (LCG-2), UK NGS (GT2), Open Science
Grid (GT3), etc.
2
Grid systems for HPC –
User concerns
•
•
•
•
How to cope with the variety of Grid systems?
How to develop/create new Grid applications?
How to execute Grid applications?
How to observe the application execution in
the Grid?
• How to tackle performance issues?
• How to execute Grid applications over several
Grids in a transparent way?
P-GRADE Grid Portal gives you the answer!
3
Properties of the
P-GRADE Portal
• General purpose, workflow-oriented computational Grid
portal. Supports the development and execution of workflowbased Grid applications.
•
•
•
Support for multi-grid workflows
GridSphere-2 based
– Easy to expand with new portlets (e.g. application-specific portlets)
– Easy to tailor to end-user needs
Grid services supported by the portal:
Service
Job execution
File storage
LCG-2 specific grids
Globus-specific grids
Computing Element
GRAM
Storage Element
GSIFTP server
Certificate management
Information system
Brokering
MyProxy
BDII
MDS-2
Workload Management System
---
Job monitoring
Mercury
Workflow & job visualization
PROVE
4
What is a P-GRADE Portal workflow?
• a directed acyclic graph
where
– Nodes represent jobs
(executable batch programs)
– Ports represent input/output
files the jobs expect/ produce
– Arcs represent file transfer
between the jobs
• semantics of the
workflow:
– A job can be executed if all
of its input files are available
• local input files: on the
portal server
• remote input files: on
storage elements
5
Two levels of parallelism by a workflow
• The P-GRADE Portal
workflow concept enables
the efficient parallelization
of complex problems
• Semantics of the
workflow enables two
levels of parallelism:
– Parallel execution
inside a workflow
node
– Parallel execution
among workflow
nodes
Multiple jobs can
run parallel
The job can be a
parallel program
6
Ultra-short range weather forecast
(Hungarian Meteorology Service)
Forecasting dangerous
weather situations (storms,
fog, etc.), crucial task in the
protection of life and property
25 x
10 x
25 x
5x
Processed information:
surface level
measurements, highaltitude measurements,
radar, satellite, lightning,
results of previous
computed models
Requirements:
•Execution time < 10 min
•High resolution (1km)
7
The problem of current portals
• They tightly connected and tailored to only one particular
Grid (eg. NGS portal, NorduGrid portal)
• If the user wants to move to another Grid
–
–
–
–
–
(She has to obtain certificate for the new Grid)
She has to register for the new Grid
She has to get an account for its portal
She has to learn the new environment
She has to copy the grid files & modify the application
• P-GRADE Portal release 2.1 and above solve these
problems:
– (Obtain a certificate for the new Grid)
– Register for the new Grid
– Map some of the jobs of your workflow onto resources of this Grid
8
Multi-Grid P-GRADE Portal
Different jobs of a
EGEE Grid
workflow can be
executed in different e.g. VOCE
grids
P-GRADE-Portal
The portal can be
connected to multiple grids
UK NGS
London
Rome
Athens
9
The typical P-GRADE Portal scenario
Part 1 - development phase
Certificate
servers
SAVE
WORKFLOW
Portal
server
DEFINE GRID
ENVIRONMENT
Grid
services
OPEN
EDITOR
OPEN & EDIT
or DEVELOP
WORKFLOW
10
The typical P-GRADE Portal scenario
Part 2 - execution phase
Certificate
servers
TRANSFER FILES,
SUBMIT JOBS
DOWNLOAD
PROXY
CERTIFICATES
VISUALIZE
JOBS and
WORKFLOW
PROGRESS
Portal
server
MONITOR
JOBS
Grid
services
DOWNLOAD
RESULTS
DOWNLOAD
RESULTS
11
Developing workflows
with the P-GRADE
Portal
Main steps
1.
2.
Define the Grid environment
Define the workflow
12
The typical P-GRADE Portal scenario
Development phase – step 1:
Certificate
servers
DEFINE THE
GRID
ENVIRONMENT
Portal
server
Grid
services
13
Resource Manager
(settings portlet)
•
•
To define which computational resources
my workflows will use
Two levels:
1. Define grids or VOs  administrator
1. Name (e.g. gridats)
2. Information system (e.g. egrid-2.egrid.it)
2. Define Computational resources for each grid:
1. Automatically from information system (only from MDS-2)
2. Centrally by the administrator
3. Individually by each user
14
Resource Manager
(settings portlet – user view)
List of available grids
To define computational
resources for such a
grid
15
Resource Manager
(settings portlet – user view)
Every computational resource is
identified by a
• host name
• port number (or use default)
• local jobmanager (queue name)
e.g. egrid-3.egrid.it/jobmanager-fork
16
The typical P-GRADE Portal scenario
Development phase – step 2:
Certificate
servers
SAVE
WORKFLOW
Portal
server
Grid
services
OPEN
EDITOR
OPEN & EDIT
or DEVELOP
or IMPORT
WORKFLOW
17
Workflow development
opening the workflow editor
The editor is a Java Webstart application
dynamic download and installation!
18
Workflow Editor
defining the graph
•
The aim is to define a
DAG of batch jobs:
1. Drag & drop
components:
jobs and ports
2. Define their properties
3. Connect ports by
channels
(no cycles, no loops, no
conditions)
19
Workflow Editor
defining the jobs
Define the job:
•Executable file
•Executable type
•Number of required
processors
•command line params.
•The resource to be
used for the execution:
•Grid
•(Comp. resource)
20
Which resource to use?
I still don’t
know which
resource to
use!
The information
system portlet
helps characterize
resources!
21
Automatic resource selection
Since P-GRADE Portal v2.2
1. Describe the requirements of the job
2. Select a LCG-2 middleware based Grid
(e.g. VOCE) for it
3. The workflow manager will use the broker
of that Grid during the execution to find
the best resource for the job
22
Workflow Editor
defining jobs in v2.2
Select an LCG-2 based Grid
(*_LCG_2_BROKER)!
Ignore the resource field!
Define optional
requirements using
the built-in JDL editor!
23
Workflow Editor
JDL editor in v2.2
JDL  look at the LCG-2 Users’ manual!
24
Workflow Editor
defining the ports
Type:
input: the job requires
output: the job produces
File type:
local: from/to my desktop
remote: from/to a
storage resource
File:
location of the file
Storage type:
Permanent: final result of
the WF
Volatile: just inter-job data
transfer
25
Location of files
Input file
Output file
Local files
• Client side location:
• Client side location:
result.dat
c:\experiments\11-04.dat
• Grid Unique IDentifier (GUID):
guid:1fd75fdf-dccc-4603-998b-e17facb0d034
• LRS logical file name:
• LRS logical file name:
(RMC file catalog is required – VOCE)
lfn:/sipos_11_04.dat
• LFC logical file name
(LFC file catalog is reuired – Egrid, Hungrid)
lfn:/grid/egrid/sipos/11-04.dat
(RMC file catalog is required – VOCE)
lfn:/sipos_11_04_-_result.dat
• LFC logical file name
(LFC file catalog is reuired – Egrid, Hungrid)
lfn:/grid/egrid/sipos/11-04_-_result.dat
Remote files
26
Local vs. remote files
Grid
services
LOCAL INPUT
FILES
LOCAL INPUT
FILES
&
EXECUTABLES
Portal
server
LOCAL
OUTPUT
FILES
&
EXECUTABLES
Storage
resources
REMOTE
INPUT
FILES
LOCAL
OUTPUT
FILES
REMOTE
OUTPUT
FILES
Comp.
resources
Only the
permanent
files!
27
Workflow Editor
saving the workflow
Workflow has been defined!
Let’s execute it!
28
Executing workflows
with the P-GRADE
Portal
Main steps
1.
2.
3.
4.
5.
Download proxies
Submit workflow
Observe workflow progress
If some error occurs correct the graph
Download result
29
The typical P-GRADE Portal scenario
Execution phase – step 1:
Certificate
servers
DOWNLOAD
PROXY
CERTIFICATES
Portal
server
Grid
services
30
Certificate Manager
certificates portlet
• To access
GSI-based Grids the
portal server
application needs
proxy certificates
• “Certificates” portlet:
• to upload X.509
certificates into
MyProxy servers
• to download
short-term proxy
credentials into
the portal server
application
31
Certificate Manager
downloading a proxy
1.
2.
MyProxy server access details:
•
Hostname (egrid-1.egrid.it)
•
Port number (7512)
•
User name (from upload)
•
Password (from upload)
Proxy parameters:
•
Lifetime
•
Comment
32
Certificate Manager
associating the proxy with a grid
This operation
displays the details
of the certificate and
the list of available
Grids
33
Certificate Manager
browsing proxies
Multiple proxies can
be available on the
portal server at the
same time!
Comp. resources of SEE-GRID
Comp. resources of HUNGRID
34
The typical P-GRADE Portal scenario
Execution phase - step 2:
Certificate
servers
TRANSFER FILES,
SUBMIT JOBS
Portal
server
Grid
services
35
Workflow Management
(workflow portlet)
• The portlet presents the status, size and output of the available workflow
in the “Workflow” list
• It has a Quota manager to control the users’ storage space on the server
• The portlet also contains the “Abort”, “Attach”, “Details”, “Delete” and
“Delete all” buttons to handle execution of workflows
• The “Attach” button opens the workflow in the Workflow Editor
• The “Details” button gives an overview about the jobs of the workflow
36
Workflow Execution
(observation by the workflow portlet)
White/Red/Green color means the job is initial/running/finished state
37
Workflow Execution
(observation by the workflow portlet)
White/Red/Green color means the job is initialised/running/finished
38
Workflow Execution
I still don’t
know what’s
happening
inside my
workflow!
39
The typical P-GRADE Portal scenario
Execution phase – step 3:
Certificate
servers
VISUALIZE
JOBS and
WORKFLOW
PROGRESS
Portal
server
MONITOR
JOBS
Grid
services
40
On-Line Monitoring both at the
workflow and job levels (workflow portlet)
- The portal
monitors and
displays
workflows
41
On-Line Monitoring both at the
workflow and job levels (workflow portlet)
- The portal also
monitors and
visualizes
parallel jobs
(if they were developed
with the P-GRADE
Environment)
- The portal also
generates a
statistical view
42
Rescuing a failed workflow 1.
(from v2.2)
A job failed during
workflow
execution
Read the error log
to know why
43
Rescuing a failed workflow 2.
(from v2.2)
Don’t touch the
finished jobs!
Map the failed job
onto a different
resource or
download a new
proxy for it.
The execution
can continue
from the point
of failure!
44
The typical P-GRADE Portal scenario
Execution phase – step 5
Certificate
servers
Portal
server
Grid
services
DOWNLOAD
RESULTS
DOWNLOAD
RESULTS
45
Downloading the results…
46
New features in P-GRADE Portal 2.3
• Every user has a storage quota on the
portal server
• Workflows and traces can be exported
from the portal server onto your client
machine
• Workflows and traces can be imported
from your client machine into the Portal
47
Workflow/trace export/import
To export a
workflow from
the portal onto
your machine
To delete every
unnecessary
files of the
workflow
To delete
trace/output of
the workflow
(if any)
48
Grid systems for HPC –
User concerns
•
•
•
•
How to cope with the variety of Grid systems?
How to develop/create new Grid applications?
How to execute Grid applications?
How to observe the application execution in
the Grid?
• How to tackle performance issues?
• How to execute Grid applications over several
Grids in a transparent way?
49
References
• Official portal of
– SEE-GRID infrastructure
– VOCE infrastructure
– HUNGRID infrastructure
• P-GRADE portal is available as service for:
– Croatian Grid
– UK National Grid Service
– EGrid (Italy)
50
How to access P-GRADE portal?
• If you are interested in using P-GRADE Portal:
– Take a look at www.lpds.sztaki.hu/pgportal
(slideshows, manuals, etc.)
– Get an account for one of its production installations:
•
•
•
•
•
VOCE portal - SZTAKI
SEEGRID portal – SZTAKI
HUNGrid portal – SZTAKI
NGS portal – University of Westminster
Egrid portal – ICTP
– If you are the administrator of a Globus/LCG-2 based Grid/VO
then ask SZTAKI to install the P-GRADE Portal for you!
– If you know the administrator of a P-GRADE Portal you can ask
him/her to give access to your Grid through his/her portal
installation! (Multi-Grid portal!)
51
What more we can offer
• GEMLCA-specific P-GRADE Portal:
– Share jobs and legacy applications with other members of your
community
– Portal service for the UK NGS www.cpc.wmin.ac.uk/ngsportal
– LCG-2 specific version is also available
– Lecture and demo tomorrow at 14:30-15:15
• Collaborative P-GRADE Portal:
– Develop workflows with your colleagues in a concurrent fashion
– Execute different parts of the workflow with different users’
certificates
– Will be available in 2006
52
Final conclusions
• P-GRADE portal provides:
– Easy-to-use workflow concept for solving complex problems
– Fast development of Grid applications
– Integrating various components into large Grid applications:
• Sequential codes
• MPI codes
• (Legacy codes  GEMLCA-specific P-GRADE Portal)
–
–
–
–
Application monitoring, performance visualization, guarantee correctness
Interoperability between different Grid systems can be solved
Simultaneous execution of application components in different Grids
Easy to port applications among Grids
(Switching between Grid technologies will be transparent to the end-user)
• Learn once, use everywhere
• Develop once, execute anywhere
53
Thank you!
www.lpds.sztaki.hu/pgportal
[email protected]
54