P-GRADE Portal: Towards a User-friendly Grid Environment Tamas Kiss Centre for Parallel Computing University of Westminster [email protected] Gergely Sipos MTA SZTAKI, Hungary [email protected] www.lpds.sztaki.hu/pgportal [email protected] 1 Technology concerns of Grid systems • Fast evolution of Grid systems and middleware: – GT1, GT2, OGSA, GT3 (OGSI), GT4 (WSRF), LCG-2, gLite, … • Many Grid systems are built based on these different technologies – EGEE (LCG-2), UK NGS (GT2), Open Science Grid (GT3), etc. 2 Grid systems for HPC – User concerns • • • • How to cope with the variety of Grid systems? How to develop/create new Grid applications? How to execute Grid applications? How to observe the application execution in the Grid? • How to tackle performance issues? • How to execute Grid applications over several Grids in a transparent way? P-GRADE Grid Portal gives you the answer! 3 Properties of the P-GRADE Portal • General purpose, workflow-oriented computational Grid portal. Supports the development and execution of workflowbased Grid applications. • • • Support for multi-grid workflows GridSphere-2 based – Easy to expand with new portlets (e.g. application-specific portlets) – Easy to tailor to end-user needs Grid services supported by the portal: Service Job execution File storage LCG-2 specific grids Globus-specific grids Computing Element GRAM Storage Element GSIFTP server Certificate management Information system Brokering MyProxy BDII MDS-2 Workload Management System --- Job monitoring Mercury Workflow & job visualization PROVE 4 What is a P-GRADE Portal workflow? • a directed acyclic graph where – Nodes represent jobs (executable batch programs) – Ports represent input/output files the jobs expect/ produce – Arcs represent file transfer between the jobs • semantics of the workflow: – A job can be executed if all of its input files are available • local input files: on the portal server • remote input files: on storage elements 5 Two levels of parallelism by a workflow • The P-GRADE Portal workflow concept enables the efficient parallelization of complex problems • Semantics of the workflow enables two levels of parallelism: – Parallel execution inside a workflow node – Parallel execution among workflow nodes Multiple jobs can run parallel The job can be a parallel program 6 Ultra-short range weather forecast (Hungarian Meteorology Service) Forecasting dangerous weather situations (storms, fog, etc.), crucial task in the protection of life and property 25 x 10 x 25 x 5x Processed information: surface level measurements, highaltitude measurements, radar, satellite, lightning, results of previous computed models Requirements: •Execution time < 10 min •High resolution (1km) 7 The problem of current portals • They tightly connected and tailored to only one particular Grid (eg. NGS portal, NorduGrid portal) • If the user wants to move to another Grid – – – – – (She has to obtain certificate for the new Grid) She has to register for the new Grid She has to get an account for its portal She has to learn the new environment She has to copy the grid files & modify the application • P-GRADE Portal release 2.1 and above solve these problems: – (Obtain a certificate for the new Grid) – Register for the new Grid – Map some of the jobs of your workflow onto resources of this Grid 8 Multi-Grid P-GRADE Portal Different jobs of a EGEE Grid workflow can be executed in different e.g. VOCE grids P-GRADE-Portal The portal can be connected to multiple grids UK NGS London Rome Athens 9 The typical P-GRADE Portal scenario Part 1 - development phase Certificate servers SAVE WORKFLOW Portal server DEFINE GRID ENVIRONMENT Grid services OPEN EDITOR OPEN & EDIT or DEVELOP WORKFLOW 10 The typical P-GRADE Portal scenario Part 2 - execution phase Certificate servers TRANSFER FILES, SUBMIT JOBS DOWNLOAD PROXY CERTIFICATES VISUALIZE JOBS and WORKFLOW PROGRESS Portal server MONITOR JOBS Grid services DOWNLOAD RESULTS DOWNLOAD RESULTS 11 Developing workflows with the P-GRADE Portal Main steps 1. 2. Define the Grid environment Define the workflow 12 The typical P-GRADE Portal scenario Development phase – step 1: Certificate servers DEFINE THE GRID ENVIRONMENT Portal server Grid services 13 Resource Manager (settings portlet) • • To define which computational resources my workflows will use Two levels: 1. Define grids or VOs administrator 1. Name (e.g. gridats) 2. Information system (e.g. egrid-2.egrid.it) 2. Define Computational resources for each grid: 1. Automatically from information system (only from MDS-2) 2. Centrally by the administrator 3. Individually by each user 14 Resource Manager (settings portlet – user view) List of available grids To define computational resources for such a grid 15 Resource Manager (settings portlet – user view) Every computational resource is identified by a • host name • port number (or use default) • local jobmanager (queue name) e.g. egrid-3.egrid.it/jobmanager-fork 16 The typical P-GRADE Portal scenario Development phase – step 2: Certificate servers SAVE WORKFLOW Portal server Grid services OPEN EDITOR OPEN & EDIT or DEVELOP or IMPORT WORKFLOW 17 Workflow development opening the workflow editor The editor is a Java Webstart application dynamic download and installation! 18 Workflow Editor defining the graph • The aim is to define a DAG of batch jobs: 1. Drag & drop components: jobs and ports 2. Define their properties 3. Connect ports by channels (no cycles, no loops, no conditions) 19 Workflow Editor defining the jobs Define the job: •Executable file •Executable type •Number of required processors •command line params. •The resource to be used for the execution: •Grid •(Comp. resource) 20 Which resource to use? I still don’t know which resource to use! The information system portlet helps characterize resources! 21 Automatic resource selection Since P-GRADE Portal v2.2 1. Describe the requirements of the job 2. Select a LCG-2 middleware based Grid (e.g. VOCE) for it 3. The workflow manager will use the broker of that Grid during the execution to find the best resource for the job 22 Workflow Editor defining jobs in v2.2 Select an LCG-2 based Grid (*_LCG_2_BROKER)! Ignore the resource field! Define optional requirements using the built-in JDL editor! 23 Workflow Editor JDL editor in v2.2 JDL look at the LCG-2 Users’ manual! 24 Workflow Editor defining the ports Type: input: the job requires output: the job produces File type: local: from/to my desktop remote: from/to a storage resource File: location of the file Storage type: Permanent: final result of the WF Volatile: just inter-job data transfer 25 Location of files Input file Output file Local files • Client side location: • Client side location: result.dat c:\experiments\11-04.dat • Grid Unique IDentifier (GUID): guid:1fd75fdf-dccc-4603-998b-e17facb0d034 • LRS logical file name: • LRS logical file name: (RMC file catalog is required – VOCE) lfn:/sipos_11_04.dat • LFC logical file name (LFC file catalog is reuired – Egrid, Hungrid) lfn:/grid/egrid/sipos/11-04.dat (RMC file catalog is required – VOCE) lfn:/sipos_11_04_-_result.dat • LFC logical file name (LFC file catalog is reuired – Egrid, Hungrid) lfn:/grid/egrid/sipos/11-04_-_result.dat Remote files 26 Local vs. remote files Grid services LOCAL INPUT FILES LOCAL INPUT FILES & EXECUTABLES Portal server LOCAL OUTPUT FILES & EXECUTABLES Storage resources REMOTE INPUT FILES LOCAL OUTPUT FILES REMOTE OUTPUT FILES Comp. resources Only the permanent files! 27 Workflow Editor saving the workflow Workflow has been defined! Let’s execute it! 28 Executing workflows with the P-GRADE Portal Main steps 1. 2. 3. 4. 5. Download proxies Submit workflow Observe workflow progress If some error occurs correct the graph Download result 29 The typical P-GRADE Portal scenario Execution phase – step 1: Certificate servers DOWNLOAD PROXY CERTIFICATES Portal server Grid services 30 Certificate Manager certificates portlet • To access GSI-based Grids the portal server application needs proxy certificates • “Certificates” portlet: • to upload X.509 certificates into MyProxy servers • to download short-term proxy credentials into the portal server application 31 Certificate Manager downloading a proxy 1. 2. MyProxy server access details: • Hostname (egrid-1.egrid.it) • Port number (7512) • User name (from upload) • Password (from upload) Proxy parameters: • Lifetime • Comment 32 Certificate Manager associating the proxy with a grid This operation displays the details of the certificate and the list of available Grids 33 Certificate Manager browsing proxies Multiple proxies can be available on the portal server at the same time! Comp. resources of SEE-GRID Comp. resources of HUNGRID 34 The typical P-GRADE Portal scenario Execution phase - step 2: Certificate servers TRANSFER FILES, SUBMIT JOBS Portal server Grid services 35 Workflow Management (workflow portlet) • The portlet presents the status, size and output of the available workflow in the “Workflow” list • It has a Quota manager to control the users’ storage space on the server • The portlet also contains the “Abort”, “Attach”, “Details”, “Delete” and “Delete all” buttons to handle execution of workflows • The “Attach” button opens the workflow in the Workflow Editor • The “Details” button gives an overview about the jobs of the workflow 36 Workflow Execution (observation by the workflow portlet) White/Red/Green color means the job is initial/running/finished state 37 Workflow Execution (observation by the workflow portlet) White/Red/Green color means the job is initialised/running/finished 38 Workflow Execution I still don’t know what’s happening inside my workflow! 39 The typical P-GRADE Portal scenario Execution phase – step 3: Certificate servers VISUALIZE JOBS and WORKFLOW PROGRESS Portal server MONITOR JOBS Grid services 40 On-Line Monitoring both at the workflow and job levels (workflow portlet) - The portal monitors and displays workflows 41 On-Line Monitoring both at the workflow and job levels (workflow portlet) - The portal also monitors and visualizes parallel jobs (if they were developed with the P-GRADE Environment) - The portal also generates a statistical view 42 Rescuing a failed workflow 1. (from v2.2) A job failed during workflow execution Read the error log to know why 43 Rescuing a failed workflow 2. (from v2.2) Don’t touch the finished jobs! Map the failed job onto a different resource or download a new proxy for it. The execution can continue from the point of failure! 44 The typical P-GRADE Portal scenario Execution phase – step 5 Certificate servers Portal server Grid services DOWNLOAD RESULTS DOWNLOAD RESULTS 45 Downloading the results… 46 New features in P-GRADE Portal 2.3 • Every user has a storage quota on the portal server • Workflows and traces can be exported from the portal server onto your client machine • Workflows and traces can be imported from your client machine into the Portal 47 Workflow/trace export/import To export a workflow from the portal onto your machine To delete every unnecessary files of the workflow To delete trace/output of the workflow (if any) 48 Grid systems for HPC – User concerns • • • • How to cope with the variety of Grid systems? How to develop/create new Grid applications? How to execute Grid applications? How to observe the application execution in the Grid? • How to tackle performance issues? • How to execute Grid applications over several Grids in a transparent way? 49 References • Official portal of – SEE-GRID infrastructure – VOCE infrastructure – HUNGRID infrastructure • P-GRADE portal is available as service for: – Croatian Grid – UK National Grid Service – EGrid (Italy) 50 How to access P-GRADE portal? • If you are interested in using P-GRADE Portal: – Take a look at www.lpds.sztaki.hu/pgportal (slideshows, manuals, etc.) – Get an account for one of its production installations: • • • • • VOCE portal - SZTAKI SEEGRID portal – SZTAKI HUNGrid portal – SZTAKI NGS portal – University of Westminster Egrid portal – ICTP – If you are the administrator of a Globus/LCG-2 based Grid/VO then ask SZTAKI to install the P-GRADE Portal for you! – If you know the administrator of a P-GRADE Portal you can ask him/her to give access to your Grid through his/her portal installation! (Multi-Grid portal!) 51 What more we can offer • GEMLCA-specific P-GRADE Portal: – Share jobs and legacy applications with other members of your community – Portal service for the UK NGS www.cpc.wmin.ac.uk/ngsportal – LCG-2 specific version is also available – Lecture and demo tomorrow at 14:30-15:15 • Collaborative P-GRADE Portal: – Develop workflows with your colleagues in a concurrent fashion – Execute different parts of the workflow with different users’ certificates – Will be available in 2006 52 Final conclusions • P-GRADE portal provides: – Easy-to-use workflow concept for solving complex problems – Fast development of Grid applications – Integrating various components into large Grid applications: • Sequential codes • MPI codes • (Legacy codes GEMLCA-specific P-GRADE Portal) – – – – Application monitoring, performance visualization, guarantee correctness Interoperability between different Grid systems can be solved Simultaneous execution of application components in different Grids Easy to port applications among Grids (Switching between Grid technologies will be transparent to the end-user) • Learn once, use everywhere • Develop once, execute anywhere 53 Thank you! www.lpds.sztaki.hu/pgportal [email protected] 54
© Copyright 2026 Paperzz