
PL-Grid: Polish Infrastructure for
Supporting Computational Science in the
European Research Space
RI / e-Infrastructure coexistence
PL-Grid case study
Jacek Kitowski
ACK Cyfronet AGH
Competence Centre for Cloud and Grid Computing
PL-Grid Consortium
DI4R Conference, September 28-30, 2016, Krakow
e-Infrastructure Creation
Issues and assumptions
Synergistic effort in several dimensions:
Meeting user demands in the field of grand-challenge applications
Activity supported by users with scientific achievements and by
well-defined requirements
Organizational:
horizontal perspective - federation of computer
centres supporting the e-infrastructure with
different kinds of resources and competences
vertical perspective - involvement of computer,
computational and domain-specific experts in
e-infrastructure creation and operations
Technological:
different computing hardware and software
various middleware environments
Energy awareness:
scheduling strategies that distribute computing jobs across federation resources
so as to minimize overall energy consumption
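As an illustration only, energy-aware scheduling across a federation can be sketched as a greedy assignment: each job goes to the site with enough free cores where its estimated energy cost is lowest. This is a simple heuristic rather than an optimal strategy, and it is not the scheduler used in PL-Grid. The site names, power figures and the cost model (cores x hours x watts per core x PUE) are hypothetical assumptions.

```python
from dataclasses import dataclass


@dataclass
class Site:
    name: str              # hypothetical site label
    free_cores: int
    watts_per_core: float  # assumed average IT power per busy core
    pue: float             # power usage effectiveness of the data centre


@dataclass
class Job:
    name: str
    cores: int
    hours: float


def energy_kwh(job, site):
    """Estimated facility energy for running a job on a site (assumed model)."""
    return job.cores * job.hours * site.watts_per_core * site.pue / 1000.0


def schedule(jobs, sites):
    """Greedy heuristic: place the largest jobs first on the cheapest site that fits."""
    plan = {}
    for job in sorted(jobs, key=lambda j: j.cores * j.hours, reverse=True):
        candidates = [s for s in sites if s.free_cores >= job.cores]
        if not candidates:
            plan[job.name] = None  # no capacity anywhere; job stays queued
            continue
        best = min(candidates, key=lambda s: energy_kwh(job, s))
        best.free_cores -= job.cores
        plan[job.name] = best.name
    return plan


# Hypothetical federation of two sites and three jobs.
sites = [Site("site_a", 100, 10.0, 1.06), Site("site_b", 200, 12.0, 1.5)]
jobs = [Job("j1", 80, 2.0), Job("j2", 50, 1.0), Job("j3", 300, 1.0)]
plan = schedule(jobs, sites)
```

A production scheduler would also have to account for queue waiting times, data locality and time-varying power prices, which this sketch ignores.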
RI / e-Infrastructure Development
coordinated by Cyfronet
PL-Grid (2009–2012)
Outcome: Common base infrastructure
PLGrid PLUS (2011–2015)
Outcome:
Focus on users (training, helpdesk…)
Domain-specific solutions: 13
PLGrid NG (2014–2015)
Outcome:
Optimization of resource usage, training
Extension of domain-specific solutions by: 14
PLGrid CORE (2014–2015)
Outcome: Competence Center
Open Science paradigm
(large workflow app., data farming, mass comp., ……)
End-user services
Real users
Research Infrastructures vs. e-Infrastructures (development issues)
Domain grids
Synergy between domain-specific researchers and IT experts.
Just partnership.
Ecosystem for doing science, built on these pillars:
software
hardware
expertise (from communities)
community involvement in all activities
helpdesk (efficiency in operation), training, marketing
financial issues
Prometheus Cluster (2014/2015)
Rpeak = 2349 TFlops
52 728 Haswell cores
144 Nvidia K40 XL
HP Apollo 8000
#39 on the TOP500 list (November 2015)
PUE=1.06
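For context, PUE (power usage effectiveness) is the ratio of total facility power to the power drawn by the IT equipment, so a PUE of 1.06 means roughly 6% extra power for cooling and other overhead on top of the IT load. A minimal sketch of that arithmetic follows; the 1000 kW IT load is a hypothetical figure chosen for easy reading, not the measured consumption of Prometheus.

```python
def facility_power_kw(it_power_kw, pue):
    """Total facility power implied by an IT load and a PUE value."""
    return it_power_kw * pue


def overhead_power_kw(it_power_kw, pue):
    """Power spent on cooling, power distribution, etc. (everything but IT)."""
    return it_power_kw * (pue - 1.0)


# Hypothetical IT load, used only to illustrate the ratio.
it_load_kw = 1000.0
total_kw = facility_power_kw(it_load_kw, 1.06)
overhead_kw = overhead_power_kw(it_load_kw, 1.06)
```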
Publications
How to better coordinate service delivery
across RIs?
Necessary conditions (technologies, software systems, environments, etc.)
Sufficient conditions (horizontal collaborations)
Global Data Storage Solution for Science
A system that provides unified and efficient access
to data stored in organizationally distributed
environments.
Onedata enables seamless sharing of data between
users, with strict access control.
Onedata is a distributed system
divided into zones (or federations),
which are created by deploying a
dedicated service.
Zones are independent Onedata deployments
with no relation to other federations.
All data stored in Onedata is
organized into Spaces, which can
be seen as virtual folders or drives
that can contain an arbitrary
subdirectory and file hierarchy
while being distributed across
multiple storage providers.
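The idea of a Space as one logical directory tree whose files live with different storage providers, guarded by access control, can be sketched with a toy data model. This is not the Onedata API: the class, its methods, the permission names and the provider labels are all hypothetical and only illustrate the concept described above.

```python
from dataclasses import dataclass, field


@dataclass
class Space:
    """Toy model of a Space: one logical tree, data spread over providers."""
    name: str
    acl: dict = field(default_factory=dict)    # user -> set of permissions
    files: dict = field(default_factory=dict)  # logical path -> provider label

    def _check(self, user, perm):
        if perm not in self.acl.get(user, set()):
            raise PermissionError(f"{user} lacks '{perm}' on space {self.name}")

    def put(self, user, path, provider):
        """Register a file under a logical path, stored at the given provider."""
        self._check(user, "write")
        self.files[path] = provider

    def locate(self, user, path):
        """Return the provider that holds the file's data."""
        self._check(user, "read")
        return self.files[path]

    def listdir(self, user, prefix):
        """List all logical paths below a directory, whatever provider holds them."""
        self._check(user, "read")
        prefix = prefix.rstrip("/") + "/"
        return sorted(p for p in self.files if p.startswith(prefix))


# Hypothetical users and provider labels.
space = Space("climate", acl={"alice": {"read", "write"}, "bob": {"read"}})
space.put("alice", "/runs/2016/a.nc", "provider_1")
space.put("alice", "/runs/2016/b.nc", "provider_2")
listing = space.listdir("bob", "/runs")
```

Here bob sees a single directory listing even though the two files are held by different providers, and an attempt by bob to call put raises PermissionError.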