
Enabling Grids for E-sciencE

Introduction to EGEE and gLite middleware

Giuseppe Andronico

April 18, 2009 -- Taipei


INFN Sez. CT

www.eu-egee.org

EGEE-II INFSO-RI-031688
Contents

• EGEE status, history and perspectives


• Grid services: gLite 3.1
– Introduction to basic systems
– Main components
– Component interaction

• Sources of further information

• Quality:
– Monitoring via Nagios: distributed via official releases, configured through YAIM, integrated with other tools
– Gradual implementation of Service Level Agreement with sites
– Result: 85% of sites are now above the 75% availability threshold

• Geographical expansion:
– Production sites now span the Asia-Pacific region: Australia, China, India, Japan, Korea, Malaysia, Pakistan, Taiwan, Thailand
– In certification: Indonesia, Philippines, Vietnam

~280 sites
45 countries
>80,000 CPUs
>20 PetaBytes
>14,000 users
>250,000 jobs/day

Production & Volunteer grids

Applications database

• 61 registered applications so far


• Joint initiative between EGEE and SEE-GRID-SCI
• http://appdb.eu-egee.org/ & http://grid.ct.infn.it/egee_applications/

Readiness status:
7 HandledByRS
6 In production
5 Ready for deployment
4 Ready for validation
3 Ready for portal interface
2 Ready for gLite middleware
1 Ready for standalone use / running in a local cluster
EGEE→EGI transition planning

Grid Security

Users in many locations and organisations

Access services (User Interface): log on, upload credentials, run middleware (m/w) commands

Grid Services
Built on Grid Security Infrastructure (GSI)
Encryption and Data Integrity, Authentication and Authorization

Gate Keeping: Identify users and give permissions

Data Storage, Software, Computation, High Level Service

Resources in many locations and organisations

Introductory concepts

• Computational resources: machines where a user can execute a program (“job”) and store or retrieve files
• Job: a computational task (an executable, a script) that the user wants to execute on the Grid, with the results finally returned to the user’s machine
• VO (Virtual Organisation): people belonging to different institutions but working on similar problems, sharing resources, applications, data, …

Grid Concepts

User Interface     User Interface
Grid services

Extendable Infrastructure

gLite decomposition

• Access: CLI, API
• Security Services: Authorization, Authentication, Auditing
• Information & Monitoring Services: Information & Monitoring, Job Monitoring
• Data Services: Metadata Catalog, File & Replica Catalog, Storage Element, Data Movement
• Job Mgmt. Services: Job Provenance, Package Manager, Accounting, Computing Element, Workload Management
Overview paper http://doc.cern.ch//archive/electronic/egee/tr/egee-tr-2006-001.pdf
Workload Management

Users

How do I run a job on a Compute Element (CE)? (CE = batch queue)

Tools to:
• Submit jobs to a CE
• Monitor jobs
• Get outputs
• Transfer files to a CE
• Transfer files between CE and SE

Resources: compute elements, network resources, data storage

WMS

Information System (IS)

Users

Which resources can I use?

Each resource publishes:
• Configuration
• Latest status
• Access information

Resources: compute elements, network resources, data storage

Information System

Hierarchical information (Globus MDS), from top to bottom:
• Consumers: user applications, WMS, monitoring services
• VO level: BDII (gilda)
• Site level: GIIS, one per site: INFN sez. CT, Merida (gilda), other (gilda)
• Resource level: GRISes, reporting to each site’s GIIS
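The hierarchy on this slide can be pictured as a chain of aggregators: each GRIS publishes data about one resource, each GIIS merges the GRISes of its site, and the BDII merges the sites for its consumers. The following is only a toy in-memory sketch of that idea; the production services are LDAP servers, and the class names, resource names (`ce01`, `se01`, `ce02`) and attributes (`free_cpus`, `free_gb`) are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Gris:
    """Resource-level information provider (one per resource)."""
    resource: str
    attributes: dict

    def publish(self) -> dict:
        return {self.resource: dict(self.attributes)}


@dataclass
class Giis:
    """Site-level index that merges what its GRISes publish."""
    site: str
    grises: list = field(default_factory=list)

    def publish(self) -> dict:
        merged = {}
        for gris in self.grises:
            merged.update(gris.publish())
        return {self.site: merged}


@dataclass
class Bdii:
    """Top-level index queried by users, the WMS and monitoring services."""
    giises: list = field(default_factory=list)

    def snapshot(self) -> dict:
        view = {}
        for giis in self.giises:
            view.update(giis.publish())
        return view

    def find(self, predicate) -> list:
        """Return (site, resource) pairs whose attributes satisfy predicate."""
        return [
            (site, resource)
            for site, resources in self.snapshot().items()
            for resource, attrs in resources.items()
            if predicate(attrs)
        ]


# Hypothetical resources at two of the sites named on the slide.
catania = Giis("INFN sez. CT", [
    Gris("ce01", {"type": "CE", "free_cpus": 12}),
    Gris("se01", {"type": "SE", "free_gb": 500}),
])
merida = Giis("Merida", [Gris("ce02", {"type": "CE", "free_cpus": 0})])
bdii = Bdii([catania, merida])

# "Which resources can I use?": CEs that currently have free CPUs.
free_ces = bdii.find(lambda a: a["type"] == "CE" and a["free_cpus"] > 0)
print(free_ces)
```

A consumer only talks to the top level, which is why the WMS and monitoring services on the slide point at the BDII rather than at individual sites.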

Data Management

Users

How can I manage and share my data?
My data are in files, and I’ve terabytes.

Data Management provides:
• Storage Space
• Data Transfer
• Replication

Resources: compute elements, network resources, data storage

LCG File Catalog (LFC)

Catalogs
LFN (Logical file name)

GUID (Grid unique identifier)

Symlinks

SURL (Site URL)

TURL (Transfer URL)
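One way to read the names above: an LFN (or a symlink to it) resolves to one GUID, a GUID resolves to one or more SURLs (the replicas), and a SURL is turned into a TURL when the file is actually transferred. The sketch below is a toy in-memory model of that chain, not the LFC API; the class `ToyFileCatalog`, the helper `toy_turl`, the host names and the paths are all hypothetical, and in practice the TURL is handed out by the storage element rather than derived by string rewriting.

```python
import uuid


class ToyFileCatalog:
    """In-memory stand-in for a file catalog: LFN -> GUID -> SURLs."""

    def __init__(self):
        self.guid_of = {}    # LFN or symlink -> GUID
        self.replicas = {}   # GUID -> list of SURLs

    def register(self, lfn, surl):
        """Register a new file under a logical name and return its GUID."""
        if lfn in self.guid_of:
            raise ValueError(f"LFN already registered: {lfn}")
        guid = f"guid:{uuid.uuid4()}"
        self.guid_of[lfn] = guid
        self.replicas[guid] = [surl]
        return guid

    def add_replica(self, lfn, surl):
        """Record another physical copy of an already registered file."""
        self.replicas[self.guid_of[lfn]].append(surl)

    def symlink(self, alias, lfn):
        """Make a second logical name point at the same GUID."""
        self.guid_of[alias] = self.guid_of[lfn]

    def list_replicas(self, lfn):
        return list(self.replicas[self.guid_of[lfn]])


def toy_turl(surl, protocol="gsiftp"):
    """Simplification: swap the scheme to get a transfer URL."""
    _, rest = surl.split("://", 1)
    return f"{protocol}://{rest}"


catalog = ToyFileCatalog()
lfn = "lfn:/grid/gilda/demo/run1.dat"
guid = catalog.register(lfn, "srm://se01.example.org/gilda/run1.dat")
catalog.add_replica(lfn, "srm://se02.example.org/gilda/run1.dat")
catalog.symlink("lfn:/grid/gilda/demo/latest.dat", lfn)

surls = catalog.list_replicas("lfn:/grid/gilda/demo/latest.dat")
print(guid)
print(surls)
print(toy_turl(surls[0]))
```

Users work with the LFN; the GUID stays fixed while replicas are added or removed, which is what lets a file move between storage elements without changing its logical name.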

Data Movement

• GridFTP: basic transfer protocol
• Data Scheduler (DS)
• File Transfer/Placement Service (FTS/FPS)
• Transfer Queue (Table)
• Transfer Agent (Network)
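The split between a transfer queue (a table of requests) and a transfer agent (the worker that moves data over the network) can be sketched as follows. This is an illustrative model only, not the FTS implementation: the state names, the retry limit and the `copy` callback (which stands in for a GridFTP transfer) are assumptions.

```python
from collections import deque
from dataclasses import dataclass


@dataclass
class Transfer:
    source: str
    destination: str
    state: str = "PENDING"   # hypothetical state names
    attempts: int = 0


class TransferQueue:
    """The 'table': holds requests until an agent picks them up."""

    def __init__(self):
        self._pending = deque()
        self.finished = []

    def submit(self, source, destination):
        transfer = Transfer(source, destination)
        self._pending.append(transfer)
        return transfer

    def next_pending(self):
        return self._pending.popleft() if self._pending else None

    def requeue(self, transfer):
        self._pending.append(transfer)


def run_agent(queue, copy, max_attempts=3):
    """The 'agent': drains the queue, retrying failed copies."""
    while True:
        transfer = queue.next_pending()
        if transfer is None:
            break
        transfer.attempts += 1
        try:
            copy(transfer.source, transfer.destination)
        except OSError:
            if transfer.attempts < max_attempts:
                queue.requeue(transfer)   # try again later
                continue
            transfer.state = "FAILED"
        else:
            transfer.state = "DONE"
        queue.finished.append(transfer)


# Simulated network: b.dat fails once, c.dat always fails.
calls = []

def flaky_copy(source, destination):
    calls.append(source)
    if source.endswith("c.dat"):
        raise OSError("simulated permanent error")
    if source.endswith("b.dat") and calls.count(source) < 2:
        raise OSError("simulated transient error")


queue = TransferQueue()
for name in ("a.dat", "b.dat", "c.dat"):
    queue.submit(f"gsiftp://se01.example.org/{name}",
                 f"gsiftp://se02.example.org/{name}")
run_agent(queue, flaky_copy)
summary = [(t.source.rsplit("/", 1)[1], t.state, t.attempts)
           for t in queue.finished]
print(summary)
```

Keeping the queue separate from the agent means requests survive a failed transfer and can be retried without the user resubmitting them.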

Basic gLite use case: Job submission
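Components: User Interface, Information System, Resource Broker, File and Replica Catalog, VO Management Service (DB of VO users), Logging and bookkeeping, Site X (Computing Element, Storage Element)

Interactions:
• User Interface and VO Management Service: create proxy
• User Interface to Resource Broker: submit job (executable + small inputs)
• Resource Broker to Information System: query
• Resource Broker to File and Replica Catalog: query
• Computing Element and Storage Element to Information System: publish state
• Resource Broker to Computing Element: submit job
• Process on the Computing Element and Storage Element: input file(s), output file(s)
• Storage Element to File and Replica Catalog: register file
• Resource Broker and Computing Element to Logging and bookkeeping: job status
• Computing Element to Resource Broker: retrieve output
• Resource Broker to User Interface: retrieve status & (small) output files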
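The "executable + small inputs" sent from the User Interface to the Resource Broker are usually described in a JDL (Job Description Language) file. Below is a minimal sketch that renders such a description from a Python dictionary, assuming the commonly used attributes `Executable`, `StdOutput`, `StdError`, `InputSandbox` and `OutputSandbox`. The helper `to_jdl` and the file names are hypothetical, and only string and list values are handled.

```python
def to_jdl(attributes):
    """Render a dict as JDL text: strings are quoted, lists use braces."""

    def render(value):
        if isinstance(value, (list, tuple)):
            return "{" + ", ".join(render(item) for item in value) + "}"
        return '"' + str(value) + '"'

    return "\n".join(f"{key} = {render(value)};"
                     for key, value in attributes.items())


# Hypothetical job: a small script shipped with the job (input sandbox),
# with stdout and stderr brought back afterwards (output sandbox).
job = {
    "Executable": "run.sh",
    "StdOutput": "std.out",
    "StdError": "std.err",
    "InputSandbox": ["run.sh"],
    "OutputSandbox": ["std.out", "std.err"],
}
jdl_text = to_jdl(job)
print(jdl_text)
```

The sandboxes correspond to the small files that travel with the job through the Resource Broker; large data sets go through the Storage Element and the catalog instead.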
Main components

User Interface (UI): the place where users log on to the Grid

Resource Broker (RB) / Workload Management System (WMS): matches the user requirements with the available resources on the Grid

Information System: characteristics and status of CE and SE

File and replica catalog: location of grid files and grid file replicas

Logging and Bookkeeping (LB): log information of jobs

Computing Element (CE): a batch queue on a site’s computers where the user’s job is executed

Storage Element (SE): provides (large-scale) storage for files
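The Resource Broker's role, matching user requirements against what the Information System reports, can be illustrated with a small filter-then-rank function. This is a sketch under stated assumptions rather than the WMS matchmaking algorithm: the attribute names, the CE names and the ranking rule (most free CPUs first) are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ComputingElement:
    name: str
    free_cpus: int
    max_wall_minutes: int
    vos: frozenset          # VOs allowed to run on this CE


def match(ces, vo, min_wall_minutes):
    """Keep CEs that satisfy the requirements, best-ranked first."""
    eligible = [
        ce for ce in ces
        if vo in ce.vos
        and ce.max_wall_minutes >= min_wall_minutes
        and ce.free_cpus > 0
    ]
    # Assumed ranking rule: prefer the CE with the most free CPUs.
    return sorted(eligible, key=lambda ce: ce.free_cpus, reverse=True)


# Hypothetical status, as it might be reported by the Information System.
ces = [
    ComputingElement("ce01.example.org", 12, 2880, frozenset({"gilda"})),
    ComputingElement("ce02.example.org", 0, 4320, frozenset({"gilda"})),
    ComputingElement("ce03.example.org", 40, 60, frozenset({"gilda"})),
    ComputingElement("ce04.example.org", 30, 1440, frozenset({"other"})),
    ComputingElement("ce05.example.org", 20, 1440,
                     frozenset({"gilda", "other"})),
]
ranked = [ce.name for ce in match(ces, vo="gilda", min_wall_minutes=120)]
print(ranked)
```

Filtering corresponds to the user's requirements and sorting to a preference among the CEs that pass; the job would be sent to the first entry.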

All built upon authorisation, authentication and security
Who provides the resources?!

Service | Provider | Note
User Interface | User / institute / VO | Computer with client SW
Resource Broker (WMS) | VOs - EGEE does not fund RBs |
Information System | Grid operations - EGEE funded effort |
File and replica catalog | VOs - EGEE does not fund catalogs |
Logging and Bookkeeping | VOs - EGEE does not fund LB servers |
Computing Element (CE) | VOs - EGEE does not fund CEs | VOs provide resources to match average need
Storage Element (SE) | VOs - EGEE does not fund SEs | VOs provide resources to match average need
External services | User / institute / VO | To extend the capabilities of the core infrastructure

Further information

• EGEE digital library: http://egee.lib.ed.ac.uk/
• EGEE: www.eu-egee.org
• gLite: http://www.glite.org/
• GILDA: https://gilda.ct.infn.it/
• Open Grid Forum: http://www.gridforum.org/
• Globus Alliance: http://www.globus.org/
• VDT: http://www.cs.wisc.edu/vdt/

Questions
