You are on page 1of 53

CLOUD COMPUTING

Open Grid Services Architecture (OGSA) Courtesy:


Dr Gnanasekaran
Thangavel
UNIT 3 GRID SERVICES

Introduction to Open Grid Services Architecture (OGSA)


– Motivation
– Functionality Requirements
– Practical & Detailed view of OGSA/OGSI
– Data intensive grid service models
– OGSA services
– Grid Computing Software

3/25/2021 2
Grid Computing Generations

3/25/2021 3
https://gridaccess.wordpress.com/generations-of-grid/
What is the OGSA Standard?
• Acronym for Open Grid Service Architecture

• OGSA define how different components in grid interact

• Open Grid Services Architecture (OGSA) is a set of standards defining


the way in which information is shared among diverse components of
large, heterogeneous grid systems.

• In this context, a grid system is a scalable Wide Area Network (WAN)


that supports resource sharing and distribution.

3/25/2021 4
Architecture of OGSA

Comprised of 4 main layers

1. OGSA Enabled Physical and Logical Resources Layer

2. Web Services Layer

3. OGSA Architected Services Layer

4. Grid Applications Layer

3/25/2021 5
OGSA Architecture

3/25/2021 6
OGSA Architecture - Physical and Logical
Resources Layer

• Physical resources are: servers, storage, network

• Logical resources manage physical resources

• Examples of logical resources: database managers, workflow managers


OGSA Architecture - Web Services Layer
• Web service is software available online that could interact with other software using
XML
• Consists of Open Grid Services Infrastructure (OGSI) sub-layer which specifies grid
services and provide consistent way to interact with grid services
• Also extends Web Service Capabilities
Consists of 5 interfaces:
1. Life Cycle: Manages grid service life cycles
2. State Management: Manage grid service states
3. Service Groups: Collection of indexed grid services
4. Factory: Provide way for creation of new grid services
5. Notification: Manages notification between services & resources
6. HandleMap: Deals with service identity when Factories are used

3/25/2021 8
OGSA Architecture - Web Services Layer (OGSI)

3/25/2021 9
Open Grid Services Infrastructure (OGSI)
• Gives a formal and technical specification of what a grid service is.
• Its a excruciatingly detailed specification of how Grid Services work.
• Globus Toolkit 3 (GT3) includes a complete implementation of OGSI.
• It is a formal and technical specification of the concepts described in OGSA.
• Some other implementations are OGSI::Lite (Perl)1 and the UNICORE OGSA
demonstrator2 (Uniform Interface to Computing Resources) from the EU GRIP
(Grid Interoperability Project).
• OGSI specification defines grid services and builds upon web services.

3/25/2021 10
OGSA Architecture – OGSA Architected Services -
Layer
OGSA Architected Services are classified into 3 service categories:

1. Grid Core Services

2. Grid Program Execution Services

3. Grid Data Services

3/25/2021 11
OGSA Architecture – OGSA Architected Services -
Layer

3/25/2021 12
OGSA Architected Services – Grid Core Services
Grid Core services are composed of 4 main types of services:

1. Management Services: assist in installation, maintenance, & troubleshooting


tasks in grid system
2. Communication Services: include functions that allow grid services to
communicate
3. Policy Services: Provide framework for creation, administration & management of
policies for system operation
4. Security Services: provide authentication & authorization mechanisms to ensure
systems interoperate securely

3/25/2021 13
OGSA Architected Services – Grid Program
Execution Services

Grid Program Execution Services support unique grid

systems in high performance computing, collaboration and

parallelism

• Support virtualization of resource processing

3/25/2021 14
OGSA Architected Services – Grid Data Services

Grid Data Services support data virtualization

• Provide mechanism for access to distributed resources

such as databases and files

3/25/2021 15
OGSA Architecture – Grid Applications Layer

Grid Applications Layer comprises of applications that use


the grid architected services

3/25/2021 16
Conclusion

• Grid-Computing allows networked resources to be combined and used

• Grid-Computing offers great benefit to an organization

• OGSA are comprehensive standards which governs grid-computing

• There are 3 types of Grids namely Compute Grid, Scavanging Grid and Data

Grid
Data intensive grid service models

Applications in the grid are normally grouped into two categories i.e.
Computation-intensive and Data intensive
• Data intensive applications deals with massive amounts of data.
• The grid system must specially be designed to discover, transfer and
manipulate the massive data sets.
• Transferring the massive data sets is a time consuming task.
• Data access method is also known as caching, which is often applied to
enhance data efficiency in a grid environment.
• By replicating the same data block and scattering them in multiple regions in
a grid, users can access the same data with locality of references.

3/25/2021 18
Data intensive grid service models
• Replication strategies determine when and where to create a replica of the data.
• The strategies of replications can be classified into dynamic and static
Static method
• The locations and number of replicas are determined in advance and will not be modified.
• Replication operation require little overhead
• Static strategic cannot adapt to changes in demand, bandwidth and storage variability
• Optimization is required to determine the location and number of data replicas.
Dynamic strategies
• Dynamic strategies can adjust locations and number of data replicas according to change in
conditions
• Frequent data moving operations can result in much more overhead the static strategies
• Optimization may be determined based on whether the data replica is being created,
deleted or moved.

3/25/2021 19
Grid data Access models

In general there are four access models for organizing a data grid
as listed here
1. Monadic model

2. Hierarchical model

3. Federation model

4. Hybrid model

3/25/2021 20
Monadic model
• This is a centralized data repository model. All
data is saved in central data repository.
• When users want to access some data they
have no submit request directly to the central
repository.
• No data is replicated for preserving data
locality.
• For a larger grid this model is not efficient in
terms of performance and reliability.
• Data replication is permitted in this model only
when fault tolerance is demanded.

3/25/2021 21
Hierarchical model
• It is suitable for building a large data grid which
has only one large data access directory
• Data may be transferred from the source to a
second level center. Then some data in the
regional center is transferred to the third level
centre.
• After being forwarded several times specific
data objects are accessed directly by users.
Higher level data center has a wider coverage
area.
• PKI security services are easier to implement
in this hierarchical data access model

3/25/2021 22
Federation model
• It is suited for designing a data grid with
multiple source of data supplies.
• It is also known as a mesh model
• The data is shared the data and items are
owned and controlled by their original
owners.
• Only authenticated users are authorized to
request data from any data source.
• This mesh model cost the most when the
number of grid intuitions becomes very
large

3/25/2021 23
Hybrid model
• This model combines the best features of
the hierarchical and mesh models.
• Traditional data transfer technology such as
FTP applies for networks with lower
bandwidth.
• High bandwidth are exploited by high speed
data transfer tools such as GridFTP
developed with Globus library.
• The cost of hybrid model can be traded off
between the two extreme models of
hierarchical and mesh-connected grids.

3/25/2021 24
Parallel versus Striped Data Transfers
• Parallel data transfer opens multiple data streams for passing subdivided segments
of a file simultaneously. Although the speed of each stream is same as in sequential
streaming, the total time to move data in all streams can be significantly reduced
compared to FTP transfer.

• Striped data transfer a data objects is partitioned into a number of sections and
each section is placed in an individual site in a data grid. When a user requests this
piece of data, a data stream is created for each site in a data gird. When user
requests this piece of data, data stream is created for each site, and all the sections
of data objects ate transected simultaneously.

3/25/2021 25
Grid Services and OGSA
• Facilitate use and management of resources across distributed, heterogeneous
environments
• Deliver seamless QoS
• Define open, published interfaces in order to provide interoperability of diverse
resources
• Exploit industry-standard integration technologies
• Develop standards that achieve interoperability
• Integrate, virtualize, and manage services and resources in a distributed,
heterogeneous environment
• Deliver functionality as loosely coupled, interacting services aligned with
industry- accepted web service standards

3/25/2021 26
Grid Services and OGSA
• OGSA services fall into
seven broad areas,
defined in terms of
capabilities frequently
required in a grid scenario.

• Figure shows the OGSA


architecture.

• These services are


summarized as follows:

3/25/2021 27
OGSA services - seven broad areas
1. Infrastructure Services Refer to a set of common functionalities, such as naming,
typically required by higher level services.
2. Execution Management Services Concerned with issues such as starting and
managing tasks, including placement, provisioning, and life-cycle management. Tasks
may range from simple jobs to complex workflows or composite services.
3. Data Management Services Provide functionality to move data to where it is needed,
maintain replicated copies, run queries and updates, and transform data into new
formats. These services must handle issues such as data consistency, persistency, and
integrity. An OGSA data service is a web service that implements one or more of the base
data interfaces to enable access to, and management of, data resources in a distributed
environment. The three base interfaces, Data Access, Data Factory, and Data
Management, define basic operations for representing, accessing, creating, and
managing data.

3/25/2021 28
OGSA services - seven broad areas
4. Resource Management Services Provide management capabilities for grid
resources: management of the resources themselves, management of the
resources as grid components, and management of the OGSA infrastructure. For
example, resources can be monitored, reserved, deployed, and configured as
needed to meet application QoS requirements. I t also requires an information
model (semantics) and data model (representation) of the grid resources and
services.
5. Security Services Facilitate the enforcement of security-related policies within a
(virtual) organization, and supports safe resource sharing. Authentication,
authorization, and integrity assurance are essential functionalities provided by
these services.

3/25/2021 29
OGSA services - seven broad areas
6. Information Services Provide efficient production of, and access to, information about
the grid and its constituent resources. The term “information” refers to dynamic data or
events used for status monitoring; relatively static data used for discovery; and any
data that is logged. Troubleshooting is j ust one of the possible uses for information
provided by these services.
7. Self-Management Services Support service-level attainment for a set of services (or
resources), with as much automation as possible, to reduce the costs and complexity
of managing the system. These services are essential in addressing the increasing
complexity of owning and operating an I T infrastructure.

3/25/2021 30
Grid Computing Software

• Condor / HTCondor
– NASA
– Intel

• BOINC (Berkeley Open Infrastructure for Network


Computing)

3/25/2021 31
HTCondor https://research.cs.wisc.edu/htcondor/

• HTCondor is an open-source high-throughput computing software framework for coarse-grained distributed


parallelization of computationally intensive tasks.
• It can be used to manage workload on a dedicated cluster of computers, and/or to farm out work to idle
desktop computers – so-called cycle scavenging.
• HTCondor runs on Linux, Unix, Mac OS X, FreeBSD, and contemporary Windows operating systems.
• HTCondor can seamlessly integrate both dedicated resources (rack-mounted clusters) and non-dedicated
desktop machines (cycle scavenging) into one computing environment.
• HTCondor was formerly known as Condor; the name was changed in October 2012 to resolve a trademark
lawsuit.
• HTCondor is developed by the HTCondor team at the University of Wisconsin–Madison and is freely
available for use.
• HTCondor follows an open source philosophy (it's licensed under the Apache License 2.0).
• It can be downloaded from the HTCondor web site or by installing the Fedora Linux Distribution. It is also
available on other platforms, like Ubuntu from the repositories.

3/25/2021 32
Downloading HTCondor

3/25/2021 33
Downloading HTCondor

3/25/2021 34
Installing HTCondor

3/25/2021 35
Testing HTCondor

• After Install
• Execute /etc/init.d/condor restart
To ensure that HTCondor is running, you can run: ps -ef | egrep condor_
adeel@adeel-HP-ProBook-450-G1:~$ ps -ef | egrep condor_
condor 28658 1 0 22:26 ? 00:00:00 /usr/sbin/condor_master -f
root 28684 28658 0 22:26 ? 00:00:00 condor_procd -A /var/run/condor/procd_pipe -L /var/log/condor/ProcLog -
R 1000000 -S 60 -C 132
condor 28685 28658 0 22:26 ? 00:00:00 condor_collector -f
condor 28686 28658 0 22:26 ? 00:00:00 condor_startd -f
condor 28687 28658 0 22:26 ? 00:00:00 condor_schedd -f
condor 28688 28658 0 22:26 ? 00:00:00 condor_negotiator -f
condor 28815 3374 0 22:27 ? 00:00:00 condor_kbdd
adeel 33159 27815 0 23:58 pts/5 00:00:00 grep -E --color=auto condor_

3/25/2021 36
Quick Starting HTCondor
http://research.cs.wisc.edu/htcondor/manual/quickstart.html

3/25/2021 37
Creating a script and a submit description file
Unix/Linux Windows

Batch file
Shell Script

Submit Description File

3/25/2021 38
Typical Commands of HTCondor

• To start/restart condor use /etc/init.d/condor restart


• To submit a job use condor_submit sleep.sub
• To check job queue use condor_q -submitter adeel
• To remove a job use condor_rm 1.0
• Log of execution is saved in sleep.log

3/25/2021 39
BOINC https://boinc.berkeley.edu/

• The Berkeley Open Infrastructure for Network Computing (BOINC) is an open-source


middleware system, supports volunteer and grid computing.
• Originally developed to support the SETI@home project, it became generalized as a platform
for other distributed applications in areas as diverse as mathematics, linguistics, medicine,
molecular biology, climatology, environmental science, and astrophysics, among others.
• BOINC aims to enable researchers to tap into the enormous processing resources of multiple
personal computers around the world.
• BOINC development originated with a team based at the Space Sciences Laboratory (SSL) at
the University of California, Berkeley and led by David Anderson, who also leads
SETI@home.
• As a high-performance distributed computing platform, BOINC brings together about 311,742
active participants and 834,343 active computers (hosts) worldwide processing on average
16.912 PetaFLOPS as of 13 January 2017.

3/25/2021 40
Downloading BOINC

3/25/2021 41
Selecting BOINC Projects

3/25/2021 42
BOINC Projects

3/25/2021 43
BOINC Projects

3/25/2021 44
BOINC Projects

3/25/2021 45
BOINC Projects

3/25/2021 46
BOINC Projects

3/25/2021 47
GridRepublic http://www.gridrepublic.org

3/25/2021 48
GridRepublic http://www.gridrepublic.org

3/25/2021 49
References
1. Kai Hwang, Geoffery C. Fox and Jack J. Dongarra, “Distributed and Cloud Computing: Clusters, Grids,
Clouds and the Future of Internet”, First Edition, Morgan Kaufman Publisher, an Imprint of Elsevier, 2012.

2. https://www.dcc.fc.up.pt/~ines/aulas/1213/CG/OGSA.ppt

3. http://www.computerworld.com/article/2552339/networking/open-grid-services-architecture.html

4. http://searchsoa.techtarget.com/definition/Open-Grid-Services-Architecture

5. www.cs.umsl.edu/~sanjiv/classes/cs6740/presentation/OGSA.ppt

6. www.nesc.ac.uk/news/.../OpenGridServicesArchitectureApril20021.ppt

7. www.cse.buffalo.edu/~bina/cse486/spring2011/progtutorial_0.4.3.pdf

3/25/2021 50
Related Videos on Youtube
• IBM Grid Computing Demo
– https://www.youtube.com/watch?v=esVzoSqQ1Pc
• Introduction to Grid Computing
– https://www.youtube.com/watch?v=vkBY07pbgGU
• Grid Computing : History and Evolution of Grid Computing
with generations
– https://www.youtube.com/watch?v=KfKgDjop9o0
• Comparison of Cloud with Grid Computing
– https://www.youtube.com/watch?v=qJwqK1GVubg

3/25/2021 51
Assignment #2

• Setup your BOINC Project at http://gridrepublic.org and


submit the Status Report of utilization of your Grid Node as
Assignment #2 Next Week
• Differentiate between Grid and Cloud Computing after
watching the videos on slide 51

3/25/2021 52
Thank You

Questions and Comments?

3/25/2021 53

You might also like