Professional Documents
Culture Documents
1 Introduction
1 Introduction
January 8, 2024
Preeti Malakar
pmalakar@cse.iitk.ac.in
Logistics
• Class hours: MW 3:30 – 4:45 PM (RM 101)
• Office hour: MTW 5:00 – 5:30 PM (KD 221)
• https://www.cse.iitk.ac.in/users/cs633/2023-24-2
– Lectures will be uploaded after every class
• Announcements/uploads on
– MooKIT
– Course email alias
• Email to the instructor should always be prefixed with
[CS633] in the subject
2
Grading Policy
Participate actively in class
3
Switch OFF All Devices
4
5
Assignments
• Programming assignments in C
• In a group (group size = 3)
– Send group member information by Jan 14 to
{gsarkar,madhavm}@cse.iitk.ac.in
– Include clearly names, roll numbers, IITK email-ids
– Subject of email [CS633 Group]
– Change in group formation is not allowed
• Mode of submission will be explained in due time
6
Assignments
Introduction
Multicore Era
CPU
Intel 4004
(1971)
Single core
single chip
9
Moore’s Law (1965)
Number of transistors in a chip doubles every 18 months
[Source: Wikipedia]
“However, it must be programmed with a more complicated parallel programming
10
model to obtain maximum performance.”
Trends
~ $600 million
~ 7300 sq. ft.
~ 22 MW power
~ 23000 L water
13
green500.org (Nov’23)
https://www.top500.or
g/resources/top-
systems/
16
Making of a Supercomputer
Source: energy.gov 17
Greenest Data Centre?
18
“The 149,000 square
foot facility built on a
hillside overlooking the
UC Berkeley campus
and San Francisco Bay
will house one of the
most energy-efficient
computing centers
anywhere, tapping into
the region’s mild
climate to cool the
supercomputers at the
National Energy
Research Scientific
Computing Center
(NERSC) and eliminating
the need for
mechanical cooling. ”
https://www.science.org/content/article/climate-change-threatens-supercomputers 19
Top Supercomputers from India
20
Supercomputing in India [topsc.cdacb.in, Jul’23]
21
Source: www.iitk.ac.in
22
Credit: Ashish Kuvelkar, CDAC
23
National Supercomputing Mission Sites
24
Big Compute
25
Massively Parallel Codes
27
Numerical Weather Models
28
Massively Parallel Simulations
32
Computational Science
34
Output Data
10 PB / year
High-
2 PB / simulation
energy
Scaled to 786K cores on Mira
physics
Higgs boson simulation
Source: CERN
240 TB / simulation
Cosmology
Q Continuum simulation
Source: Salman Habib et al.
Climate/weather
Hurricane simulation
Source: NASA 35
Input Data
1.00E-04
Byte/FLOP
1.00E-05
1.00E-06
1997 2001 2004 2008 2010 2011 2013 2015 2018
39
Why Parallel?
A*
20 hours
2 hours
Not really
40
Parallelism
A parallel computer is a collection of processing
elements that communicate and cooperate to solve
large problems fast.
41
Speedup
Example – Sum of squares of N numbers
Serial Parallel
O(N) O(N/P) +
Communication time
42
Performance Measure
• Speedup
Time ( 1 processor)
SP =
Time ( P processors)
• Efficiency
SP
EP =
P
43
Parallel Performance (Parallel Sum)
Parallel efficiency of summing 10^7 doubles
44
Ideal Speedup
Speedup Linear
Superlinear
Sublinear
Processors
45
Issue – Scalability
48
Distributed Memory Systems
• Networked systems
Node • Distributed memory
• Local memory
• Remote memory
• Parallel
Codefile system
Cluster
49
Parallel Programming Models
Libraries MPI, TBB, Pthread, OpenMP, …
New languages Haskell, X10, Chapel, …
Extensions Coarray Fortran, UPC, Cilk, OpenCL, …
• Shared memory
– OpenMP, Pthreads, …
• Distributed memory
– MPI, UPC, …
• Hybrid
– MPI + OpenMP
50
This course …
51
Large-scale Parallel Computing
Message Parallel
passing algorithms
Designing Performance
parallel codes analysis
52
Message Passing Paradigm
53
Profiling
54
Parallel I/O
NOT SHARED
2 GB/s SHARED
BRIDGE NODES
4 GB/s
IB NETWORK
128:1
11
Job Scheduling
Wikipedia
NODES USERS
JOBS
57
Reference Material
58