Parallel Thinking*
Guy Blelloch
Carnegie Mellon University
PPoPP, 2/16/2009
* PROBE, as part of the Center for Computational Thinking
Andrew Chien, 2008
Parallel Thinking
How to deal with teaching parallelism?
Option I : Minimize what users have to learn
about parallelism. Hide parallelism in libraries
that are programmed by a few experts.
Option II : Teach parallelism as an advanced
subject, after and based on standard material
on sequential computing.
Option III : Teach parallelism from the start,
with sequential computing as a special case.
Parallel Thinking
If explained at the right level of abstraction,
are many algorithms naturally parallel?
If done right, could parallel programming be
as easy as or easier than sequential
programming for many uses?
Are we currently brainwashing students to
think sequentially?
What are the core parallel ideas that all
computer scientists should know?
Quicksort from Sedgewick
public void quickSort(int[] a, int left, int right) {
    int i = left - 1;
    int j = right;
    if (right <= left) return;
    while (true) {
        while (a[++i] < a[right]);     // scan right for an element >= the pivot a[right]
        while (a[right] < a[--j])      // scan left for an element <= the pivot
            if (j == left) break;
        if (i >= j) break;             // pointers have crossed: partition done
        swap(a, i, j);
    }
    swap(a, i, right);                 // put the pivot into its final position
    quickSort(a, left, i - 1);
    quickSort(a, i + 1, right);
}
Sequential!
Quicksort from Aho-Hopcroft-Ullman
procedure QUICKSORT(S):
if S contains at most one element then return S
else
begin
choose an element a randomly from S;
let S1, S2 and S3 be the sequences of
elements in S less than, equal to,
and greater than a, respectively;
return (QUICKSORT(S1) followed by S2
followed by QUICKSORT(S3))
end
Two forms of natural parallelism: within the
partitioning of S, and across the two recursive calls
Observations 1 and 2
Natural parallelism is often lost in low-level
implementations.
Need higher-level descriptions
Need to return to the core ideas of an
algorithm and recognize what is parallel
and what is not
Lost opportunity not to describe parallelism
Quicksort in NESL
function quicksort(S) =
  if (#S <= 1) then S
  else let
    a  = S[rand(#S)];
    S1 = {e in S | e < a};
    S2 = {e in S | e = a};
    S3 = {e in S | e > a};
    R  = {quicksort(v) : v in [S1, S3]};
  in R[0] ++ S2 ++ R[1];
Parallel selection
{e in S | e < a}

S              = [2, 1, 4, 0, 3, 1, 5, 7]
F = S < 4      = [1, 1, 0, 1, 1, 1, 0, 0]
I = addscan(F) = [0, 1, 2, 2, 3, 4, 5, 5]
R[I] = S where F
R              = [2, 1, 0, 3, 1]

addscan: each element gets the sum of the previous elements.
Seems sequential?
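A minimal sequential Python sketch of this pack-via-scan step (the explicit
loops stand in for what would be parallel operations; the names addscan and
pack follow the slide, but the code itself is illustrative, not from the talk):

def addscan(flags):
    # Exclusive plus-scan: out[i] = sum of flags[0..i-1].
    out, total = [], 0
    for f in flags:
        out.append(total)
        total += f
    return out

def pack(S, flags):
    # Keep S[i] wherever flags[i] == 1, writing each kept element
    # to position addscan(flags)[i] of the result.
    I = addscan(flags)
    R = [None] * sum(flags)
    for i in range(len(S)):
        if flags[i]:
            R[I[i]] = S[i]
    return R

S = [2, 1, 4, 0, 3, 1, 5, 7]
F = [1 if e < 4 else 0 for e in S]
print(pack(S, F))   # prints [2, 1, 0, 3, 1], matching the slide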
Scan
A                       = [2, 1, 4, 2, 3, 1, 5, 7]
sum pairs   ->  sums    = [3, 6, 4, 12]
recurse     ->  evens   = [0, 3, 9, 13]
sum         ->  odds    = [2, 7, 12, 18]
interleave  ->  result  = [0, 2, 3, 7, 9, 12, 13, 18]
Scan code
function scan(A, op) =
  if (#A <= 1) then [0]
  else let
    sums  = {op(A[2*i], A[2*i+1]) : i in [0:#A/2]};
    evens = scan(sums, op);
    odds  = {op(evens[i], A[2*i]) : i in [0:#A/2]};
  in interleave(evens, odds);

A      = [2, 1, 4, 2, 3, 1, 5, 7]
sums   = [3, 6, 4, 12]
evens  = [0, 3, 9, 13]   (result of recursion)
odds   = [2, 7, 12, 18]
result = [0, 2, 3, 7, 9, 12, 13, 18]
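A sequential Python rendering of the scan above, as a sketch for checking the
data flow (it assumes the input length is a power of two; the list
comprehensions correspond to the parallel maps in the NESL code):

def scan(A, op=lambda x, y: x + y, identity=0):
    if len(A) <= 1:
        return [identity]
    # pairwise combine, recurse on the half-length sequence, then expand
    sums = [op(A[2*i], A[2*i+1]) for i in range(len(A) // 2)]
    evens = scan(sums, op, identity)
    odds = [op(evens[i], A[2*i]) for i in range(len(A) // 2)]
    return [x for pair in zip(evens, odds) for x in pair]   # interleave

print(scan([2, 1, 4, 2, 3, 1, 5, 7]))   # [0, 2, 3, 7, 9, 12, 13, 18]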
Observations 3, 4 and 5
Just because it seems sequential does not
mean it is
+ When in doubt, recurse on a single smaller
problem and use the result to solve the
larger problem
+ Transitions can be aggregated (composed)
+ Core parallel idea/technique
Qsort Complexity
Sequential partition, parallel recursive calls

[Figure: recursion tree; each node does a sequential partition
(less than / equal / greater) followed by an append]

Span = O(n)
Work = O(n log n)
Not a very good parallel algorithm
Quicksort in HPF
subroutine quicksort(a,n)
integer n,nless,less(n),greater(n),a(n)
if (n < 2) return
pivot = a(1)
nless = count(a < pivot)
less = pack(a, a < pivot)
greater = pack(a, a >= pivot)
call quicksort(less, nless)
a(1:nless) = less
call quicksort(greater, n-nless)
a(nless+1:n) = greater
end subroutine
Qsort Complexity
Parallel partition
Sequential calls
Span = O(n)
Work = O(n log n)
Still not a very good parallel algorithm
Qsort Complexity
Parallel partition
Parallel calls
Work = O(n log n)
Span = O(log² n)
A good parallel algorithm
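A sketch of where these bounds come from (not spelled out on the slides),
assuming the pivot splits the input roughly in half, i.e. the expected case:

\begin{align*}
W(n) &= 2\,W(n/2) + O(n) = O(n \log n) && \text{work (all three versions)} \\
D(n) &= D(n/2) + O(n) = O(n) && \text{span: sequential partition, parallel calls} \\
D(n) &= 2\,D(n/2) + O(\log n) = O(n) && \text{span: parallel partition, sequential calls} \\
D(n) &= D(n/2) + O(\log n) = O(\log^2 n) && \text{span: parallel partition, parallel calls}
\end{align*}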
Complexity in Nesl
Combining for a parallel map:   pexp = {exp(e) : e in A}

  work:  W(pexp) = 1 + sum of W(exp(e)) over e in A
  span:  D(pexp) = 1 + max of D(exp(e)) over e in A

In general all you need is sum (for work) and max (for span)
for nested parallel computations.
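As a small worked instance of these rules, applied to the map
R = {quicksort(v) : v in [S1, S3]} from the NESL quicksort above:

\begin{align*}
W(R) &= 1 + W(\mathrm{quicksort}(S_1)) + W(\mathrm{quicksort}(S_3)) \\
D(R) &= 1 + \max\bigl(D(\mathrm{quicksort}(S_1)),\ D(\mathrm{quicksort}(S_3))\bigr)
\end{align*}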
Generally for a DAG
Any greedy schedule for
a DAG with span (depth)
D and work (size) W will
complete in:
T < W/P + D
Any schedule will take at
least:
T >= max(W/P, D)
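A quick numeric illustration (the numbers are made up, not from the talk):
with W = 10^9, D = 10^4 and P = 100,

\begin{align*}
T &< W/P + D = 10^{9}/10^{2} + 10^{4} = 10^{7} + 10^{4} \\
T &\ge \max(W/P,\ D) = 10^{7}
\end{align*}

so any greedy schedule is within about 0.1% of optimal here.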
Observations 6, 7, 8 and 9
+ Often need to take advantage of both data
parallelism and function parallelism
Abstract cost models that are not machine-based
are important.
+ Work and span are reasonable measures
and can be easily composed with nested
parallelism. No more difficult to understand
than time in sequential algorithms.
+ Many ways to schedule
+ = advanced topic
Matrix Inversion
Mat invert(mat M) {
  D^-1 = invert(D)
  S    = A - B D^-1 C          (Schur complement)
  S^-1 = invert(S)
  E = S^-1
  F = -S^-1 B D^-1
  G = -D^-1 C S^-1
  H = D^-1 + D^-1 C S^-1 B D^-1
}

      ( A  B )               ( E  F )
M  =  ( C  D )  ,   M^-1  =  ( G  H )

W(n) = 2 W(n/2) + 6 W*(n/2) = O(n^3)
D(n) = 2 D(n/2) + 6 D*(n/2) = O(n)
(W*, D* = work and span of matrix multiply on n/2 x n/2 blocks)
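For reference, the standard blockwise-inversion identity this code computes,
with Schur complement S = A - B D^{-1} C (D and S assumed invertible):

M = \begin{pmatrix} A & B \\ C & D \end{pmatrix},
\qquad
M^{-1} = \begin{pmatrix} S^{-1} & -S^{-1} B D^{-1} \\ -D^{-1} C S^{-1} & D^{-1} + D^{-1} C S^{-1} B D^{-1} \end{pmatrix}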
Quicksort in X10
double[] quicksort(double[] S) {
if (S.length < 2) return S;
double a = S[rand(S.length)];
double[] S1,S2,S3;
finish {
async { S1 = quicksort(lessThan(S,a));}
async { S2 = eqTo(S,a);}
S3 = quicksort(grThan(S,a));
}
return append(S1,append(S2,S3));
}
Quicksort in X10
double[] quicksort(double[] S) {
if (S.length < 2) return S;
double a = S[rand(S.length)];
double[] S1,S2,S3;
cnt = cnt+1;
finish {
async { S1 = quicksort(lessThan(S,a));}
async { S2 = eqTo(S,a);}
S3 = quicksort(grThan(S,a));
}
return append(S1,append(S2,S3));
}
????
Observation 10
Deterministic parallelism is important for
easily understanding, analyzing and
debugging programs.
Functional languages
Race detectors (e.g. cilkscreen)
Using non-functional languages in a
functional style (is this safe?)
Atomic regions and transactions don't solve this problem.
Example: Merging
Merge(nil,l2) = l2
Merge(l1,nil) = l1
Merge(h1::t1, h2::t2) =
if (h1 < h2) h1::Merge(t1,h2::t2)
else h2::Merge(h1::t1,t2)
What about in parallel?
Merging
Merge(A,B) =
  let
    Node(AL, m, AR) = A
    (BL, BR) = split(B, m)
  in
    Node(Merge(AL,BL), m, Merge(AR,BR))     (the two Merges run in parallel)

Work = O(n)
Span = O(log² n)

[Figure: A is a tree with root m and subtrees AL and AR; B is split at m
into BL and BR; Merge(AL,BL) and Merge(AR,BR) proceed independently]
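A sequential Python sketch of this tree-based merge (the helper names are
mine, not from the talk, and the trees are not rebalanced, so this shows the
structure and data flow rather than the stated bounds):

class Node:
    def __init__(self, left, key, right):
        self.left, self.key, self.right = left, key, right

def split(B, m):
    # Split tree B into (keys < m, keys >= m); None is the empty tree.
    if B is None:
        return None, None
    if B.key < m:
        lo, hi = split(B.right, m)
        return Node(B.left, B.key, lo), hi
    else:
        lo, hi = split(B.left, m)
        return lo, Node(hi, B.key, B.right)

def merge(A, B):
    # The two recursive calls are independent and could run in parallel.
    if A is None:
        return B
    BL, BR = split(B, A.key)
    return Node(merge(A.left, BL), A.key, merge(A.right, BR))

def from_list(xs):
    # Build a roughly balanced tree from a sorted list.
    if not xs:
        return None
    mid = len(xs) // 2
    return Node(from_list(xs[:mid]), xs[mid], from_list(xs[mid+1:]))

def to_list(T):
    return [] if T is None else to_list(T.left) + [T.key] + to_list(T.right)

A = from_list([1, 4, 6, 9])
B = from_list([2, 3, 5, 7, 8])
print(to_list(merge(A, B)))   # [1, 2, 3, 4, 5, 6, 7, 8, 9]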
Merging with Futures

Merge(A,B) =
  let
    Node(AL, m, AR) = A
    (BL, BR) = futureSplit(B, m)
  in
    Node(Merge(AL,BL), m, Merge(AR,BR))

Work = O(n)
Span = O(log n)

[Figure: as above, but the split of B is delivered through futures, so the
recursive Merges can start before the split has completed]
Observations 11, 12 and 13
+ Divide and conquer even more useful in
parallel than sequentially
+ Trees are better than lists for parallelism
+ Pipelining can asymptotically reduce depth,
but can be hard to analyze
The Observations
General:
1. Natural parallelism is often lost in low-level implementations.
2. Lost opportunity not to describe parallelism
3. Just because it seems sequential does not mean it is
Model and Language:
6. Need to take advantage of both data and function parallelism
7. Abstract cost models that are not machine based are important.
8. Work and span are reasonable measures
9. Many ways to schedule
10. Deterministic parallelism is important
Algorithmic Techniques:
4. When in doubt recurse on a smaller problem
5. Transitions can be aggregated
11. Divide and conquer even more useful in parallel
12. Trees are better than lists for parallelism
13. Pipelining is useful, with care
More algorithmic techniques
+ Graph contraction
+ Identifying independent sets
+ Symmetry breaking
+ Pointer jumping
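A small sequential Python sketch of pointer jumping (list ranking), one of
the techniques listed above; the parent-array representation and helper name
are illustrative. The loop over nodes stands in for a parallel map, and the
number of rounds is O(log n):

def pointer_jump(parent):
    # parent[i] is the next node; roots point to themselves.
    # Returns the distance from each node to its root.
    n = len(parent)
    dist = [0 if parent[i] == i else 1 for i in range(n)]
    changed = True
    while changed:
        changed = False
        new_parent, new_dist = parent[:], dist[:]   # synchronous update
        for i in range(n):
            p = parent[i]
            if parent[p] != p:                      # can still jump
                new_dist[i] = dist[i] + dist[p]
                new_parent[i] = parent[p]
                changed = True
        parent, dist = new_parent, new_dist
    return dist

print(pointer_jump([0, 0, 1, 2]))   # chain 3 -> 2 -> 1 -> 0 gives [0, 1, 2, 3]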
What else?
Non-deterministic parallelism:
Races and race detection
Sequential consistency, serializability,
linearizability, atomic primitives, locking
techniques, transactions
Concurrency models, e.g. the pi-calculus
Lock and wait free algorithms
Architectural issues
Cache coherence, memory layout, latency hiding
Network topology, latency vs. throughput
Exercise
Identify the core ideas in parallelism
Ideas
that will still be useful in 20 years
Separate into beginners and advanced
See how they fit into a curriculum
Emphasis on simplicity first
Will depend on existing curriculum
Possible course content
Biased by our current sequence
o 211: Fundamental data structures and algorithms
o 212: Principles of programming
o 213: Introduction to computer systems
o 251: Great theoretical ideas in computer science
211: Intro to Data Structures+Algos
Teach deterministic nested parallelism with
work and depth.
o Introduce race conditions but don't allow them.
o General techniques: divide-and-conquer,
contraction, combining, dynamic programming
o Data structures: stacks, queues, vectors,
balanced trees, matrices, graphs,
o Algorithms: scan, sorting, merging, medians,
hashing, fft, graph connectivity, MST
212: Principles of Programming
o Recursion, structural induction, currying
o Folding, mapping : emphasis on trees not lists
o Exceptions, parallel exceptions, and continuations
o Streams, futures, pipelining
o State and interaction with parallelism
o Nondeterminacy and linearizability
o Simple concurrent structure
o Or-parallelism
213: Introduction to Systems
o Representing integers/floats
o Assembly language and atomic operations
o Out-of-order processing
o Caches, virtual memory, and memory consistency
o Threads and scheduling
o Concurrency, synchronization, transactions and
serializability
o Network programming
Acknowledgements
This talk is based on 30 years of research on
parallelism by hundreds of people.
Many of the ideas come from the PRAM (theory)
community and the programming-languages (PL) community.
Conclusions/Questions
Should we teach parallelism from day 1, with
sequential computing as a special case?
Could teaching parallelism actually make some
things easier?
Is there a reasonably small set of core ideas
that every undergraduate needs to know?
If so, what are they?