Overview:
This lecture is mostly about the first, but we'll talk briefly about the second too.
Hardware perspective:
What does a modern machine look like inside, how many processors, etc.
Ping test
Software perspective:
Single machine:
Multiple processes
Multiple threads
Python:
multithreading vs multiprocessing
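To make the threads-vs-processes distinction concrete, here's a minimal sketch (standard library only; the busy-loop workload is a made-up stand-in for real CPU-bound work). Because CPython's GIL lets only one thread execute Python bytecode at a time, the thread pool runs this work roughly serially, while the process pool can actually use multiple cores:

```python
# Sketch: CPU-bound work scales with processes but not threads in CPython,
# because the GIL allows only one thread to run Python bytecode at a time.
from concurrent.futures import ThreadPoolExecutor, ProcessPoolExecutor

def cpu_bound(n):
    # Busy loop: pure Python arithmetic, holds the GIL the whole time.
    total = 0
    for i in range(n):
        total += i * i
    return total

if __name__ == "__main__":
    args = [200_000] * 4  # hypothetical workload sizes
    with ThreadPoolExecutor(max_workers=4) as ex:
        thread_results = list(ex.map(cpu_bound, args))   # ~serial wall time
    with ProcessPoolExecutor(max_workers=4) as ex:
        process_results = list(ex.map(cpu_bound, args))  # can use 4 cores
    assert thread_results == process_results  # same answers, different speed
```

For I/O-bound work (network calls, disk reads) threads are usually fine, since the GIL is released while waiting.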
Impediments to scaling:
OK, so we could write parallel code like this ourselves, but these kinds of patterns are
pretty common, and it's a lot of work to rewrite a program to be parallel by hand.
Fortunately, there are lots of good tools out there that can help you write efficient
parallel data processing programs. We'll look at one popular one, called dask, and
mention a few others.
Overall, our basic approach to parallelism is going to be to split a given data set
into N partitions, and use M processors to process this data in parallel.
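This N-partitions / M-workers pattern can be sketched with the standard library (the range partitioner and the per-partition squaring function are hypothetical placeholders, not anyone's real workload):

```python
# Sketch of the basic pattern: split a dataset into N partitions,
# then let a pool of M worker processes handle partitions in parallel.
from multiprocessing import Pool

def partition(data, n):
    """Split data into n roughly equal chunks (a simple range partitioner)."""
    k, r = divmod(len(data), n)
    parts, start = [], 0
    for i in range(n):
        size = k + (1 if i < r else 0)  # spread the remainder over early chunks
        parts.append(data[start:start + size])
        start += size
    return parts

def process_partition(chunk):
    # Placeholder per-partition work: square each element.
    return [x * x for x in chunk]

if __name__ == "__main__":
    data = list(range(100))
    N, M = 8, 4  # N partitions, M workers
    with Pool(processes=M) as pool:
        results = pool.map(process_partition, partition(data, N))
    flat = [x for part in results for x in part]  # reassemble the output
```

With N > M, the pool simply hands each worker a new partition as it finishes the previous one.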
Going to introduce a few of them, but first let's talk about frequent operations you might
want to do with data:
- Filter
- Project
- Element-wise or row-wise transform
- Join
- Repartition vs broadcast
- Aggregate
- Sort
- Train an ML model?
- User defined functions "UDF"s
Most of these are straightforward, i.e., just assign a processor to each partition -- no
further coordination needed
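For instance, filter, project, and row-wise transforms are all embarrassingly parallel: each worker applies the same function to its own partition and never talks to the others. A sketch (the employee rows and the filter/project function are invented for illustration):

```python
# Sketch: filter + project applied per partition, with no coordination.
from multiprocessing import Pool

# Partitions of hypothetical (name, dept, salary) rows.
partitions = [
    [("ann", "eng", 100), ("bob", "sales", 80)],
    [("carol", "eng", 120), ("dave", "hr", 90)],
]

def filter_project(part):
    # Filter: keep engineering rows; project: keep only (name, salary).
    return [(name, salary) for (name, dept, salary) in part if dept == "eng"]

if __name__ == "__main__":
    with Pool(2) as pool:
        out = pool.map(filter_project, partitions)  # one task per partition
    # out == [[('ann', 100)], [('carol', 120)]]
```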
Others are still parallelizable, but have to do some extra work -- e.g., the aggregation
example above. We'll look at join in a bit. There's lots of work on parallel sorting,
parallel model building, etc. -- not going to focus on those in this class.
What if N >> M ?
- fine as long as each partition isn't too small (which would waste work)
- task grain issue
What if M >> N?
- can split some partitions
Random / round-robin partitioning vs attribute-based partitioning:
Attribute-based partitioning can be good, because it can allow you to skip some
partitions if you filter on that attribute, and can also help in joins, as we will see.
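The partition-skipping idea can be sketched as follows (a toy hash partitioner over invented rows; real systems track partition metadata rather than recomputing the hash at query time):

```python
# Sketch: hash-partition rows on an attribute, then prune partitions
# for an equality filter on that attribute (only one partition can match).
def hash_partition(rows, key, n):
    parts = [[] for _ in range(n)]
    for row in rows:
        parts[hash(row[key]) % n].append(row)
    return parts

rows = [{"dept": d, "salary": s}
        for d, s in [("eng", 100), ("sales", 80), ("eng", 120), ("hr", 90)]]
parts = hash_partition(rows, "dept", 4)

# Query: dept == "eng". Instead of scanning all 4 partitions, compute
# which partition "eng" rows were routed to and scan only that one.
target = hash("eng") % 4
matches = [r for r in parts[target] if r["dept"] == "eng"]
```

With round-robin partitioning, the same query would have to scan every partition.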
Parallel joins
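One common strategy (a sketch of a partitioned hash join, not the only option): shuffle both inputs by the join key so matching rows land in the same partition index, then join each partition pair locally and independently. The employee/department rows below are invented:

```python
# Sketch: repartition both inputs on the join key, then do an
# in-memory hash join within each co-partitioned pair.
from collections import defaultdict

def shuffle_by_key(rows, key, n):
    """Route each row to partition hash(key) % n, as a shuffle would."""
    parts = [[] for _ in range(n)]
    for row in rows:
        parts[hash(row[key]) % n].append(row)
    return parts

def local_hash_join(left, right, key):
    """Join one pair of co-partitioned inputs with an in-memory hash table."""
    table = defaultdict(list)
    for l in left:
        table[l[key]].append(l)
    return [{**l, **r} for r in right for l in table[r[key]]]

emps = [{"dept": "eng", "name": "ann"}, {"dept": "hr", "name": "bob"}]
depts = [{"dept": "eng", "floor": 3}, {"dept": "hr", "floor": 1}]
n = 2
left_parts = shuffle_by_key(emps, "dept", n)
right_parts = shuffle_by_key(depts, "dept", n)
# Each (left_parts[i], right_parts[i]) pair joins independently, in parallel.
joined = [row for i in range(n)
          for row in local_hash_join(left_parts[i], right_parts[i], "dept")]
```

If one input is small, an alternative is to broadcast it whole to every worker and skip shuffling the big input -- the repartition-vs-broadcast choice listed earlier.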
Parallel aggregations can be done similarly to shuffle -- each worker scans one
partition, computes partial aggregate, and then merge aggregates at the end
Common SQL aggregates -- sum, count, etc. -- are commutative and associative, so easy
to merge
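A sketch of this partial-aggregate pattern (partition contents are invented; note that average itself isn't directly mergeable, so we carry (sum, count) partials and divide at the end):

```python
# Sketch: each worker computes a partial (sum, count) over its partition;
# partials are merged at the end. This works because sum and count are
# commutative and associative.
from multiprocessing import Pool

def partial_agg(part):
    return (sum(part), len(part))

def merge(a, b):
    return (a[0] + b[0], a[1] + b[1])

if __name__ == "__main__":
    partitions = [[1, 2, 3], [4, 5], [6, 7, 8, 9]]
    with Pool(3) as pool:
        partials = pool.map(partial_agg, partitions)  # one scan per partition
    total, count = partials[0]
    for p in partials[1:]:
        total, count = merge((total, count), p)
    avg = total / count  # global average recovered from mergeable partials
```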
Postgres example?
Dask
Spark
Summary