0% found this document useful (0 votes)

25 views58 pages

1 2 Logistics Comp Graphs

The document outlines the course EE-433/AI-511 on Deep Learning at UET Lahore, focusing on theory and practice in deep learning techniques, including optimization and computational graphs. It details course logistics, including contacts, homework assignments, and a complex engineering project, emphasizing teamwork and original problem-solving. Additionally, it provides resources for learning and tools required for programming in Python and PyTorch.

Uploaded by

hanimukhtar512

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

25 views58 pages

1 2 Logistics Comp Graphs

Uploaded by

hanimukhtar512

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 58

Deep Learning

1-2 Logistics, Software, Computational Graphs

EE-433/AI-511, UET Lahore, Pakistan

Dr. Ahsen Tahir

.The slides in part have been modified from Ian Good Fellow book slides and Dive in to Deep Learning slides
Goals

• Introduction to Deep Learning

(MLP, optimization, convolutions, sequences)
• Theory
• Capacity control (weight decay, dropout, batch norm)
• Optimization, models, overfitting, objective functions
• Practice
• Write code in Python / Pytorch
• Solve realistic problems
• Complex Engineering Problem
• Ability to solve original problems in Deep Learning in a team
EE-433/AI-511
Getting there

• Course
• “Dive in to Deep learning” book online
[Link]
• “Deep Learning” book by Ian GoodFellow et al. online
[Link]
• Dive into Deep Learning
• Jupyter Notebooks
• Github repository at d2l-ai/d2l-en

EE-433/AI-511
Logistics
Contacts

• Lecturers
• Ahsen Tahir
Office hours: TBA
• Email ahsan@[Link]
• Teaching Support
• Anique Aslam
Office hours: TBA
• Email maniqueaslam@[Link]

EE-433/AI-511
Homework

• 5 assignments + 1 CEP BS/Project MS

• Due 1 week after posted
2/12, 2/26, 3/12, 4/2, 4/19, CEP at the end
• Best 4 out of 5 homeworks count
• Code plagiarism from each other or online ->6 months rustication
• No mark for late submission
• programming assignments

EE-433/AI-511
Homework

• Submit homework via GitHub

• Submit the homework by 12am it’s due
• pulled request after deadline
• Submit as Jupyter notebooks (code)
• Commited annotated feedback via Git
• Logistics
• Github account & repository (email to course)
• Permission for teacher to read/write the repository

EE-433/AI-511
Complex Engineering Problem (CEP) / Project

• Original work in machine learning

• Existing tools applied to novel problem
• Novel tools

• Research ‘with training wheels’ simulates academic process

• Research in a team (4 students BS/ 1 student MS)
• Deliverables with schedule / deadlines
• End result is a paper/report/presentation (NIPS template)

EE-433/AI-511
Complex Engineering Problem (CEP)

• 2/5 Register team (names, working title)

• 3/5 Project proposal (1-2 page, 5 min talk)
• 4/21-22 (or earlier) Talk to Teacher to discuss
• Final presentation & report
(6-20 pages report, 6-20 slides talk)

• Start early (last minute projects fail often)

• No, you cannot do it alone. This is teamwork.
EE-433/AI-511
Deep Learning
SIFT - DAVID LOWE
MOST CELEBRATED ALGORITHM FOR OBJECT (OVERLAPS YELLOW AND GREEN)

E
DETECTION/RECOGNITION, MAPPING, TRACKING 10-13 YEARS AGO

E T
O L
B S
O
THE FUTURE OF COMPUTER VISION
BELONGS TO THE FEATURE LEARNING

DAVID LOWE
Classify Images

[Link]

EE-433/AI-511
Classify Images

[Link]
Yanofsky, Quartz
[Link]
the-direction-of-ai-research-and-possibly-the-
world/
COMPUTER VISION WITH DEEP LEARNING
Convolutional neural networks for computer vision
Object Detection (Yolo-Lite) Image Segmentation (Yolo-Lite)
Detect and Segment Objects

[Link]
EE-433/AI-511
Style transfer

[Link]

EE-433/AI-511
Synthesize Faces

Karras et al, ICLR 2018

EE-433/AI-511
Analogies

[Link]

EE-433/AI-511
Machine Translation

[Link]
Image captioning

Shallue et al, 2016

[Link]
[Link]
Software
Tools [Link]

• Python
• Everyone is using it in machine learning & data science
• Conda package manager (for simplicity)
• Jupyter
• So much easier to keep track of your experiments
• Obviously you should put longer code into modules
• Reveal (for notebook slides)
conda install -c conda-forge rise
• pytorch
• Scalability & ease of use
• Imperative interface
EE-433/AI-503
Laptop / Desktop / Generic Cloud with Linux

• Conda
wget [Link]
sh Miniconda3-latest-Linux-x86_64.sh
mkdir d2l-en
cd d2l-en
curl [Link] -o [Link]
unzip [Link]
rm [Link]
• Install pytorch

• Install NVIDIA drivers / CUDA / CUDNN / TensorRT

Colab

• Go to [Link]
• Activate the GPU supported runtime
• Install d2l
# pytorch should already be installed
!pip install d2l

EE-433/AI-503
Disclaimer

• This course will not discuss basics of python, numpy

and/or pytorch tensors
• The course assumes you have sufficient programming
experience. You know the basics of machine learning including
working of ANN/Perceptron, basic learning algorithm etc.
• The course may give a review of few topics.

EE-433/AI-503
The Learning Problem
Supervised Learning

Given:

[object label]

Questions to answer:
Gradient-Based Learning

Specify
• Model
• Cost
• Design model and cost so cost is smooth
• Minimize cost using gradient descent or related
techniques
Conditional Distributions and Cross Entropy
Learning Problem

Given:

Predict… Based on…

category of object image
sentence in French sentence in English
presence of disease X-ray image
text of a phrase audio utterance
Learning Problem

Probability makes more sense than predicting discrete labels

It is also easier to learn, due to smoothness
Intuitively, we can’t change a discrete label “a tiny bit,”
it’s all or nothing
But we can change a probability “a tiny bit”
Given:
Learning Problem

probability distribution
over photos
~
conditional probability

distribution over labels

Learning Problem

Training set:
Learning Problem
Learning Problem

maximum likelihood
estimation (MLE)
negative log-likelihood (NLL)
this is our loss function!
Conditional Distributions and Cross Entropy
Computation Graphs
Computation Graphs
Computation Graph: NN Loss Function
Computation Graphs in pytorch
Computation Graphs in pytorch
Gradients, Jacobian and
Chain Rule
Gradient
A scalar function f (x1, x2, x3) that is defined and differentiable in a domain in 3D-space with
Cartesian coordinates x1, x2, x3. We denote the gradient of that function by grad f or f (read nabla f ).
Then the gradient of f(x1, x2, x3) is defined as the vector function*.

EE-433/AI-511 *Advanced Engineering Mathematics - Kreyszig

Gradient
A vector function y = f (x) that is defined and differentiable in a domain in 1D-space with
Cartesian coordinate x. We denote the gradient of that function by grad f or f (read nabla f ).
Then the gradient of f is defined as the vector function*.

EE-433/AI-511
∂y/∂x x
x
∂y1
∂y ∂y
y1 ∂x y
∂x ∂x
∂y2
y2 ∂y
y= = ∂x y ∂y ∂y
⋮ ∂x ⋮ ∂x ∂x
ym ∂ym
∂x

∂y/∂xis a row vector, while ∂y/∂x is a column vector

It is called numerator-layout notation. The reversed version is

called denominator-layout notation
Jacobian
A vector valued f (x1, x2, x3) that is defined and differentiable in a domain in 3D-space with
Cartesian coordinates x1, x2, x3. We denote the Jacobian of that function as:

EE-433/AI-511
∂y/∂x x1 y1 x
x
x2 y2
x= y= ∂y ∂y
⋮ ⋮ y
∂x ∂x
xn ym
y ∂y ∂y
∂x ∂x
∂y1 ∂y1 ∂y1 ∂y1
,
∂x1 ∂x2
, …,∂x
∂x n

∂y2 ∂y2 ∂y2 ∂y2

∂y , , …,∂x
= ∂x = ∂x1 ∂x2 n
∂x ⋮ ⋮
∂ym ∂ym ∂ym ∂ym
∂x ∂x1
, ∂x , …, ∂x
2 n
Examples

n m ∂y m×n
y a x Ax T
xA x ∈ ℝ, y ∈ ℝ , ∈ℝ
∂x
a, a and A are not functions of x
∂y
0 I A AT
∂x 0 and I are matrices

y au Au u+v

∂y ∂u ∂u ∂u ∂v
a A +
∂x ∂x ∂x ∂x ∂x
Generalize to Matrices
Scalar Vector Matrix

x (1,) x (n,1) X (n, k)

∂y ∂y ∂y
Scalar y (1,) (1,) (1,n) (k, n)
∂x ∂x ∂X

∂y ∂y
Vector y (m,1) (m,1) (m, n) ∂y (m, k, n)
∂x ∂x
∂X

Matrix ∂Y ∂Y (m, l, n) ∂Y
Y (m, l ) (m, l ) (m, l, k, n)
∂x ∂x ∂X

[Link]/berkeley-stat-157
Chain Rule

EE-433/AI-511 *Advanced Engineering Mathematics - Kreyszig

Chain Rule

EE-433/AI-511
Chain Rule

What is ?
EE-433/AI-511
Chain Rule for higher dimensional tensors

EE-433/AI-511
Jacobian-vector product example

def f(x1, x2): def g(y1, y2):

a = x1 * x2 return y1 * y2
y1 = log(a)
y2 = sin(x2)
return (y1, y2)
EE-433/AI-511
Jacobian-vector product – pytorch uses chain rule
def f(x1, x2):
a = x1 * x2
y1 = log(a)
y2 = sin(x2)
return (y1, y2)

def g(y1, y2):

return y1 * y2

EE-433/AI-511
Jacobian-vector product – pytorch uses chain rule

EE-433/AI-511
Thank you

Deep Learning Course Overview
No ratings yet
Deep Learning Course Overview
98 pages
(It-Ebooks-2017) It-Ebooks - Fast - Ai Computational Linear Algebra Textbook (2017, Ibooker It-Ebooks) - Libgen - Li
No ratings yet
(It-Ebooks-2017) It-Ebooks - Fast - Ai Computational Linear Algebra Textbook (2017, Ibooker It-Ebooks) - Libgen - Li
193 pages
W01 - FA23 - AIC270 - Programming For AI - Syed Ahmed
No ratings yet
W01 - FA23 - AIC270 - Programming For AI - Syed Ahmed
22 pages
Christopher Manning Lecture 3: Neural Net Learning: Gradients by Hand (Matrix Calculus) and Algorithmically (The Backpropagation Algorithm)
No ratings yet
Christopher Manning Lecture 3: Neural Net Learning: Gradients by Hand (Matrix Calculus) and Algorithmically (The Backpropagation Algorithm)
84 pages
Neural Networks and Word Vectors Explained
No ratings yet
Neural Networks and Word Vectors Explained
96 pages
Syllabus Ee541 22sp
No ratings yet
Syllabus Ee541 22sp
7 pages
Machine Learning: Martin Jaggi & Nicolas Flammarion
No ratings yet
Machine Learning: Martin Jaggi & Nicolas Flammarion
52 pages
Stanford CS229: Machine Learning Course Overview
No ratings yet
Stanford CS229: Machine Learning Course Overview
40 pages
LOD Differentiable
No ratings yet
LOD Differentiable
55 pages
CS 446: Machine Learning: Dan Roth University of Illinois, Urbana-Champaign
No ratings yet
CS 446: Machine Learning: Dan Roth University of Illinois, Urbana-Champaign
75 pages
AI Programming for Beginners
No ratings yet
AI Programming for Beginners
14 pages
AI Programming with Python Syllabus
No ratings yet
AI Programming with Python Syllabus
14 pages
Deep
No ratings yet
Deep
73 pages
CSCE 636: Deep Learning
No ratings yet
CSCE 636: Deep Learning
30 pages
794 Lec Intro Handout
No ratings yet
794 Lec Intro Handout
44 pages
Lecture 5
No ratings yet
Lecture 5
114 pages
Kernels, Data and Physics - Les Houches
No ratings yet
Kernels, Data and Physics - Les Houches
105 pages
01 Intro
No ratings yet
01 Intro
49 pages
AI Programming Syllabus
No ratings yet
AI Programming Syllabus
5 pages
AI Teacher Training - Machine Learning Curriculum
No ratings yet
AI Teacher Training - Machine Learning Curriculum
34 pages
DL Lab Manual
No ratings yet
DL Lab Manual
65 pages
ML Course Aug2025
No ratings yet
ML Course Aug2025
6 pages
Unit 1 - Fundamentals of Ai - Part I
No ratings yet
Unit 1 - Fundamentals of Ai - Part I
59 pages
Deep Learning Course Intro 2020
No ratings yet
Deep Learning Course Intro 2020
77 pages
6S191 MIT DeepLearning L1
No ratings yet
6S191 MIT DeepLearning L1
108 pages
Ai 4 All
No ratings yet
Ai 4 All
31 pages
Lecture1 Introduction CVML
No ratings yet
Lecture1 Introduction CVML
26 pages
AI & Machine Learning Insights
No ratings yet
AI & Machine Learning Insights
109 pages
Intro to Machine Learning Course
No ratings yet
Intro to Machine Learning Course
83 pages
TASI Lecture On Physics For ML
No ratings yet
TASI Lecture On Physics For ML
26 pages
Modules For Machine and Deep Learning:: Fundamentals and Single-Hidden Layer Network (With Matlab)
No ratings yet
Modules For Machine and Deep Learning:: Fundamentals and Single-Hidden Layer Network (With Matlab)
35 pages
First
No ratings yet
First
92 pages
Alice Book Volume 1
No ratings yet
Alice Book Volume 1
378 pages
Lecture 1 - Intro
No ratings yet
Lecture 1 - Intro
31 pages
Notation Example
No ratings yet
Notation Example
11 pages
AI 101 CheatSheet for Beginners
No ratings yet
AI 101 CheatSheet for Beginners
27 pages
Course Admin
No ratings yet
Course Admin
15 pages
Unit - 1 Deep Learning 3-2
No ratings yet
Unit - 1 Deep Learning 3-2
15 pages
Intro to AI for Mechanical Engineers
No ratings yet
Intro to AI for Mechanical Engineers
2 pages
Ai 4 All
No ratings yet
Ai 4 All
27 pages
Syllabus Instructors
No ratings yet
Syllabus Instructors
4 pages
Alice Book Volume 1
No ratings yet
Alice Book Volume 1
281 pages
Short Course On Deep Learning: Welcome!!
No ratings yet
Short Course On Deep Learning: Welcome!!
57 pages
Introduction SYDE572 - Fall 2023
No ratings yet
Introduction SYDE572 - Fall 2023
9 pages
AD-3501-Deep Learning - COURSE PLAN - Unit - Wise
No ratings yet
AD-3501-Deep Learning - COURSE PLAN - Unit - Wise
5 pages
Deep Learning-Lecture 1 (Student)
No ratings yet
Deep Learning-Lecture 1 (Student)
9 pages
Deep Learning for Beginners
No ratings yet
Deep Learning for Beginners
151 pages
Deep Learning PDF
No ratings yet
Deep Learning PDF
289 pages
Week 1 - Artificial Neural Networks - Part I - Justin
No ratings yet
Week 1 - Artificial Neural Networks - Part I - Justin
56 pages
1 AI - Introduction and ML
No ratings yet
1 AI - Introduction and ML
32 pages
R18B Tech MinorIVYearISemesterTENTATIVESyllabus
No ratings yet
R18B Tech MinorIVYearISemesterTENTATIVESyllabus
22 pages
Computer Vision and Deep Learning 1708702317
No ratings yet
Computer Vision and Deep Learning 1708702317
93 pages
Computer Vision Course Overview
No ratings yet
Computer Vision Course Overview
111 pages
IF4071 Deep Learning Notes
No ratings yet
IF4071 Deep Learning Notes
188 pages
Lecture 2 Handout
No ratings yet
Lecture 2 Handout
154 pages
Lesson 2 - Background For AI (Autosaved) New
No ratings yet
Lesson 2 - Background For AI (Autosaved) New
37 pages
DL Unit1 Final
No ratings yet
DL Unit1 Final
41 pages
User Authentication and Validation Tests
No ratings yet
User Authentication and Validation Tests
15 pages
Calculus Course Objectives & Outline
No ratings yet
Calculus Course Objectives & Outline
3 pages
The Power of Data in QML
No ratings yet
The Power of Data in QML
34 pages
Tristation 1131: Turbomachinery Control Software
No ratings yet
Tristation 1131: Turbomachinery Control Software
13 pages
Relations and Functions Notes
No ratings yet
Relations and Functions Notes
4 pages
Lecture - 1
No ratings yet
Lecture - 1
16 pages
Methods of Solving Nonstandard Problems
No ratings yet
Methods of Solving Nonstandard Problems
349 pages
Course Outline MATH1106 - 2024
No ratings yet
Course Outline MATH1106 - 2024
4 pages
MPC 05 Revision Questions
No ratings yet
MPC 05 Revision Questions
5 pages
MAT 110 - 011 College Algebra: Syllabus Addendum TTH 8:00am - 11:00am 920/749
No ratings yet
MAT 110 - 011 College Algebra: Syllabus Addendum TTH 8:00am - 11:00am 920/749
4 pages
Mathematics - 2
No ratings yet
Mathematics - 2
5 pages
XII Holiday Homework 2024-25
No ratings yet
XII Holiday Homework 2024-25
5 pages
Yr11 Methods Exam Practice U1 Functions (Linear & Quadratic) Calculator Free Answers
No ratings yet
Yr11 Methods Exam Practice U1 Functions (Linear & Quadratic) Calculator Free Answers
21 pages
Calculus Final Practice Exam
No ratings yet
Calculus Final Practice Exam
8 pages
K To 12 Basic Education Curriculum Senior High School - Science, Technology, Engineering and Mathematics (Stem) Specialized Subject
No ratings yet
K To 12 Basic Education Curriculum Senior High School - Science, Technology, Engineering and Mathematics (Stem) Specialized Subject
5 pages
Modern Fortran Explained: Incorporating Fortran 2018 8th Edition Michael Metcalf Download
No ratings yet
Modern Fortran Explained: Incorporating Fortran 2018 8th Edition Michael Metcalf Download
48 pages
Homeomorfismo en La Circunferencia
No ratings yet
Homeomorfismo en La Circunferencia
4 pages
Differentiation Techniques for Grade 10
No ratings yet
Differentiation Techniques for Grade 10
4 pages
39 Definition of A Function
No ratings yet
39 Definition of A Function
13 pages
Manual de Manutenção DRF 450
No ratings yet
Manual de Manutenção DRF 450
388 pages
Algebra Driven Design Elegant Software From Simple Building Blocks (Sandy Maguire)
No ratings yet
Algebra Driven Design Elegant Software From Simple Building Blocks (Sandy Maguire)
337 pages
Calculus I Class Notes PDF
No ratings yet
Calculus I Class Notes PDF
57 pages
W1-Preliminaries Exercises-2
No ratings yet
W1-Preliminaries Exercises-2
17 pages
Learning Recursion v0 - 1
No ratings yet
Learning Recursion v0 - 1
115 pages
CUDNN Library
No ratings yet
CUDNN Library
38 pages
04a IGCSE Maths 4MA1 2HR - January 2022 Examination Paper PDF
No ratings yet
04a IGCSE Maths 4MA1 2HR - January 2022 Examination Paper PDF
32 pages
Exponential Equations LP
0% (1)
Exponential Equations LP
2 pages
Exponential-Function-Gen Math - 114040
No ratings yet
Exponential-Function-Gen Math - 114040
4 pages
DPP 14 Rank Refiner by Om Sir
No ratings yet
DPP 14 Rank Refiner by Om Sir
3 pages
Differential Equations in Engineering Math
No ratings yet
Differential Equations in Engineering Math
4 pages

1 2 Logistics Comp Graphs

Uploaded by

1 2 Logistics Comp Graphs

Uploaded by

Deep Learning

1-2 Logistics, Software, Computational Graphs

EE-433/AI-511, UET Lahore, Pakistan

Dr. Ahsen Tahir

• Introduction to Deep Learning

• 5 assignments + 1 CEP BS/Project MS

• Submit homework via GitHub

• Original work in machine learning

• Research ‘with training wheels’ simulates academic process

• 2/5 Register team (names, working title)

• Start early (last minute projects fail often)

Karras et al, ICLR 2018

Shallue et al, 2016

• Install NVIDIA drivers / CUDA / CUDNN / TensorRT

• This course will not discuss basics of python, numpy

Predict… Based on…

Probability makes more sense than predicting discrete labels

distribution over labels

EE-433/AI-511 *Advanced Engineering Mathematics - Kreyszig

∂y/∂xis a row vector, while ∂y/∂x is a column vector

It is called numerator-layout notation. The reversed version is

∂y2 ∂y2 ∂y2 ∂y2

x (1,) x (n,1) X (n, k)

EE-433/AI-511 *Advanced Engineering Mathematics - Kreyszig

def f(x1, x2): def g(y1, y2):

def g(y1, y2):

You might also like