
Introduction to Deep Learning


Exam SS18, 16.07.2018

Multiple-/Single-Choice questions
❏ Means: several answers could be correct, tick all correct ones

● Means: only one answer is correct

AND Regression
Given is the following AND operation between inputs x_1 and x_2:

x_1 | x_2 | and(x_1, x_2)
 1  |  1  |       1
 1  |  0  |       0
 0  |  1  |       0
 0  |  0  |       0

Which of the following weights and biases represents a valid linear regression for this task?
● W = (1,1), b = 2
● W = (1,1), b = -0.5
● W = (1, 1), b = -1.5
● None of the above
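A quick way to check the options is to evaluate w·x + b on all four inputs. The thresholding at zero below is my reading of how the regression output becomes a class label, not part of the original question.

```python
import numpy as np

# Truth table for AND
X = np.array([[1, 1], [1, 0], [0, 1], [0, 0]])
t = np.array([1, 0, 0, 0])

# The three concrete answer options
for W, b in [((1, 1), 2), ((1, 1), -0.5), ((1, 1), -1.5)]:
    scores = X @ np.array(W) + b
    pred = (scores > 0).astype(int)        # threshold the linear output at 0
    print(W, b, scores, "correct:", np.array_equal(pred, t))
```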

Which of the following introduce nonlinearities?


❏ Convolutions
❏ Pooling
❏ Batch Normalization
❏ ReLUs

Why do we use multiples of 2 for the mini-batch size?


❏ GPUs are optimized for that
❏ Makes computation faster

Short Questions

What kind of architecture would you choose for a network that translates English to German?
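The expected answer is presumably a sequence-to-sequence (encoder-decoder) architecture, e.g. a recurrent model with attention or a Transformer. A minimal sketch of the encoder-decoder idea, assuming PyTorch; all names and hyperparameters are illustrative, not from the exam.

```python
import torch.nn as nn

class Seq2Seq(nn.Module):
    """Minimal encoder-decoder: encode the English sentence, decode German tokens."""
    def __init__(self, src_vocab, tgt_vocab, emb=256, hidden=512):
        super().__init__()
        self.src_emb = nn.Embedding(src_vocab, emb)
        self.tgt_emb = nn.Embedding(tgt_vocab, emb)
        self.encoder = nn.GRU(emb, hidden, batch_first=True)
        self.decoder = nn.GRU(emb, hidden, batch_first=True)
        self.out = nn.Linear(hidden, tgt_vocab)

    def forward(self, src, tgt):
        _, h = self.encoder(self.src_emb(src))        # summarize the source sentence
        dec, _ = self.decoder(self.tgt_emb(tgt), h)   # decode conditioned on the summary
        return self.out(dec)                          # per-step logits over the target vocabulary
```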


Why is minimizing the cross-entropy between the desired output distribution q and the actual estimated distribution p, H(q, p), the same as minimizing their KL divergence KL(q, p) = H(q, p) - H(q)?
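Written out (a standard identity, not from the exam sheet): the entropy H(q) of the fixed target distribution does not depend on p, so the two objectives differ only by a constant.

```latex
\mathrm{KL}(q \,\|\, p)
= \sum_i q_i \log \frac{q_i}{p_i}
= \underbrace{-\sum_i q_i \log p_i}_{H(q,\,p)} - \underbrace{\Big(-\sum_i q_i \log q_i\Big)}_{H(q)}
= H(q, p) - H(q)
```

Minimizing over p leaves H(q) untouched, so the minimizers coincide.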

Batch Normalization
Given is the batch-normalization operation

y = ξ · (x − μ) / σ + ϕ

1. Compute the gradients of the loss function with respect to the parameters ξ and ϕ, given the gradient ∂L/∂y_ij.
2. How can you accumulate the backward gradients? (?? Very unsure about that one)
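A sketch for part 1, writing x̂_ij = (x_ij − μ)/σ for the normalized input. Because ξ and ϕ are shared across all entries, the chain rule sums over i and j, which also answers part 2: the per-element backward gradients accumulate by summation.

```latex
y_{ij} = \xi\,\hat{x}_{ij} + \phi, \qquad \hat{x}_{ij} = \frac{x_{ij} - \mu}{\sigma}

\frac{\partial L}{\partial \phi} = \sum_{i,j} \frac{\partial L}{\partial y_{ij}},
\qquad
\frac{\partial L}{\partial \xi} = \sum_{i,j} \frac{\partial L}{\partial y_{ij}}\,\hat{x}_{ij}
```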

Convolution

Given is a simple convolutional operation. You have an input of dimension 2x2x2 and apply a
convolutional kernel with filter size 1 and one output channel. This convolutional layer is
followed by a max-pooling layer of size 2x2 with stride 2 and a padding of 1.

Given are the following values:


x = ([1 −1.5; 1 −2], [−2 1; −1.5 1]), y_target = [0 1; 1 0]

1. Compute the output.
2. Compute the binary cross-entropy loss.
3. Compute the weight update, given a gradient with respect to the outputs consisting of ones and a learning rate of 1.
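A NumPy sketch of part 1. The exam sheet gave concrete kernel weights, which are not recorded above, so w and b below are placeholders; only the shapes and the pooling arithmetic follow the question as stated.

```python
import numpy as np

# Input: two channels of size 2x2, values taken from the exam text
x = np.array([[[1.0, -1.5], [1.0, -2.0]],
              [[-2.0, 1.0], [-1.5, 1.0]]])        # shape (channels, H, W)

# Placeholder 1x1 kernel (2 input channels -> 1 output channel) and bias
w = np.array([1.0, 1.0])
b = 0.0

conv = np.tensordot(w, x, axes=1) + b             # 1x1 conv = weighted sum over channels, shape (2, 2)
padded = np.pad(conv, 1, constant_values=-np.inf) # padding 1; -inf so padding never wins the max
pooled = np.array([[padded[i:i+2, j:j+2].max()    # 2x2 max-pooling with stride 2 -> shape (2, 2)
                    for j in (0, 2)] for i in (0, 2)])
print(pooled)
```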

You have an input of size 50x50x200 and a convolutional layer with a 10x10 kernel; your output should be of size 10x10x10. How many multiplications does one forward pass involve?
What would you do to reduce this number?
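Worked arithmetic, assuming one multiplication per kernel weight per output value (the 10x10 kernel spans all 200 input channels, biases ignored):

```latex
\underbrace{10 \cdot 10 \cdot 10}_{\text{output values}}
\times
\underbrace{10 \cdot 10 \cdot 200}_{\text{mults per value}}
= 1000 \times 20\,000
= 2 \times 10^{7}
```

A standard way to reduce this is a 1x1 bottleneck convolution that shrinks the 200 input channels before the 10x10 convolution (or depthwise-separable convolutions).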

Basic network stuff

Given is a deep fully-connected network with 15 layers that uses tanh as an activation function.
1. What do large inputs mean for the gradients?
2. Do any problems arise from that?
3. Now the weights are initialized with small random weights. Do you foresee any
problems?
4. Name a method that makes the network robust against initialization. (1 point)
5. If one now uses ReLU as the activation function, which problems are solved? Which problems still exist?
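For parts 1 and 2, a quick numerical illustration (mine, not from the exam): the derivative tanh′(x) = 1 − tanh²(x) collapses for large inputs, so saturated units pass almost no gradient, and the effect compounds over 15 layers.

```python
import numpy as np

x = np.array([0.0, 2.0, 5.0, 10.0])
print(1 - np.tanh(x) ** 2)   # ~[1.0, 7.1e-2, 1.8e-4, 8.2e-9]: saturation kills the gradient
```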


Optimization
1. Explain the difference between vanilla SGD and RMSProp.
2. Explain the difference between Newton and quasi-Newton methods, like BFGS.
3. Explain the difference between Newton's method and the Gauss-Newton method.
4. Explain beta_1 and beta_2, the parameters of the Adam optimizer.
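For reference, the standard update rules behind questions 1 and 4 (textbook formulations, not copied from the exam sheet), with learning rate η and gradient g:

```latex
\text{SGD:} \quad \theta \leftarrow \theta - \eta\, g

\text{RMSProp:} \quad v \leftarrow \rho v + (1-\rho)\, g^{2}, \qquad
\theta \leftarrow \theta - \frac{\eta}{\sqrt{v} + \epsilon}\, g

\text{Adam:} \quad m \leftarrow \beta_1 m + (1-\beta_1)\, g \;\;(\text{first moment}), \qquad
v \leftarrow \beta_2 v + (1-\beta_2)\, g^{2} \;\;(\text{second moment})
```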

You have two models, Model 1 and Model 2, and train them the same way. You find that Model 1 has too few parameters to perform well. What is this called? Draw the training and validation loss.
You now severely increase the model complexity of Model 1, but Model 2 still performs better. What happened now? Draw the training and validation loss again.

Other questions:
1. Name two techniques used in deep learning where you use a dataset separate from the training dataset.
2. Name two techniques for learning rate scheduling.

The formula for a batch-normalization layer was given; calculate the partial derivatives with respect to the scaling and shifting parameters.

You decrease your mini-batch size; what do you do with your learning rate, and why?
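A common heuristic here is the linear scaling rule (an assumption about the expected answer, not a recollection): scale the learning rate in proportion to the batch size, since smaller batches give noisier gradient estimates.

```latex
\eta_{\text{new}} = \eta_{\text{old}} \cdot \frac{B_{\text{new}}}{B_{\text{old}}}
```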

In general:
- It was 17 pages of mostly short questions (90 points in total, mostly 2 per question, sometimes 4, sometimes 1).
- Time was tight, but not undoable.
