Professional Documents
Culture Documents
Which of the algorithms are not suitable for mini-batch gradient descent 1 point
algorithm?
Delta-bar-delta
Adam
RMSProp
Because pooling layers do not have parameters, they do not affect the 1 point
True
False
Clear selection
You're editing your response. Sharing this URL allows others to also edit
OPEN BLANK FORM
your response.
https://docs.google.com/forms/d/e/1FAIpQLSdgx7QHosSvcAnlCi6Pd-seOXGa9Q4krkFFB_y4ERwJ2kVv4g/viewform?edit2=2_ABaOnudqLvPN6SVgpP8-1s… 1/6
29/11/2020 Quiz 05: More on Deep Neural Networks and CNN Intro
Consider this figure. These plots were generated with gradient descent; 3 points
(1) is gradient descent. (2) is gradient descent with momentum (small β). (3) is
gradient descent with momentum (large β)
(1) is gradient descent. (2) is gradient descent with momentum (large β) . (3) is
gradient descent with momentum (small β)
(1) is gradient descent with momentum (small β), (2) is gradient descent with
momentum (small β), (3) is gradient descent
(1) is gradient descent with momentum (small β). (2) is gradient descent. (3) is
gradient descent with momentum (large β)
Clear selection
You have an input volume that is 32x32x16, and apply max pooling with a 2 points
stride of 2 and a filter size of 2. What is the output volume?
16x16x16
15x15x16
16x16x8
32x32x8
Clear selection
You're editing your response. Sharing this URL allows others to also edit
OPEN BLANK FORM
your response.
https://docs.google.com/forms/d/e/1FAIpQLSdgx7QHosSvcAnlCi6Pd-seOXGa9Q4krkFFB_y4ERwJ2kVv4g/viewform?edit2=2_ABaOnudqLvPN6SVgpP8-1s… 2/6
29/11/2020 Quiz 05: More on Deep Neural Networks and CNN Intro
Which of these is NOT a good learning rate decay scheme? Here, t is the 2 points
epoch number. lets assume alpha0 is the initial learning rate.
Clear selection
A neural network has three layers. Inputs to this layers are 32x32 size 2 points
single channel image. After the inputs are taken, there is a convolutional
layer with 32 filters each of size 3x3 followed by a fully connected layer.
The flattened output of the Convolutional Layer is fed to the fully
connected layer with 64 units. The last layer of this network is a softmax
layer with 4 units. What is the total number of learnable parameters?
You have an input volume that is 63x63x16, and convolve it with 32 filters 1 point
that are each 7x7, using a stride of 2 and no padding. What is the output
volume?
29x29x32
16x16x32
29x29x16
16x16x16
Clear selection
You're editing your response. Sharing this URL allows others to also edit
OPEN BLANK FORM
your response.
https://docs.google.com/forms/d/e/1FAIpQLSdgx7QHosSvcAnlCi6Pd-seOXGa9Q4krkFFB_y4ERwJ2kVv4g/viewform?edit2=2_ABaOnudqLvPN6SVgpP8-1s… 3/6
29/11/2020 Quiz 05: More on Deep Neural Networks and CNN Intro
The summation of the gradients accumulated might be too high to make the
algorithm too slow
Adam should be used with batch gradient computations, not with mini-batches.
What do you think this filter will do to the gray scale images? 1 point
Clear selection
You're editing your response. Sharing this URL allows others to also edit
OPEN BLANK FORM
your response.
https://docs.google.com/forms/d/e/1FAIpQLSdgx7QHosSvcAnlCi6Pd-seOXGa9Q4krkFFB_y4ERwJ2kVv4g/viewform?edit2=2_ABaOnudqLvPN6SVgpP8-1s… 4/6
29/11/2020 Quiz 05: More on Deep Neural Networks and CNN Intro
You have an input volume that is 15x15x8, and pad it using “pad=2.” What is 1 point
17x17x8
17x17x10
19x19x8
19x19x12
Clear selection
dataset. You use the following to track the temperature: v(t) =βv(t−1) +
(1−β)θ(t). The red line below was computed using β=0.9. What would
happen to your red curve as you vary β?
Decreasing beta will shift the red line slightly to the right
Increasing beta will shift the red line slightly to the right
Decreasing beta will create more oscillations within the red line
Increasing
You're editing beta will create
your response. more
Sharing this oscillations within to
URL allows others thealso
rededit
line
OPEN BLANK FORM
your response.
https://docs.google.com/forms/d/e/1FAIpQLSdgx7QHosSvcAnlCi6Pd-seOXGa9Q4krkFFB_y4ERwJ2kVv4g/viewform?edit2=2_ABaOnudqLvPN6SVgpP8-1s… 5/6
29/11/2020 Quiz 05: More on Deep Neural Networks and CNN Intro
You have an input volume that is 63x63x16, and convolve it with 32 filters 2 points
that are each 7x7, and stride of 1. You want to use a “same” convolution.
What is the padding?
Clear selection
Submit
This form was created inside of United International University. Report Abuse
Forms
You're editing your response. Sharing this URL allows others to also edit
OPEN BLANK FORM
your response.
https://docs.google.com/forms/d/e/1FAIpQLSdgx7QHosSvcAnlCi6Pd-seOXGa9Q4krkFFB_y4ERwJ2kVv4g/viewform?edit2=2_ABaOnudqLvPN6SVgpP8-1s… 6/6