Professional Documents
Culture Documents
ASSIGNMENT
ASSIGNMENT
ASSIGNMENT
6. What is a recurrent neural network (RNN), and how does it differ from other
types of neural networks? Explain the fundamental architecture of an RNN and its
applications.
7. How does the Long Short-Term Memory (LSTM) architecture address the
problem of vanishing gradients in RNNs? Explain the key components of an
LSTM cell and their roles in capturing long-term dependencies.
The LSTM model can be used to solve the vanishing gradient problem; the
weight update of the activation function converges to zero when long-term
data are learned in an RNN algorithm, which is useful for data with a long
learning period.
A common LSTM unit is composed of a cell, an input gate, an output gate and
a forget gate. The cell remembers values over arbitrary time intervals and the
three gates regulate the flow of information into and out of the cell.
There are four main components to an LSTM network: the forget gate, the
input gate, the output gate, and the cell state. The forget gate controls how
much information from the previous time step is retained in the current time
step. The input gate controls how much new information from the current
time step is added to the cell state. The output gate controls how much
information from the cell state is used to produce an output at the current
time step. And finally, the cell state is a vector that represents the “memory”
of the LSTM network; it contains information from both the previous time
step and the current time step.
8. Explain the concept of attention mechanisms in RNNs and their applications in
machine translation and image captioning. How do attention mechanisms help in
focusing on relevant parts of the input sequence while processing it?
The Attention Module takes the encoded image from the Encoder, and the
hidden state from the Sequence Decoder and computes the weighted
Attention Score. The input sequence is passed through the Embedding layer
and then combined with the Attention Score.
Attention is proposed as a method to both align and translate. Alignment is the
problem in machine translation that identifies which parts of the input
sequence are relevant to each word in the output, whereas translation is the
process of using the relevant information to select the appropriate output.
Architecture:
Applications of autoencoders
Dimensionality reduction
Image denoising
Anomaly Detection
10. What is a convolutional autoencoder (CAE), and how does it differ from a
regular autoencoder? Explain the application of CAEs in image compression and
denoising.