IAS

Day 76
DIY Solution

Q1. What are recurrent neural networks?

Answer: A recurrent neural network (RNN) is a type of artificial neural network
which uses sequential data or time-series data. These deep learning algorithms are
commonly used for ordinal or temporal problems, such as language translation,
natural language processing (NLP), speech recognition, and image captioning; they
are incorporated into popular applications such as Siri, voice search, and Google
Translate. Like feedforward and convolutional neural networks (CNNs), recurrent
neural networks utilize training data to learn. They are distinguished by their
“memory” as they take information from prior inputs to influence the current input
and output. While traditional deep neural networks assume that inputs and outputs
are independent of each other, the output of recurrent neural networks depends
on the prior elements within the sequence. While future events would also be
helpful in determining the output of a given sequence, unidirectional recurrent
neural networks cannot account for these events in their predictions.
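
The "memory" and the unidirectional limitation described above can be illustrated with a minimal sketch. It assumes a single recurrent unit with a tanh activation; the function name rnn_outputs and the weights w_in and w_rec are made up for illustration and are not taken from any library.

```python
import math

def rnn_outputs(inputs, w_in=0.5, w_rec=0.8, bias=0.0):
    """Run a one-unit recurrent network over a sequence of numbers."""
    hidden = 0.0  # the "memory"; empty before the first input (assumption)
    outputs = []
    for x in inputs:
        # The new state mixes the current input with the previous state.
        hidden = math.tanh(w_in * x + w_rec * hidden + bias)
        outputs.append(hidden)
    return outputs

a = rnn_outputs([1.0, 0.0, 0.0, 2.0])
b = rnn_outputs([1.0, 0.0, 0.0, -2.0])  # only the last input differs

# The first three outputs match: a unidirectional RNN cannot see the
# later input when it produces them.
print(a[:3] == b[:3])
# The second input is 0.0, yet the output is non-zero because the
# first input is still remembered in the hidden state.
print(a[1])
```

A bidirectional RNN would add a second pass over the sequence from right to left, which is what lets it take later elements into account.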

Q2. What is the difference between a feedforward neural network and a
recurrent neural network?

Answer: In a feedforward neural network, signals travel in one direction, from input to
output. There are no feedback loops, so the network considers only the current input.
It cannot memorize previous inputs (e.g., a CNN).

In a recurrent neural network, signals also travel back through feedback connections,
creating a looped network. It combines the current input with the previously received
inputs when generating the output of a layer, and it can memorize past data thanks to
its internal memory.
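
As a rough illustration of this contrast, the sketch below feeds the same input three times to a stateless function and to a unit with a feedback loop. The names feedforward and RecurrentUnit and all weights are hypothetical, chosen only for this example.

```python
import math

def feedforward(x, w=0.5):
    # Stateless: the output depends only on the current input.
    return math.tanh(w * x)

class RecurrentUnit:
    def __init__(self, w_in=0.5, w_rec=0.8):
        self.w_in = w_in
        self.w_rec = w_rec
        self.hidden = 0.0  # internal memory carried between calls

    def step(self, x):
        # Feedback loop: the previous hidden state is fed back in.
        self.hidden = math.tanh(self.w_in * x + self.w_rec * self.hidden)
        return self.hidden

unit = RecurrentUnit()
ff_outputs = [feedforward(x) for x in [1.0, 1.0, 1.0]]
rec_outputs = [unit.step(x) for x in [1.0, 1.0, 1.0]]

print(ff_outputs)   # the same value three times
print(rec_outputs)  # a different value at each step, since the state changes
```

The feedforward function returns the same value every time, while the recurrent unit's answer drifts as its hidden state accumulates the earlier inputs.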

Q3. What's the difference between CNN and RNN?

Answer: The main difference between a CNN and an RNN is the ability to process
temporal information, that is, data that comes in sequences, such as a sentence. Recurrent
neural networks are designed for this very purpose, while convolutional neural
networks are not built to interpret temporal information effectively. As a result,
CNNs and RNNs are typically used for distinct purposes, and the structures of the
neural networks themselves differ to fit those use cases.

CNNs employ filters within convolutional layers to transform data, whereas RNNs
are predictive, reusing activations from earlier data points in the sequence
to generate the next output in a series.
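
The two mechanisms can be compared on a toy one-dimensional signal. This is only a sketch under simplifying assumptions: conv1d is a hand-written sliding filter and recurrent_scan is a single tanh unit; neither name comes from a real framework.

```python
import math

def conv1d(signal, kernel):
    """Slide a filter over the signal; each output sees only a local window."""
    k = len(kernel)
    return [
        sum(kernel[j] * signal[i + j] for j in range(k))
        for i in range(len(signal) - k + 1)
    ]

def recurrent_scan(signal, w_in=1.0, w_rec=0.5):
    """Carry an activation forward; each output depends on all earlier inputs."""
    hidden, outputs = 0.0, []
    for x in signal:
        hidden = math.tanh(w_in * x + w_rec * hidden)
        outputs.append(hidden)
    return outputs

signal = [0.0, 1.0, 0.0, 0.0, 0.0]  # a single spike at position 1
conv_out = conv1d(signal, kernel=[1.0, -1.0])
rec_out = recurrent_scan(signal)

print(conv_out)  # non-zero only in the windows that touch the spike
print(rec_out)   # the spike keeps echoing, ever more faintly, in later steps
```

The filter responds only where its window overlaps the spike, whereas the recurrent activation carries a fading trace of the spike into every later step.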

Once you look at the structure of both types of neural networks and understand
what they are used for, the difference between CNN and RNN becomes clearer.

Q4. Advantages and disadvantages of RNN.

Answer:

Advantages of RNN

• The principal advantage of an RNN over a plain ANN is that an RNN can model a sequence
of records (i.e., a time series) so that each sample can be assumed to be dependent on
previous ones.
• Recurrent neural networks are even used with convolutional layers to extend the
effective pixel neighborhood.

Disadvantages of RNN

• Exploding and vanishing gradient problems.
• Training an RNN is a very difficult task.
• It cannot process very long sequences when tanh or ReLU is used as the activation
function.
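
The vanishing and exploding gradient problems can be seen in a small numerical sketch. It assumes a single recurrent unit with a constant input and applies the chain rule step by step; the function gradient_through_time and its parameters are hypothetical and only meant to show the trend.

```python
import math

def gradient_through_time(w_rec, steps, activation="tanh", x=0.5):
    """Return d h_T / d h_0 for h_t = act(w_rec * h_{t-1} + x)."""
    hidden, grad = 0.0, 1.0
    for _ in range(steps):
        pre = w_rec * hidden + x
        if activation == "tanh":
            hidden = math.tanh(pre)
            local = 1.0 - hidden ** 2          # derivative of tanh
        else:  # "relu"
            hidden = max(0.0, pre)
            local = 1.0 if pre > 0 else 0.0    # derivative of ReLU
        # Chain rule: each step multiplies the gradient by w_rec * local.
        grad *= w_rec * local
    return grad

vanishing = gradient_through_time(0.5, steps=50, activation="tanh")
exploding = gradient_through_time(1.5, steps=50, activation="relu")

print(vanishing)  # shrinks toward zero as the sequence gets longer
print(exploding)  # grows very large as the sequence gets longer
```

Because the gradient is a product of one factor per time step, factors below 1 make it shrink toward zero and factors above 1 make it blow up, which is why long sequences are hard to train on.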

Q5. Describe the different applications of RNN.

Answer:

1. Language Modelling and Generating Text

Taking a sequence of words as input, we try to predict the probability of the next
word. This is one of the most useful approaches for translation, since the most
likely sentence is taken to be the correct one. In this method, the probability
distribution output at a particular time-step is used to sample the word for the
next iteration.
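
The predict-then-sample loop can be sketched without a neural network at all. The example below is not an RNN: it swaps in a simple bigram count model as a stand-in for the network's predicted probabilities, and the corpus, next_word_probs, and generate are invented for illustration.

```python
import random
from collections import Counter, defaultdict

corpus = "the cat sat on the mat the cat ate the fish".split()

# Count how often each word follows each other word (bigram counts).
follow_counts = defaultdict(Counter)
for prev, nxt in zip(corpus, corpus[1:]):
    follow_counts[prev][nxt] += 1

def next_word_probs(prev):
    """Probability of each possible next word, given the previous word."""
    counts = follow_counts[prev]
    total = sum(counts.values())
    return {word: n / total for word, n in counts.items()}

def generate(start, length, seed=0):
    """Sample a word from the predicted distribution and feed it back in."""
    rng = random.Random(seed)
    words = [start]
    for _ in range(length):
        probs = next_word_probs(words[-1])
        if not probs:  # no observed continuation for this word
            break
        choices, weights = zip(*probs.items())
        words.append(rng.choices(choices, weights=weights)[0])
    return words

print(next_word_probs("the"))
print(" ".join(generate("the", 5)))
```

An RNN language model would replace the count table with a network whose hidden state summarizes the whole preceding sequence, not just the last word, but the sampling loop stays the same.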

2. Machine Translation

RNNs in one form or another can be used for translating text from one
language to another. Many translation systems have used some advanced version
of an RNN. The input is text in the source language, and the output is the
corresponding text in the target language that the user wants.

Currently one of the most popular and prominent machine translation
applications is Google Translate. There are also numerous custom recurrent
neural network applications used by various platforms to refine and confine
content. E-commerce platforms like Flipkart, Amazon, and eBay make use of
machine translation in many areas, and it also helps with the efficiency of
search results.

3. Speech Recognition

RNNs can be used for predicting phonetic segments, taking sound waves
from a medium as the input source. The set of inputs consists of phonemes or
acoustic signals from the audio, which are preprocessed before being
fed to the network. The RNN computes the phonemes and then
produces a phonetic segment along with the likelihood of that output. The steps
used in speech recognition are as follows:

• The input data is first processed and recognized through a neural network.
The result consists of a varied collection of input sound waves.

• The information contained in the sound wave is further classified by intent
and through keywords related to the query.

• The input sound waves are then classified into phonetic segments and
pieced together into cohesive words using an RNN application. The output
consists of a pattern of phonetic segments put together into a singular whole in
a logical manner.

4. Generating Image Descriptions

A combination of CNNs and RNNs is used to provide a description of what
exactly is happening inside an image. The CNN does the segmentation part, and the
RNN then uses the segmented data to generate the description.

5. Video Tagging

RNNs can be used for video search, where an image description is generated for
each of the numerous frames into which a video is divided.

6. Text Summarization

Text summarization can provide major help in summarizing content from
literature and customizing it for delivery within applications that cannot
support large volumes of text. For example, if a publisher wants to display the
summary of one of its books on the book's back page to help readers get an idea
of the content within, text summarization would be helpful.

7. Face Detection, OCR Applications as Image Recognition

Image recognition is one of the major applications of computer vision. It is also
one of the most accessible uses of RNNs to explain.

At its core, the algorithm is designed to take one unit of the image as input
and produce the description of the image in the form of multiple groups of
output.
