Professional Documents
Culture Documents
• Suźiÿect
• Ożiÿecfiwe
Scanned with
is a domain of AI that depicts the capability of a machine to get and
analyse visual information and afterwards predict some decisions about it.
a) NLP
b) Data Sciences
c) Augmented Reality
d) Computer Vision
Scanned with
is the sub-f1eld of AI that is focused on enabling computers
to understand and process human languages.
a) Deep Learning
b) Machine Learning
c) NLP
d) Data Sciences
4
Scanned with ComScanner
Expand CBT
a) Computer Behaved Training
b) Cognitive Behavioural Therapy
c) Consol1dafed Bafch of trainers
d) Combined Basic Training
Scanned with
refers to the AI modelling where the machine
learns by itself.
a) Learning Based
b) Rule Based
c) Machine Learning
d) Data Sciences
Scanned with
, the mach1ne is trained w1th huge amounts of data
which helps it in training itself around the data.
a) Machine Learning
b) Artificial Intelligence
c) NLP
d) Deep Learning
Scanned with
Define the term Machine Learning. Also give 2 applications of /ñachine
Learning in our daily lives.
10
Classification Regression
This model works on a discrete Such models work on continuous
dataset which means the data data.
need not be continuous.
11
13
14
15
16
Scanned with ComScanner
Tom is a student of grade five. He likes to move constantly at his desk. He
plays with pencils and taps his fingers, stands up in his place any time he gets a
chance. He enjoys playing basketball, and likes to play in the classroom.
Which of the following intelligence does he demonstrate?
a) Linguistic
b) Logical-Mathematical
c) Musical
d) Kinesthetic
17
18
Scanned with ComScanner
Infrared sensors detect infrared energy that is emitted by one's body heat. When
hands are placed in the proximity of the sensor, the infrared energy quickly
fluctuates. This fluctuation triggers the pump to activate and dispense the
designated amount of sanitizer. This is an example of
a) Automated machine
b) AI machine
c) Semi-automatic machine
d) Deep Learning machine
19
Scanned with ComScanner
Match Column A with Column B:
Column A Column B
Face recognition machine (i) Nnt AI
2. Automatic door
Gesture recognition
4. Automatic toy car
a) 1 -> (i) ; 2 -> (ii) ; 3 -> (i) ; 4 -> (ii)
b) 1 -> (ii) ; 2 -> (i) ; 3 -> (ii) ; 4 -> (i)
c) 1 -> (i) ; 2 -> (i) ; 3 -> (ii) ; 4 -> (i)
d) 1 -> (ii) ; 2 -> (i) ; 3 -> (i) ; 4 -> (ii)
20
21
22
23
24
25
26
27
28
Scanned with ComScanner
and are AI based applications that help us in navigation.
oogle a , Apple ma ,
29
Naturalist Intelligence
80
31
52
33
54
55
37
Any machine that has been trained with data and can make decisions/predictions on its own
can be termed as AI.
Eg: The bot or the automation machine is not trained with any data is not an AI while a chatbot
that understands and processes human language is an AI.
88
When we talk about a machine, we know that it is artificial and cannot think on its own. It can have
intelligence, but we cannot expect a machine to have any biases of its own.
Any bias can transfer from the developer to the machine while the algorithm is being developed.
3. Spatial Visual Intelligence : ability to perceive the visual world and the relationship of one object to another.
4.Kinesthetic Intelligence : ability that is related to how a person uses his limbs in a skilled manner.
5.musical Intelligence : ability to recognize and create sounds, rhythms, and sound patterns. 40
Artificial Intelligence (AI) refers to any technique that enables computers to mimic human
intelligence i.e., make decisions, predict the future, learn and improve on its own.
With respect to the type of data fed in the Al model, Al models can be broadly categorised into
three domains:
1.Data sciences: takes input in the form of numeric and alphanumeric data.
Neural networks are loosely modelled after how neurons in the human brain
behave. The features of a neural network are :
1.They are able to extract data features automatically without needing the input
of the programmer. fJ LTlS
44
45
Scanned with
Which of fhe following is not part: of the Al Project Cycle?
a) Data Exploration
b) Modelling
c) Testing
d) Problem Scoping
46
47
49
50
51
52
Problem
Scoping Data
Exploration Evaluation
Modelling
Acquisition
(b) doc
(c) csv
(d) png
58
Scanned with ComScanner
Which of the following is an application of data science?
(a) Text summarization
(b) Target Advertisements
(c) Face lock in smartphones
(d) Email filters
59
60
While accessing data from any of the data sources, following points should be kept in mind:
1. Data which is available for public usage only should be taken up.
2.Personal datasets should only be used with the consent of the owner.
4.Reliable sources of data ensure the authenticity of data which helps in the proper training of the AI
model.
61
Scanned with ComScanner
JãUUD3S D3 fl›!• pauuD3S
Give one example of an application which uses augmented reality.
63
Scanned with ComScanner
, input to machines can be photographs, videos acd pictures
from thermal or infrared sensors, indicators and different sources.
a) Computer Vision
b) Data Acquisition
c) Data Collection
d) Machine learning
64
Object Detection
65
67
Resolution of an image refers to the number of pixels in an image, across the width and height.
For example a monitor resolution of 1280•1024. This means there are 1280 pixels from one side
to the other, and 1024 from top to bottom.
68
70
71
Scanned with ComScanner
Explain the term Text Normalisation in Data Processing.
It helps in cleaning up the textual data in such a way that it comes down to a level
where its complexity is lower than the actual data.
The term used for the whole textual data from all the documents is known as corpus.
72
• Automatic Summanzation,
• Sentiment Analysis,
• Text classification,
• Virtual Assistants
73
Scanned with
Differentiate between stemming and lemmatization. Explain with the help
of an examp\e.
Stemming is the process in which the affixes of words are removed and the words are converted to
their base form.
In lemmatization, the word we get after affix removal(also known as lemma) is a meaningful one.
Lemmatization makes sure that lemma is a word with meaning and hence it takes a longer time to
execute than stemming.
74
75
46 tokens
76
77
78
Scanned with ComScanner
“Automatic summarization is used in NLP applications". Is the given statement
correct? Justify your answer with an example.
79
80
TEXT CLASSIFICATION
81
82
Scanned with ComScanner
Ayushi was learning about NLP. She wanted to know the term used for the
whole textual data from all the documents altogether. Help her in identifying the term
used for it.
83
Scanned with ComScanner
What is f TF-IDF?
th
Term requency Inverse ocument re uency
84
Scanned with ComScanner
A corpus contains 12 documents. How many document vectors will be there for
that corpus?
a. 12
b. 1
c.24
d. 1/12
85
Scanned with ComScanner
Identify the type of chatbot with the information given below:
These bots work on pre-programmed instructions inside the
application/machine and are generallyeasy to develop. They are deployed in
the customer care section of various companies. Their job is to answer some
basic queries that they are coded for and connect them to human executives
once they are unable to handle the conversation.
Script bot
86
87
Scanned with ComScanner
What do we get from the “bag of algorithm?
words
88
Scanned with ComScanner
Samiksha, a student of class X was exploring the Natural Language
Processing domain. She got stuck while performing the text normalisation.
Help her to normalise the text on the segmented sentences given below:
1.Tokenisation:
Akash, and, Ajay, are, best, friends |Akash, likes, to, play, football, but, Ajay, prefers, to, play, online, games
2.Removal of stopwords
Akash, Ajay, best, friends Akash, likes, play, football, Ajay, prefers, play, online, games
4.Stemming/Lemmatisation
akash, ajay, best, friend ahash, like, play, football, ajay, prefer, play, online, game 89
Scanned with
is defined as the percentage of correct predictions out of all
observations. the
a) Predictions
b) Accuracy
c) Reality
d) F1 Score
90
Scanned with
What will be the outcome, if the Prediction is “Yes” and it matches
with the Reality? What will be the outcome, if the Prediction is “Yes”
and it does not match the Reatity7
a) True Positive, True Negative
b) True Negative, False Negative
c) True Negative, False Positive
d) True Positive, False Positive
91
92
93
Scanned with ComScanner
Which of the following statements is true for the term Evaluation?
a) Helps in classifying the type and genre of a document.
b) It helps in predicting the topic for a corpus.
c) Helps in understanding the reliability of any AI model
d) Process to extract the impo1ant information out of a corpus.
94
95
Scanned with
What is F1 Score in Evatuation?
re -* 0
0
Scanned with
96
Scanned with
Imagine that you have come up with an AI based prediction model
which has been deployed on the roads to check traffic jams. Now, the
objective of the model is to predict whether there will be a traffic jam
or not. Now, to understand the efficiency of this model, we need to Case 1: Is there a traffic Jam?
check if the predictions which it makes are correct or not. Thus, there Prediction: Yes Reality: Yes
exist two conditions which we need to ponder upon: Prediction and True Positive
Reality. Case 2: Is there a traffic
Jam? Prediction: No Reality:
Traffic Jams have become a common part of our lives nowadays. Living in
No
an urban area means you have to face traffic each and every time you get
True Negative
out on the road. Mostly, school students opt for buses to go to school.
Many times, the bus gets late due to such jams and the students are not Case 3: Is there a traffic Jam?
able to reach their school on time. Prediction: Yes Reality: No
False Positive
Considering all the possible situations make a Confusion Matrix for the Case 4: Is there a traffic
above situation. Jam? Prediction: No
Reality: Yes
False Negative
Scanned with
Yes No
Yes True Positive False
Prediction
No False Positive
97
Negative
Scanned with
What should be the value of F1 score if the model needs to have 100% accuracy?
98
Scanned with
Give an example of a situation wherein false positive would have a high cost associated
with it.
• If the model always predicts that the mail is spam, people would
not look at it and eventually might lose important information.
• Here False Positive condition (Predicting the mail as spam while the
mail is not spam) would have a high cost.
99
Scanned with ComScanner
What “is a confusion matrix? What is it used for?
100
Stop ›'‹ords
”
.. Rare / Valuable
words
Value
As shown in the graph, occurrence and value of a word are inversely proportional.
The words which occur most (like stop words) have negligible value.
As the occurrence of words drops, the value of such words rises. These words are termed as rare or valuable
words. These words occur the least but add the most value to the corpus
1 01
Yes
1 02
103
Scanned with ComScanner
is used to record the result of comparison between the prediction
and reality. It "is not an evaluation metric b re whiE/ Can help in evaluation
onfusion atrix
104
105
106
Scanned with ComScanner
Priya was confused with the terms used in the evaluation stage. Suggest her the term
used for the percentage of correct predictions out of all the observations.
(a) Accuracy
(b) Precision
(c) Recall
(d) F1Score
107
• Focusing only on positive predictions — Storm is coming: can lead to farmers delaying their crop if not
accurate.
• Focusing only on negative predictions — Storm is not coming can lead to damaged crop if not accurate.
• The best approach is to balance both accuracy and catching important events. This is what the F1 Score
measures.
1 08
(i) How many total tests have been performed in the above scenario? Recall=TP/(TP+FN) =60/(60+5) =60/65 =0.92
(ii) Calculate precision, recall and F1 Score.
F1 Score=2*Precision*Recall/(Precision+Recall)
2*0.7*0.92/(0.7+0.92)
0.79
Scanned with ComScanner
109