You are on page 1of 18

18/12/2022, 20:34 Quiz 1: Introduction to Data Science (S1-22_DSECLZG532)

Quiz 1
Due Dec 19 at 23:59 Points 10 Questions 40
Available Dec 18 at 19:00 - Dec 19 at 23:59 Time Limit 60 Minutes

Instructions
The Quiz I can be attempted only once. You will not be provided with a  make-up Quiz, if you miss
this Quiz.

The quiz is timed for one hour and has to be completed once you start. You will not be allowed to go
back to the previous question if you skip a question. 

“Choose the most appropriate answer to each question.”

The answers will be visible only after three days once the quiz has ended.

All the best.

Attempt History
Attempt Time Score
LATEST Attempt 1 60 minutes 6.5 out of 10

 Correct answers will be available on Dec 22 at 0:00.

Score for this quiz: 6.5 out of 10


Submitted Dec 18 at 20:33
This attempt took 60 minutes.

Question 1 0.25 / 0.25 pts

Which of the following are correct skills for a Data Scientist?

  machine learning/deep learning

  all of the options

  data wrangling

https://bits-pilani.instructure.com/courses/1704/quizzes/3453 1/18
18/12/2022, 20:34 Quiz 1: Introduction to Data Science (S1-22_DSECLZG532)

  probability and statistics

Incorrect Question 2 0 / 0.25 pts

R and Python is not a preferred skill for

  Data Analyst

  Data Scientist

  Data Architect

  Data Journalist

Question 3 0.25 / 0.25 pts

Statement 1: Business Intelligence involves analyzing past data and


reporting on it.

Statement 2: Descriptive analysis involves analyzing past data and


reporting on it.

Which of the following is right?

  Both statements are correct

  Both statements are wrong

  Statement 1 is correct and Statement 2 is wrong

  Statement 1 is wrong and Statement 2 is correct

Question 4 0.25 / 0.25 pts

https://bits-pilani.instructure.com/courses/1704/quizzes/3453 2/18
18/12/2022, 20:34 Quiz 1: Introduction to Data Science (S1-22_DSECLZG532)

Data scientist is not responsible for 

  building continuous data stream

  data manipulation

  data mining

  data analytics

Question 5 0.25 / 0.25 pts

Which of the following is not a application for data science?

  Online Price Comparison

  Image & Speech Recognition

  Privacy Checker

  Recommendation Systems

Question 6 0.25 / 0.25 pts

For an organization, having not much data analytics need and just
embarking on the analytics path will most likely structure its data team
in a

  Centralized model

  Federated model

  Consulting model

https://bits-pilani.instructure.com/courses/1704/quizzes/3453 3/18
18/12/2022, 20:34 Quiz 1: Introduction to Data Science (S1-22_DSECLZG532)

  Functional model

Question 7 0.25 / 0.25 pts

Suppose that the minimum and maximum values for an attribute are 4.3
and 7.6, respectively. Compute the scaled value of 5.4 if min-max
normalization is applied to scale [0.0,1.0]. (Answer should have a
precision of X.XX]

  0.33

  0.43

  0.36

  0.45

Question 8 0.25 / 0.25 pts

Compute the Jaccard's Co-efficient for x = (1,0,0,0,1,1,1) and y =


(0,1,1,0,0,1,0)

  0.765

  0.166

  0.455

  0.234

Question 9 0.25 / 0.25 pts

https://bits-pilani.instructure.com/courses/1704/quizzes/3453 4/18
18/12/2022, 20:34 Quiz 1: Introduction to Data Science (S1-22_DSECLZG532)

Compute the width of each bin for data given below, if the number of
bins is 5.

[39, 45, 49, 45, 31, 37, 38, 41, 37, 41, 39, 34, 35, 30, 47, 43, 44, 46, 48,
36]

 6

 4

(49−30)÷5

 3

 5

Question 10 0.25 / 0.25 pts

Suppose that the minimum and maximum values for an attribute are 4.3
and 7.6, respectively. Compute the scaled value of 5.4 if min-max
normalization is applied to scale [0.0,1.0]. (Answer should have a
precision of X.XX]

  0.33

  0.29

  0.30

  0.38

Incorrect Question 11 0 / 0.25 pts

The most time-consuming phase in a data science process is

https://bits-pilani.instructure.com/courses/1704/quizzes/3453 5/18
18/12/2022, 20:34 Quiz 1: Introduction to Data Science (S1-22_DSECLZG532)

  Deployment

  Data Modelling

  Data preparation

  Data collection

Question 12 0.25 / 0.25 pts

Which of the following is not a type of predictive analytics?

  What is the student's performance in the next question?

  average attendance of the students in the current semester?

  Which course will the student take in the next semester?

 
what is the average score of all students in the CBSE 10th Math Exam

Incorrect
Question 13 0 / 0.25 pts

Which of the following methodologies focus the most on model


deployment and embedding in operational systems?

  CRISP-DM

  SEMMA

  SMAM

  All options are correct

https://bits-pilani.instructure.com/courses/1704/quizzes/3453 6/18
18/12/2022, 20:34 Quiz 1: Introduction to Data Science (S1-22_DSECLZG532)

Question 14 0.25 / 0.25 pts

BITS investigating to determine the cause for decreased admissions for


CSI PG program is an example of

  Descriptive analysis

  Prescriptive analysis

  Diagnositic Analysis

  Predictive analysis

Question 15 0.25 / 0.25 pts

Identify the data analytics task for the following scenario. A physics
teacher is analyzing the answer scripts of the students to identify the
areas that he/she should concentrate on so that the students
understand the concepts better.

  Predictive Analytics

  Diagnostic Analytics

  Descriptive Analytics

  Prescriptive Analytics

Question 16 0.25 / 0.25 pts

Which of the following statement is false with respect to data set?

https://bits-pilani.instructure.com/courses/1704/quizzes/3453 7/18
18/12/2022, 20:34 Quiz 1: Introduction to Data Science (S1-22_DSECLZG532)

 
Sub setting can be used to select and exclude variables and
observations

 
Merging concerns combining datasets on the same observations to
produce a result

  Raw data should be processed only one time.

  All of the listed options

Incorrect
Question 17 0 / 0.25 pts

A police team wants to predict the crime rate in a locality based on


certain attributes. Which modelling technique would be appropriate

  Optimization

  Classification

  Regression

  Clustering

Question 18 0.25 / 0.25 pts

Google tries to differentiate emails as spam and non-spam, this is an


example of

  Clustering

  Association Rule

https://bits-pilani.instructure.com/courses/1704/quizzes/3453 8/18
18/12/2022, 20:34 Quiz 1: Introduction to Data Science (S1-22_DSECLZG532)

  Classification

  Regression

Incorrect
Question 19 0 / 0.25 pts

Which of the following data science project step is the most critical step
for the success of the project?

  Model Selection

  Model Evaluation

  Model Building

  Data preprocessing

Question 20 0.25 / 0.25 pts

The answer to following question can be obtained by which type of


analytics?
"Whats the best that can happen?"

  Diagnostic Analytics

  Descriptive Analytics

  Prescriptive Analytics

  Predictive Analytics

Question 21 0.25 / 0.25 pts

https://bits-pilani.instructure.com/courses/1704/quizzes/3453 9/18
18/12/2022, 20:34 Quiz 1: Introduction to Data Science (S1-22_DSECLZG532)

A company wants to find the target segment of people for one of its
products. Which modelling technique would be generally appropriate

  Classification

  Clustering

  Regression

  Ranking

Question 22 0.25 / 0.25 pts

Identify the data analytics task for the following scenario. A student is
analyzing various blogs and vlogs to find out the skill set that has to be
acquired to become a data scientist in the future. 

  Diagnostic Analytics

  Descriptive Analytics

  Predictive Analytics

  Prescriptive Analytics

Question 23 0.25 / 0.25 pts

Which of the following Python library is required for web scraping?

  WebSpider

  WebCrawler

  BeautifulSoup

https://bits-pilani.instructure.com/courses/1704/quizzes/3453 10/18
18/12/2022, 20:34 Quiz 1: Introduction to Data Science (S1-22_DSECLZG532)

  Scraper

Incorrect Question 24 0 / 0.25 pts

As part of a survey in a large organization, one of the features that you


capture is designation. This type of data has the characteristic

  None of the given answers

  Discrete, Qualitative, Ordinal

  Nominal, Quantitative, Discrete

  Discrete, Quantitative, Ordinal

Question 25 0.25 / 0.25 pts

A dataset can contain

  Both Quantitative and Qualitative Values

  None of the answers

  Quantitative values

  Qualitative Values

Question 26 0.25 / 0.25 pts

Which of the following properties are supported by interval attribute.

   P)  Distinctness
https://bits-pilani.instructure.com/courses/1704/quizzes/3453 11/18
18/12/2022, 20:34 Quiz 1: Introduction to Data Science (S1-22_DSECLZG532)

   Q)  Order

   R)   Meaningful differences

   S)  Meaningful ratios

  P and R

  P and S

  P,Q and R

  All the options are correct

Question 27 0.25 / 0.25 pts

In a dataset, CarColor is one of the attributes and it can take the


following values {Red, Green, Yellow, Black}, what type of attribute is
CarColor?

  Interval attribute

  Ordinal attribute

  Nominal attribute

  Ratio attribute

Question 28 0.25 / 0.25 pts

Raw data should be processed only one time.

  True

  False

https://bits-pilani.instructure.com/courses/1704/quizzes/3453 12/18
18/12/2022, 20:34 Quiz 1: Introduction to Data Science (S1-22_DSECLZG532)

Incorrect
Question 29 0 / 0.25 pts

Which statement best compares histogram and 5-number summary

  5-number summary can be used for non-numeric data

  Histogram is always more informative on data distribution

  Histogram can be very informative with finer ranges

  5-number summary is robust w.r.t. noise and outliers

Incorrect
Question 30 0 / 0.25 pts

Consider the following Python code. 


input=['Havells','Philips','Syska','Eveready','Lloyd']
le = sklearn.preprocessing.LabelEncoder()
le.fit(input)
print(le.transform('Lloyd'))
The last line of the code snippet will print

 2

 4

 1

 3

Question 31 0.25 / 0.25 pts

https://bits-pilani.instructure.com/courses/1704/quizzes/3453 13/18
18/12/2022, 20:34 Quiz 1: Introduction to Data Science (S1-22_DSECLZG532)

For a data analytics task to analyse feedback on her subject for a class
of 60 students, a school teacher decided to use the survey submitted
by the ten students who come for tuitions for that subject, at her home.
Identify the type of sampling she is doing.

  Non Probabilistic sampling

  Systematic Sampling

  Sampling without replacement

  Stratified sampling

Incorrect
Question 32 0 / 0.25 pts

Which of the statement is TRUE ?

  Outliers should be addressed in the test dataset

  Outliers should always be addressed in the dataset

  Outliers should be addressed only in the training dataset

  Treatment of outliers depends on the problem statement

Incorrect
Question 33 0 / 0.25 pts

The statistical description (x1,x2, . . . ,xN)/N, for the data values x1,x2, .
. . ,xN is called as their ________________

  mean

https://bits-pilani.instructure.com/courses/1704/quizzes/3453 14/18
18/12/2022, 20:34 Quiz 1: Introduction to Data Science (S1-22_DSECLZG532)

  IQR

  median

  mode

Incorrect Question 34 0 / 0.25 pts

Exploratory data analysis does not help in

  Finding out the data type of a variable

  In univariate and bivariate variable analysis

  Derivation of new attributes

  Finding statistical estimates of a variable

Question 35 0.25 / 0.25 pts

Which of the following statements are true with respect to data quality
issues?

  The given data set should not miss any values or attributes

  All of the above

 
Pre-processing of data is required to address the problems of
inconsistency, incompleteness.

 
If data are not updated time to time there will be a negative impact on
data quality.

https://bits-pilani.instructure.com/courses/1704/quizzes/3453 15/18
18/12/2022, 20:34 Quiz 1: Introduction to Data Science (S1-22_DSECLZG532)

Incorrect
Question 36 0 / 0.25 pts

Consider the following Python code.  


lb = sklearn.preprocessing.LabelBinarizer()
print(lb.fit_transform(['yes', 'no', 'no', 'yes']))
The last line of the code snippet will print

  [0,1,1,0]

  [true, false, false, true]

  [True, False, False, True]

  [1,0,0,1]

Question 37 0.25 / 0.25 pts

Identify the false statement.

 
As a data scientist, while cleaning the data, you are not concerned with
why the data values are missing. You just fix them.

 
As a data scientist, even if you know that certain outliers are valid data,
you might still omit them from model construction.

  It may not be a good idea to drop a data field with missing values.

 
As a data scientist, even if you know that certain outliers are valid data,
you might still omit them from model construction.

https://bits-pilani.instructure.com/courses/1704/quizzes/3453 16/18
18/12/2022, 20:34 Quiz 1: Introduction to Data Science (S1-22_DSECLZG532)

Incorrect
Question 38 0 / 0.25 pts

A data object is being described by a categorical attribute having four


categories, then for data analysis purpose if we want to transform the
attributes into numerical values, then

  A. it can’t be done

  C. it can be done by encoding using only 4 binary variables

  B. it can be done by encoding using only 3 binary variables

  D. it can be done by encoding using 3 or 4 binary variables

Question 39 0.25 / 0.25 pts

In a boxplot, where Q1, Q2 and Q3 are the first, second and third
quartiles respectively, the interquartile range IQR is calculated as:

  None of the above

  IQR = Q3 – Q1

  IQR = Q2-Q1

  IQR = Q3-Q2

Incorrect
Question 40 0 / 0.25 pts

The____ and standard deviation are strongly affected by outliers

  MEAN

https://bits-pilani.instructure.com/courses/1704/quizzes/3453 17/18
18/12/2022, 20:34 Quiz 1: Introduction to Data Science (S1-22_DSECLZG532)

  RANGE

  MEDIAN

  MODE

Quiz Score: 6.5 out of 10

https://bits-pilani.instructure.com/courses/1704/quizzes/3453 18/18

You might also like