You are on page 1of 18

12/18/22, 8:59 PM Quiz 1: Introduction to Data Science (S1-22_DSECLZG532)

Quiz 1
Due Dec 19 at 23:59 Points 10 Questions 40
Available Dec 18 at 19:00 - Dec 19 at 23:59 Time Limit 60 Minutes

Instructions
The Quiz I can be attempted only once. You will not be provided with a  make-up Quiz, if you miss
this Quiz.

The quiz is timed for one hour and has to be completed once you start. You will not be allowed to go
back to the previous question if you skip a question. 

“Choose the most appropriate answer to each question.”

The answers will be visible only after three days once the quiz has ended.

All the best.

Attempt History
Attempt Time Score

LATEST Attempt 1 60 minutes 7.71 out of 10

 Correct answers will be available on Dec 22 at 0:00.

Score for this quiz: 7.71 out of 10


Submitted Dec 18 at 20:58
This attempt took 60 minutes.

Question 1 0.25 / 0.25 pts

Due to market expectations, businesses are having difficulty retaining


highly trained data scientists and engineers.

  False

  No answer text provided.

  True

https://bits-pilani.instructure.com/courses/1704/quizzes/3453 1/18
12/18/22, 8:59 PM Quiz 1: Introduction to Data Science (S1-22_DSECLZG532)

  No answer text provided.

Question 2 0.25 / 0.25 pts

Data Science project steps are highly linear.

  True

  False

Question 3 0.25 / 0.25 pts

Which of the following is correct skills for a Data Scientist?

  Probability & Statistics

  Machine Learning / Deep Learning

  Data Wrangling

  All of the options

Partial Question 4 0.08 / 0.25 pts

Which of the following are the reasons for the sudden growth of
analytics?

https://bits-pilani.instructure.com/courses/1704/quizzes/3453 2/18
12/18/22, 8:59 PM Quiz 1: Introduction to Data Science (S1-22_DSECLZG532)

 
Large number of user friendly analytics tools available for data
processing

  Data is growing at 40% compound annual rate

  Cost of storage has hugely dropped

  Large number of analysts available in the market

Question 5 0.25 / 0.25 pts

Data scientist is not responsible for

  Building a continuous data stream

  Data manipulation

  Data mining

  Data Analytics

Question 6 0.25 / 0.25 pts

Tableau software is most likely to be used during

  presentation of idea to stakeholders

  Feature selection

  Data warehousing

  Deployment

https://bits-pilani.instructure.com/courses/1704/quizzes/3453 3/18
12/18/22, 8:59 PM Quiz 1: Introduction to Data Science (S1-22_DSECLZG532)

Question 7 0.25 / 0.25 pts

Compute the width of each bin for data given below, if the number of
bins is 5.

[39, 45, 49, 45, 31, 37, 38, 41, 37, 41, 39, 34, 35, 30, 47, 43, 44, 46,
48, 36]

 5

 4

(49−30)÷5

 6

 3

Question 8 0.25 / 0.25 pts

The value 56739 when scaled into Decimal Normalization will be


_________.

  56.739

  5673.9

  5.6739

  567.39

Question 9 0.25 / 0.25 pts

https://bits-pilani.instructure.com/courses/1704/quizzes/3453 4/18
12/18/22, 8:59 PM Quiz 1: Introduction to Data Science (S1-22_DSECLZG532)

Suppose the Lab administrator measured the power consumption of


an entire network operations centre (NOC) and the set of consumption
details is 90 W, 104 W, 98 W, 98 W, 105 W, 92 W, 102 W, 100 W, 110
W, 98 W, 210 W and 115 W.What is the mode power consumption?

  150W

  100W

  90W

  98W

Question 10 0.25 / 0.25 pts

Suppose that the mean and standard deviation of the values for an
attribute are 8.9 and 6.5, respectively. Apply z-score normalization to a
value of 3.2. [Answer should have a precision of X.XXXX]

  -0.8679

  -0.8769

  0.8679

  0.8769

Question 11 0.25 / 0.25 pts

In _________ phase, final report/technical document of process is


prepared

https://bits-pilani.instructure.com/courses/1704/quizzes/3453 5/18
12/18/22, 8:59 PM Quiz 1: Introduction to Data Science (S1-22_DSECLZG532)

  None

  Operationalize

  Model building

  Model planning

Question 12 0.25 / 0.25 pts

In 2001, Big Data created the three Vs: volume, velocity, and variety.
The V's have grown to encompass veracity and value in the years
since. Big data is sometimes subjected to a fifth V, which is:

  Vector

  Variability

  Vulnerability

  Volatile

Question 13 0.25 / 0.25 pts

Match with the most appropriate answer related to analytics.

Descriptive Analytics   what happened

Diagnostic Analytics   why happened

Predictive Analytics   what will happened

https://bits-pilani.instructure.com/courses/1704/quizzes/3453 6/18
12/18/22, 8:59 PM Quiz 1: Introduction to Data Science (S1-22_DSECLZG532)

Prescriptive Analytics   what can make it happ

Question 14 0.25 / 0.25 pts

BITS investigating to determine the cause for decreased admissions for


CSI PG program is an example of

  Predictive analysis

  Prescriptive analysis

  Diagnositic Analysis

  Descriptive analysis

Question 15 0.25 / 0.25 pts

Identify the data analytics task for the following scenario. A


pharmaceutical organization is developing a new drug or vaccine to
compact Covid-19 using machine learning techniques where the data is
from the existing drugs and the diseases it can fight or cure. 

  Descriptive Analytics

  Predictive Analytics

  Prescriptive Analytics

  Diagnostic Analytics

https://bits-pilani.instructure.com/courses/1704/quizzes/3453 7/18
12/18/22, 8:59 PM Quiz 1: Introduction to Data Science (S1-22_DSECLZG532)

Question 16 0.25 / 0.25 pts

Identify the data analytics task for the following scenario. The sales of
various products of your organization per month per geographical area
is reported using an interactive visual tool. 

  Descriptive Analytics

  Prescriptive Analytics

  Predictive Analytics

  Diagnostic Analytics

Question 17 0.25 / 0.25 pts

Which of the following are classification problems?

 
Finding the shorter path between two already-existing routes between
two locations.

 
Predict traffic congestion along a specific route between two locations
using vehicle journey times.

 
Calculating a room's temperature (in Celsius) based on other
environmental factors (such as atmospheric pressure, humidity etc).

  filtering spam from mails

 
Predicting if a cricket player is a batsman or bowler, given his playing
record.

https://bits-pilani.instructure.com/courses/1704/quizzes/3453 8/18
12/18/22, 8:59 PM Quiz 1: Introduction to Data Science (S1-22_DSECLZG532)

Question 18 0.25 / 0.25 pts

A scenario where you feel unwell and go to a doctor. The doctor asks
you questions like were you exposed to rain or cold climate or did you
have contact with a sick person or did you have food from outside etc.
Based on your answers,doctor came to the conclusion. This can be
considered analogous to which stage of data analytics?

  Diagnostic

  Descriptive

  Predictive

  Prescriptive

Incorrect
Question 19 0 / 0.25 pts

The 5 stages of KDD process is


1.Selection
2.Preprocessing
3.Tranformation
4.Data Mining
5.Interpretaion/Evaluation.

Identify the CRISP DM phases that corresponds to Stage 3 and 4 of


KDD.

  Data Preparation,Modeling

  Data Understanding,Evaluation

  Modeling,Data Preparation

https://bits-pilani.instructure.com/courses/1704/quizzes/3453 9/18
12/18/22, 8:59 PM Quiz 1: Introduction to Data Science (S1-22_DSECLZG532)

  Evaluation,Business understanding

Question 20 0.25 / 0.25 pts

Match with the most appropriate answer, related to the tools


available to a Data Scientist.

Cassandra   Big-data

Tableau   Visualization

SAS   Statistics

Weka   Machine Learning

Incorrect
Question 21 0 / 0.25 pts

Which one of the following statement(s) is correct (Choose the most


appropriate answer)?

  All the statements

  None of the statements

 
Data analytics is the pursuit of extracting meaning from raw data using
specialized computer systems.

 
Data Analytics refers to the techniques used to analyze data to enhance
productivity and business gain.

https://bits-pilani.instructure.com/courses/1704/quizzes/3453 10/18
12/18/22, 8:59 PM Quiz 1: Introduction to Data Science (S1-22_DSECLZG532)

 
Analytics is a process in which a computer examines information using
mathematical methods to find useful patterns.

Incorrect
Question 22 0 / 0.25 pts

Which of the following statements are true about data cleaning?

  It focuses on removing inaccurate data from your data set

  All of the given options

 
It focuses on transforming the data’s format by converting raw data into
another format

  It enhances the data’s accuracy and integrity

Question 23 0.25 / 0.25 pts

Which of the following Python library is required for web scraping?

  WebSpider

  BeautifulSoup

  Scraper

  WebCrawler

Question 24 0.25 / 0.25 pts

https://bits-pilani.instructure.com/courses/1704/quizzes/3453 11/18
12/18/22, 8:59 PM Quiz 1: Introduction to Data Science (S1-22_DSECLZG532)

Which of the following is an example of raw data?

  original swath files generated from a sonar system

  all of the mentioned

  a real-time GPS-encoded navigation file

  initial time-series file of temperature values

Question 25 0.25 / 0.25 pts

Is it possible to convert an interval variable to an Ordinal Variable

  True

  False

Question 26 0.25 / 0.25 pts

In a FashionStore Data set the feature Jacket_Shade { Grey,brown,


black, Indigo, Beige , Khaki} is an example of

  Ordinal attribute

  Continuous attribute

  Numeric attribute

  Nominal attribute

https://bits-pilani.instructure.com/courses/1704/quizzes/3453 12/18
12/18/22, 8:59 PM Quiz 1: Introduction to Data Science (S1-22_DSECLZG532)

Incorrect Question 27 0 / 0.25 pts

As part of a survey in a large organization, one of the features that you


capture is designation. This type of data has the characteristic

  Nominal, Quantitative, Discrete

  Discrete, Quantitative, Ordinal

  None of the given answers

  Discrete, Qualitative, Ordinal

Question 28 0.25 / 0.25 pts

Raw data should be processed only one time.

  True

  False

Question 29 0.25 / 0.25 pts

In which phase, the duplicates of the data are removed? Choose the
best possible answer.

  Data Requirements

  Data Preparation

  Data Understanding

https://bits-pilani.instructure.com/courses/1704/quizzes/3453 13/18
12/18/22, 8:59 PM Quiz 1: Introduction to Data Science (S1-22_DSECLZG532)

  Data Collection

Partial Question 30 0.13 / 0.25 pts

Match the following function usage in Python used in data cleaning.

dropna()   return Index without NA

fillna()   to fill NA/NaN values u

interpolate()   to find missing values

notnull()   to fill NA values in the

Question 31 0.25 / 0.25 pts

A table contains the salary details of professionals in different fields,


categorized by field. The table has got 100,000 rows. Around 10% of
the rows do not have salary data. It is required to fill in the missing
salary data as part of data pre-processing. Choose from below the best
method for this:

 
Find the mean salary of the available 90% of the data and use that to fill
in all the missing data.

 
Find the field wise mean salary for the available data and fill in the
missing salary data with the applicable mean salary.

https://bits-pilani.instructure.com/courses/1704/quizzes/3453 14/18
12/18/22, 8:59 PM Quiz 1: Introduction to Data Science (S1-22_DSECLZG532)

  Guess the missing data manually.

  Delete the rows where salary data is missing.

Incorrect Question 32 0 / 0.25 pts

A data object is being described by a categorical attribute having four


categories, then for data analysis purpose if we want to transform the
attributes into numerical values, then

  D. it can be done by encoding using 3 or 4 binary variables

  A. it can’t be done

  C. it can be done by encoding using only 4 binary variables

  B. it can be done by encoding using only 3 binary variables

Incorrect Question 33 0 / 0.25 pts

One Hot Encoding scales well as the number of class labels increases

  Most of the time

  None of the given statements

  The statement is false

  Some of the time

Question 34 0.25 / 0.25 pts

https://bits-pilani.instructure.com/courses/1704/quizzes/3453 15/18
12/18/22, 8:59 PM Quiz 1: Introduction to Data Science (S1-22_DSECLZG532)

Dealing with missing values during data preparation is what kind of an


operation

  Data cleansing

  Data retrieval

  Data combining

  Data transformation

Question 35 0.25 / 0.25 pts

Which among the following are valid methods of handling missing data

       P. Eliminating Data Objects

      Q. Estimating Missing Values

      R. Ignoring the Missing Values during Analysis

      S. Replacing with all possible values

  Q and R

  P, Q and R

  R only

  All the options are correct

Incorrect Question 36 0 / 0.25 pts

Identify the sampling technique used in the following use case. For ML
classification task, the algorithm requires that the test set has equal
examples from the three categories.

https://bits-pilani.instructure.com/courses/1704/quizzes/3453 16/18
12/18/22, 8:59 PM Quiz 1: Introduction to Data Science (S1-22_DSECLZG532)

  Stratified Random

  Systematic Sampling

  Simple Random

  Cluster Sampling

Question 37 0.25 / 0.25 pts

Data Integration is a 

  Data Normalization Technique

  Generalization technique

  None of the answers

  Pre-processing technique

Question 38 0.25 / 0.25 pts

Which of the following methods are considered to be the best practice


for data cleaning?

  All of the given options

  cleansing large dataset without segmentation

  Sorting data by attributes

  By breaking large dataset into small data

https://bits-pilani.instructure.com/courses/1704/quizzes/3453 17/18
12/18/22, 8:59 PM Quiz 1: Introduction to Data Science (S1-22_DSECLZG532)

Question 39 0.25 / 0.25 pts

A sample is ------------------ if it has approximately the same property of


interest

  Probabilistic

  Systamatic

  Qualitative

  Representative

Incorrect Question 40 0 / 0.25 pts

Exploratory data analysis does not help in

  Finding out the data type of a variable

  In univariate and bivariate variable analysis

  Finding statistical estimates of a variable

  Derivation of new attributes

Quiz Score: 7.71 out of 10

https://bits-pilani.instructure.com/courses/1704/quizzes/3453 18/18

You might also like