
1. Point out the correct statement.

a) Raw data is original source of data


b) Preprocessed data is original source of data
c) Raw data is the data obtained after processing steps
d) None of the mentioned
Answer: a
Explanation: Raw data is the original source of data, before any processing steps have been applied.

2. Which of the following is performed by a Data Scientist?


a) Define the question
b) Create reproducible code
c) Challenge results
d) All of the mentioned
Answer: d
Explanation: A data scientist is a job title for an employee or business intelligence (BI) consultant who excels at analyzing data,
particularly large amounts of data.

3. Which of the following is one of the key data science skills?


a) Statistics
b) Machine Learning
c) Data Visualization
d) All of the mentioned
Answer: d
Explanation: Data visualization is the presentation of data in a pictorial or graphical format.

4. Which of the following is a characteristic of Processed Data?


a) Data is not ready for analysis
b) All steps should be noted
c) Hard to use for data analysis
d) None of the mentioned
Answer: b
Explanation: Processing includes merging, summarizing and subsetting data.
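The three processing steps named in this explanation can be sketched in a few lines. This is only an illustrative sketch using the Python standard library; the `customers` and `orders` records, their field names, and their values are invented for the example.

```python
from statistics import mean

# Hypothetical raw records; all names and values are made up for illustration.
customers = [
    {"id": 1, "name": "Ana", "region": "north"},
    {"id": 2, "name": "Ben", "region": "south"},
    {"id": 3, "name": "Cy", "region": "north"},
]
orders = [
    {"customer_id": 1, "amount": 20.0},
    {"customer_id": 1, "amount": 30.0},
    {"customer_id": 2, "amount": 10.0},
    {"customer_id": 3, "amount": 40.0},
]

# Merging: attach the customer fields to each order through the shared key.
by_id = {c["id"]: c for c in customers}
merged = [{**o, **by_id[o["customer_id"]]} for o in orders]

# Subsetting: keep only some observations (rows) and some variables (columns).
north = [
    {"name": r["name"], "amount": r["amount"]}
    for r in merged
    if r["region"] == "north"
]

# Summarizing: mean order amount per region.
amounts_by_region = {}
for r in merged:
    amounts_by_region.setdefault(r["region"], []).append(r["amount"])
summary = {region: mean(vals) for region, vals in amounts_by_region.items()}
```

Keeping each step as a separate, named statement also makes it easy to note every step, which is the characteristic of processed data asked about in this question.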

5. Raw data should be processed only one time.


a) True
b) False
Answer: b
Explanation: Raw data may only need to be processed once, but it is not limited to a single pass.

6. Which of the following is/are correct types of data?


A. Semi-structured Data

B. Unstructured Data

C. Semi Data

D. Both A and B

Ans : D

7. Data Analysis is a process of?


A. inspecting data
B. cleaning data
C. transforming data
D. All of the above

Ans : D

Explanation: Data Analysis is a process of inspecting, cleaning, transforming and modeling data with the goal of discovering useful
information, suggesting conclusions and supporting decision-making.

8. Which of the following is not a major data analysis approach?
A. Data Mining
B. Predictive Intelligence
C. Business Intelligence
D. Text Analytics

Ans : B

Explanation: Predictive Analytics, not Predictive Intelligence, is a major data analysis approach.

9. How many main statistical methodologies are used in data analysis?


A. 2
B. 3
C. 4
D. 5

Ans : A

Explanation: In data analysis, two main statistical methodologies are used: descriptive statistics and inferential statistics.

10. Data Analysis is defined by the statistician?


A. William S.
B. Hans Peter Luhn
C. Gregory Piatetsky-Shapiro
D. John Tukey

Ans : D

Explanation: Data Analysis is defined by the statistician John Tukey in 1961 as "Procedures for analyzing data".

Which of the following is true about hypothesis testing?
A. answering yes/no questions about the data
B. estimating numerical characteristics of the data
C. describing associations within the data
D. modeling relationships within the data

Ans : A

Explanation: Hypothesis testing answers yes/no questions about the data.
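As a rough illustration of a yes/no question answered by a test, the sketch below runs a two-sided one-sample z-test with the Python standard library. The function name `z_test`, the sample values, the hypothesized mean `mu0`, and the assumed known standard deviation `sigma` are all made up for the example; real analyses often call for a different test.

```python
from math import sqrt
from statistics import NormalDist, mean

def z_test(sample, mu0, sigma, alpha=0.05):
    """Two-sided one-sample z-test; returns (z, p_value, reject_null).

    Assumes the population standard deviation `sigma` is known.
    """
    z = (mean(sample) - mu0) / (sigma / sqrt(len(sample)))
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))
    return z, p_value, p_value < alpha

# Hypothetical measurements; the question is "does the mean differ from 5.0?"
sample = [5.1, 4.9, 5.6, 5.8, 6.0, 5.7, 5.3, 5.9]
z, p, reject = z_test(sample, mu0=5.0, sigma=0.5)
print("Reject the null hypothesis?", "yes" if reject else "no")
```

The final boolean is the yes/no answer: whether the data give enough evidence, at significance level `alpha`, to reject the claim that the mean equals `mu0`.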

The goal of business intelligence is to allow easy interpretation of large volumes of data to identify new
opportunities.
A. TRUE
B. FALSE
C. Can be true or false
D. Can not say

Ans : A

Explanation: The goal of business intelligence is to allow easy interpretation of large volumes of data to identify new opportunities.

The branch of statistics which deals with development of particular statistical methods is classified as
A. industry statistics
B. economic statistics
C. applied statistics
D. applied statistics

Ans : D

Explanation: The branch of statistics which deals with development of particular statistical methods is classified as applied statistics.

Which of the following is true about regression analysis?
A. answering yes/no questions about the data
B. estimating numerical characteristics of the data
C. modeling relationships within the data
D. describing associations within the data

Ans : C

Explanation: Regression analysis models relationships within the data.
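To make "modeling relationships" concrete, here is a minimal sketch of simple linear regression by ordinary least squares, using only the Python standard library. The helper name `fit_line` and the data points are assumptions chosen for illustration.

```python
from statistics import mean

def fit_line(xs, ys):
    """Least-squares fit of y = slope * x + intercept for paired data."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return slope, intercept

# Hypothetical data lying exactly on the line y = 2x + 1.
xs = [1, 2, 3, 4, 5]
ys = [3, 5, 7, 9, 11]
slope, intercept = fit_line(xs, ys)
print(f"y = {slope:.2f} * x + {intercept:.2f}")
```

The fitted slope and intercept are the model: they describe how the response changes with the predictor, which is different from merely describing an association between the two.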

Text Analytics is also referred to as Text Mining.


A. TRUE
B. FALSE
C. Can be true or false
D. Can not say

Ans : A

Explanation: Text Data Mining is the process of deriving high-quality information from text.
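One very simple form of deriving information from text is counting which terms occur most often. The sketch below assumes a tiny hand-written stopword list and an invented sample sentence; the helper name `top_terms` is hypothetical, and real text mining pipelines go well beyond word counts.

```python
import re
from collections import Counter

# Tiny illustrative stopword list; a real one would be much longer.
STOPWORDS = {"the", "a", "is", "was", "of", "and", "to", "in"}

def top_terms(text, n=3):
    """Return the n most frequent non-stopword terms with their counts."""
    tokens = re.findall(r"[a-z']+", text.lower())
    counts = Counter(t for t in tokens if t not in STOPWORDS)
    return counts.most_common(n)

# Hypothetical customer feedback.
doc = ("The delivery was late. Late delivery is the main complaint, "
       "and the refund was slow.")
print(top_terms(doc, n=2))
```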

Data science is the process of deriving knowledge and insights from a huge and diverse set of data through?


A. organizing data
B. processing data
C. analysing data
D. All of the above

Ans : D

Explanation: Data science is the process of deriving knowledge and insights from a huge and diverse set of data through organizing,
processing and analysing the data.

The modern conception of data science as an independent discipline is sometimes attributed to?
A. William S.
B. John McCarthy
C. Arthur Samuel
D. Satoshi Nakamoto

Ans : A

Explanation: The modern conception of data science as an independent discipline is sometimes attributed to William S.

Which of the following is false?


A. Subsetting can be used to select and exclude variables and observations
B. Raw data should be processed only one time.
C. Merging concerns combining datasets on the same observations to produce a result with more variables
D. None Of the above

Ans : B

Explanation: Raw data may only need to be processed once, but it is not limited to a single pass.

Which of the following are correct skills for a Data Scientist?


A. Probability & Statistics
B. Machine Learning / Deep Learning
C. Data Wrangling
D. All of the above

Ans : D

Explanation: All of the above are correct skills for a Data Scientist.

Which of the following is a correct component of data science?
A. Data Engineering
B. Advanced Computing
C. Domain expertise
D. All of the above

Ans : D

Explanation: All of these are correct components of data science.

Which of the following is not a part of the data science process?


A. Discovery
B. Model Planning
C. Communication Building
D. Operationalize

Ans : C

Explanation: Communication Building is not a part of the data science process.

Which of the following are the Data Sources in data science?


A. Structured
B. UnStructured
C. Both A and B
D. None Of the above

Ans : C

Explanation: Data sources can be both structured and unstructured, such as logs, SQL, NoSQL, or text.

Which of the following is not an application of data science?
A. Recommendation Systems
B. Image & Speech Recognition
C. Online Price Comparison
D. Privacy Checker

Ans : D

Explanation: Privacy Checker is not an application of data science.

__________ Statistics uses the data to provide descriptions of the population, either through numerical
calculations or graphs or tables.

A. Descriptive
B. Quantitative
C. Inferential
D. Qualitative

Ans : A

Explanation: Descriptive Statistics uses the data to provide descriptions of the population, either through numerical calculations or
graphs or tables.
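A small sketch of what such a description might look like, using the Python standard library: numerical calculations (mean, median, standard deviation) plus a frequency table drawn as a text chart. The `scores` values are invented and are treated here as the whole population.

```python
from collections import Counter
from statistics import mean, median, pstdev

# Hypothetical population of test scores.
scores = [70, 85, 85, 90, 60, 85, 70, 95]

# Numerical calculations describing the population.
summary = {
    "mean": mean(scores),
    "median": median(scores),
    "std_dev": pstdev(scores),  # population standard deviation
}
print(summary)

# A frequency table, printed as a simple text bar chart.
freq = Counter(scores)
for value in sorted(freq):
    print(f"{value:>3} | {'#' * freq[value]}")
```

Nothing here generalizes beyond the data in hand; drawing conclusions about a larger population from a sample is the job of inferential statistics.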

Point out the wrong statement.


A. A random variable is a numerical outcome of an experiment
B. Continuous random variable can take any value on the real line
C. There are three types of random variable
D. None of the above

Ans : C

Explanation: There are two types of random variable: continuous and discrete.
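The two types can be contrasted with a short simulation. This is only a sketch using Python's `random` module: a die roll stands in for a discrete random variable and a normal draw for a continuous one. The seed and sample size are arbitrary choices.

```python
import random

rng = random.Random(0)  # fixed seed so the run is repeatable

# Discrete random variable: a die roll takes one of six separate values.
die_rolls = [rng.randint(1, 6) for _ in range(1000)]

# Continuous random variable: a normal draw can fall anywhere on the real line.
normal_draws = [rng.gauss(0.0, 1.0) for _ in range(1000)]

print("distinct die values:   ", len(set(die_rolls)))
print("distinct normal values:", len(set(normal_draws)))
```

The die rolls repeat a handful of values, while the normal draws are almost never repeated, which is the practical signature of a continuous variable.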
