
BUSINESS INTELLIGENCE AND ANALYTICS

ASSIGNMENT WEEK 1:

Total marks = (15 Qns * 1 mark = 15 marks)

1. What concept does the phrase "turning data tombs into 'golden nuggets' of
knowledge" signify with respect to data mining? (1 mark)

Answer: a) The transformation of extensive data reserves into valuable insights
and knowledge.

2. Which step involves the extraction of data patterns using intelligent methods? (1 mark)

Answer: d) Data mining

3. What is the primary purpose of data mining in the context of the data age? (1 mark)

Answer: C) Uncovering valuable information from vast data and converting it
into organized knowledge

4. Which technology contributed substantially to the evolution and wide acceptance of
relational technology for efficient storage, retrieval, and management of large
amounts of data? (1 mark)

Answer: B) Online transaction processing (OLTP)

5. What does the architecture of a data warehouse primarily aim to facilitate? (1 mark)

Answer: C) Management decision making

6. What is the primary advantage of using data warehouse systems for OLAP? (1 mark)

Answer: c) Presenting data at different levels of abstraction

7. What defines outliers in a dataset? (1 mark)

Answer: d) Objects that deviate from the general behaviour or model of the data.

8. How does data mining benefit from scalable database technologies? (1 mark)

Answer: c) It enables high efficiency and scalability on large datasets

9. What distinguishes Descriptive Analytics from other types of analytics? (1 mark)

Answer: b) It identifies patterns and trends from past data.

10. Which phase in the knowledge discovery process involves the removal of noise
and inconsistent data? (1 mark)

Answer: c) Data cleaning


11. In the context of data preprocessing, what is the purpose of data
transformation? (1 mark)

Answer: C) To prepare data for mining by performing summary or aggregation
operations

12. What is the fundamental characteristic of a relational database? (1 mark)

Answer: A) It consists of a collection of tables with unique names.

13. Which of the following can be called a major driver of Data Mining? (1 mark)

Answer: D) Rise of transaction processing systems/ERPs

14. What does the "long tail" phenomenon refer to in business? (1 mark)

Answer: B) The distribution of sales that extends to less common products
returning substantial profits

15. What can you infer from the following graph? (1 mark)

Answer: a) Less travelled destinations are growing more popular with each passing
year
BUSINESS INTELLIGENCE AND ANALYTICS

ASSIGNMENT WEEK 2:

Total marks = (15 Qns * 1 mark = 15 marks)

1. Which term describes the practice of making decisions purely on data analysis rather
than intuition? (1 mark)

Answer: c) Data-Driven Decision (DDD) Making

2. What characterizes the primary function of databases supporting Online Transaction
Processing (OLTP)? (1 mark)

Answer: c) Handling a large number of simple transactions and updates

3. What does the acronym ACID stand for in the context of databases? (1 mark)

Answer: c) Atomicity, Consistency, Isolation, Durability

4. What primarily distinguishes Online Analytical Processing (OLAP) from
Online Transaction Processing (OLTP)? (1 mark)

Answer: c) OLAP deals with data retrieval and analysis for revealing business
trends, while OLTP supports a large number of simple transactions.

5. Which of the following best describes a data warehouse? (1 mark)

Answer: b) A repository separate from operational databases, providing
integrated and historic data for decision-making processes.

6. Why does a data warehouse not require transaction processing, recovery, and
concurrency control mechanisms? (1 mark)

Answer: b) Due to its physical separation from operational data

7. Which data warehouse model spans the entire organization and provides
corporate-wide data integration? (1 mark)

Answer: c) Enterprise warehouse

8. What distinguishes a data mart from an enterprise warehouse? (1 mark)

Answer: d) Data marts typically contain a subset of corporate-wide data for
specific users, while enterprise warehouses collect information spanning the
entire organization.

9. Which function of back-end tools in data warehouse systems involves
rectifying errors detected in the data? (1 mark)

Answer: b) Data cleaning


10. What is the first step in the ETL process? (1 mark)

Answer: c) Extract data from source systems.

11. Which of the following gives a logical structure of the database graphically? (1 mark)

Answer: a) Entity-relationship diagram

12. Which type of DBMS language is used to create the database schema? (1 mark)

Answer: c) Data Definition Language

13. Which Data Manipulation command is used to add a new record in a database?
(1 mark)

ANS: b) INSERT

14. What does the atomicity property of the ACID database guarantee in a transaction?
(1 mark)

Ans: b) That the transaction will be all-or-nothing.

15. What problem does the ACID property of isolation address? (1 mark)

Ans: b) Concurrent access to data causing inconsistencies.


BUSINESS INTELLIGENCE AND ANALYTICS

ASSIGNMENT WEEK 3:

Total marks = (15 Qns * 1 mark = 15 marks)

1. Which database schema is typically associated with OLAP systems? (1 Mark)

Answer: b) Star or Snowflake schema

2. In a data cube, what are dimensions primarily representing? (1 Mark)

Answer: c) Entities or perspectives for record-keeping

3. What metaphor is used to describe multidimensional data storage in
data warehousing? (1 Mark)

Answer: c) Data cube

4. What does the apex cuboid in a data cube typically represent? (1 Mark)

Answer: b) Highest level of summarization

5. How many cuboids are there in a 4-dimensional cube with 4 levels each? (1 Mark)

Ans: a. 625 cuboids
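
The count in question 5 is consistent with the usual formula for a cube whose dimensions carry concept hierarchies: the total number of cuboids is the product of (L_i + 1) over all dimensions, where L_i is the number of levels of dimension i and the extra 1 stands for the virtual top level "all". A minimal sketch, in which the function name `count_cuboids` is only illustrative:

```python
from math import prod


def count_cuboids(levels_per_dimension):
    """Total cuboids = product of (levels + 1) over all dimensions.

    The +1 accounts for the virtual top level "all" of each dimension.
    """
    return prod(levels + 1 for levels in levels_per_dimension)


# Four dimensions with four levels each, as in the question.
total = count_cuboids([4, 4, 4, 4])
print(total)  # 625
```

With one level per dimension the same formula reduces to the familiar 2^n cuboids of an n-dimensional cube without hierarchies.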

6. What is a significant difference between a snowflake schema and a
star schema? (1 Mark)

Answer: c) Normalization of dimension tables

7. Which schema is commonly used in data warehouses due to its capability to model
multiple, interrelated subjects? (1 Mark)

Answer: c) Fact constellation

8. Which normal form deals with atomicity and ensures that each attribute contains only
indivisible values? (1 Mark)

Answer: A) First Normal Form (1NF)

9. In a relational database, what is the purpose of a foreign key? (1 Mark)

Answer: B) It maintains referential integrity between tables.

10. Consider the SQL statement: SELECT COUNT (*) FROM table_name. What does
it retrieve? (1 Mark)

Answer: c) All rows in the table
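
Read literally, `COUNT(*)` returns a single number, the count of all rows in the table, which is presumably what option c intends. The sketch below uses Python's built-in `sqlite3` module with a small made-up table (the columns and values are assumptions for illustration) and contrasts it with `COUNT(column)`, which skips NULLs:

```python
import sqlite3

# In-memory database with a hypothetical three-row table.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE table_name (id INTEGER, item TEXT)")
conn.executemany(
    "INSERT INTO table_name VALUES (?, ?)",
    [(1, "pen"), (2, None), (3, "ink")],
)

# COUNT(*) counts every row, including rows that contain NULLs.
row_count = conn.execute("SELECT COUNT(*) FROM table_name").fetchone()[0]
# COUNT(item) counts only rows where item is not NULL.
item_count = conn.execute("SELECT COUNT(item) FROM table_name").fetchone()[0]
conn.close()

print(row_count, item_count)  # 3 2
```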

11. What is the primary objective of normalizing a database? (1 Mark)

Answer: A) To eliminate data redundancy and minimize data inconsistency


12. Which normalization form ensures that every non-prime attribute is fully
functionally dependent on the primary key, eliminating all transitive dependencies?
(1 Mark)

Answer: C) Boyce-Codd Normal Form (BCNF)

13. What is the purpose of generating a lattice of cuboids in a data cube model? (1 Mark)

Answer: A) To display data at various levels of summarization based on
different dimensions.

14. What distinguishes a data mart from a data warehouse in terms of schema
preference? (1 Mark)

Answer: C) Data marts typically utilize star or snowflake schemas, while data
warehouses favour the fact constellation schema.

15. What characterizes the Roll-up operation in OLAP? (1 Mark)

Answer: A) It aggregates data by stepping up a concept hierarchy or by adding
dimensions.
BUSINESS INTELLIGENCE AND ANALYTICS

ASSIGNMENT WEEK 4:

Total marks = (15 Qns * 1 mark = 15 marks)

1. The concept of "Survival at time 't'" in survival analysis refers to: (1 Mark)

Answer: D) The probability of a customer surviving from the previous time
period to 't'

2. What does the term "Churn Rate" signify in customer analytics? (1 Mark)

Answer: D) The rate at which customers discontinue or leave

3. How is Customer Lifetime Value (CLV) useful for businesses? (1 Mark)

Answer: C) It assists in identifying high-value customers for loyalty programs

4. What does the customer half-life measure? (1 Mark)

Answer: B) The time taken for exactly half of a customer cohort to leave

5. What does the hazard signify in survival analysis? (1 Mark)

Answer: B) The risk of customer attrition within a specific time interval

6. How is the hazard probability calculated in customer tenure analysis? (1 Mark)

Answer: C) It involves the ratio of customers who stop at a particular tenure to
the population at risk.

7. What is a key application of survival analysis besides measuring customer
churn? (1 Mark)

Ans: b) Calculating customer lifetime value.

8. Why is it important for businesses to track their customer acquisition cost (CAC)
alongside CLV? (1 Mark)

a) To determine the profitability of customer segments

b) To identify opportunities for cost reduction

c) To measure the effectiveness of marketing campaigns

d) All of the above

Ans: d) All of the above

9. What does a survival curve in customer retention showcase? (1 Mark)

Answer: B) The proportion of customers expected to remain active over specific
tenures
10. What makes the survival curve a more reliable measure compared to the retention
curve? (1 Mark)

Answer: B) Survival calculations use information from all customers, offering
more stability.

11. What are the potential limitations of using survival analysis in customer churn
prediction? (1 Mark)

a) It requires a large amount of historical data for accurate predictions.
b) It assumes that customer behaviour remains consistent over time.
c) It cannot account for external factors that may influence churn rates.
d) All of the above

Ans: d) All of the above

12. How does survival differ from retention in customer analytics? (1 Mark)

Answer: C) Survival accumulates probabilities of a customer event not
occurring over time.
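
To make "accumulates probabilities of the event not occurring" concrete: survival at tenure t can be computed as the running product of (1 - hazard) over the earlier tenures. The sketch below is one way to write that; the hazard values are hypothetical and the function name is illustrative.

```python
def survival_curve(hazards):
    """Return survival[t] for t = 0..len(hazards).

    survival[0] is 1.0; each later value multiplies in the probability
    of NOT stopping during the previous tenure period.
    """
    survival = [1.0]
    for hazard in hazards:
        survival.append(survival[-1] * (1.0 - hazard))
    return survival


# Hypothetical hazard probabilities for tenures 0, 1 and 2.
hypothetical_hazards = [0.10, 0.20, 0.50]
curve = survival_curve(hypothetical_hazards)
print([round(s, 3) for s in curve])  # [1.0, 0.9, 0.72, 0.36]
```

In this made-up example survival first drops below 0.5 at tenure 3, which is where the customer half-life would fall.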

13. Which components are crucial for a full customer value calculation? (1 Mark)

Answer: D) Revenues, costs, and length of the customer relationship

14. How does survival analysis contribute to customer value calculations? (1 Mark)

Answer: C) It provides insights into the expected remaining tenure for
customers.

15. An online gaming platform has 100,000 active users. During a specific month,
10,000 users become inactive. The platform identifies 20,000 users as being at risk
of becoming inactive during that month. What is the hazard probability for the online
gaming platform during that month? (1 Mark)

Ans: c) 0.5
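
The keyed answer follows from the definition used in question 6: hazard is the number of customers who stop divided by the population at risk, so the 100,000 total active users do not enter the calculation. A small sketch with an illustrative function name:

```python
def hazard_probability(stopped, at_risk):
    """Hazard = customers who stopped / customers who were at risk."""
    if at_risk <= 0:
        raise ValueError("population at risk must be positive")
    return stopped / at_risk


# 10,000 users became inactive out of 20,000 identified as at risk.
hazard = hazard_probability(stopped=10_000, at_risk=20_000)
print(hazard)  # 0.5
```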
BUSINESS INTELLIGENCE AND ANALYTICS

ASSIGNMENT WEEK 5:

Total marks = (15 Qns * 1 mark = 15 marks)

1. In regression analysis, multicollinearity refers to: (1 Mark)

Answer: C) High intercorrelation among the independent variables, leading to
unstable estimates of the regression coefficients.

2. What type of data transformation technique scales data to a specific range, such as
0 to 1? (1 Mark)

ANS: d) Standardization/Normalization

3. Which of the following statements about the coefficient of determination
(R-squared) is true? (1 Mark)

Answer: C) R-squared ranges from 0 to 1 and represents the percentage of
variation in the dependent variable explained by the independent variables.

4. What does Ordinary Least Squares (OLS) aim to minimize in the context of linear
regression? (1 Mark)

Answer: A) The sum of squared errors between the predicted and observed
values of the dependent variable.

5. The coefficient of determination (R-squared) value of 0.98 in a regression model
implies: (1 Mark)

Answer: B) 98% of the variability in the dependent variable is explained by the
independent variable.

6. Prediction error in a model refers to: (1 Mark)

Solution: A) The difference between actual and predicted values.

7. Which of the following statements is wrong with regard to overfitting in a machine
learning model? (1 Mark)

Solution: A) The model is too simple to capture the underlying patterns in the
data.

8. Underfitting in a machine learning model results in: (1 Mark)

Solution: B) High bias and low variance.

9. When should one focus on reducing bias in a machine learning model? (1 Mark)

Solution: D) When the model doesn’t fit the data well, and works poorly in
explanatory/predictive performance
10. What is the bias-variance trade-off in machine learning? (1 Mark)

Solution: C) Finding the equilibrium between model complexity and its ability
to generalize to unseen data.

11. Training error refers to: (1 Mark)

Solution: A) Error calculated on the training dataset.

12. What does Leave-One-Out Cross-Validation (LOOCV) do? (1 Mark)

Solution: A) It iteratively uses all but one sample as the training set and the
remaining sample as the test set.
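
For reference, the standard LOOCV procedure holds out exactly one sample for testing in each round and trains on all the others, so n samples give n rounds. A minimal index-only sketch (the function name is illustrative, and no model is fitted here):

```python
def loocv_splits(n_samples):
    """Yield (train_indices, test_index) pairs, one pair per sample."""
    for test_index in range(n_samples):
        # Every sample except the held-out one is used for training.
        train_indices = [i for i in range(n_samples) if i != test_index]
        yield train_indices, test_index


splits = list(loocv_splits(4))
for train, test in splits:
    print("train:", train, "test:", test)
```

Each sample appears as the test point exactly once, and the cross-validated error is the average of the per-round test errors.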

13. What is the primary purpose of cross-validation in machine learning? (1 Mark)

Solution: B) To evaluate the model's performance on unseen data.

14. What are the three sources of error in predicted Y in machine learning? (1 Mark)

Solution: C) Reducible error due to inaccurate estimation of f, irreducible error
due to randomness, and test data variation.

15. Which of the following statements most accurately distinguishes supervised learning
from unsupervised learning in machine learning? (1 Mark)

Solution: A) Supervised learning requires labelled data for training models to
predict specific outcomes, while unsupervised learning uncovers patterns or
structures in data without predefined outcomes.
BUSINESS INTELLIGENCE AND ANALYTICS

ASSIGNMENT WEEK 6:

Total marks = 18 Marks

12 Qns * 1 marks = 12 marks

3 Qns * 2 marks = 6 marks

1. Which of the following statements accurately describes a characteristic of classifiers
in machine learning? (1 Mark)

Ans: c) They categorize objects into distinct and mutually exclusive groups based
on their characteristics.

2. Which classification technique is primarily statistics and probability-based? (1 Mark)

Answer: B) Bayes' Classifiers

3. Which are the two measures used in ROC curves to visualize the performance
of classifiers? (1 Mark)

Answer: A) Sensitivity and specificity

4. Which metric measures the ratio of correctly predicted positive observations to the
total predicted positives? (1 Mark)

Answer: D) Precision

5. Imagine you're building a spam filter that classifies emails as spam or not spam.
After testing your model, you get the following results:

- True Positives (TP): 100 emails correctly classified as spam
- False Positives (FP): 5 emails incorrectly classified as spam
- False Negatives (FN): 10 emails incorrectly classified as not spam that are
  actually spam

What is the recall of your spam filter? (2 Marks)

Ans: d) 0.909
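
The keyed value can be checked with the usual definition recall = TP / (TP + FN); false positives do not enter the formula. A short sketch with an illustrative function name:

```python
def recall(tp, fn):
    """Recall = true positives / all actual positives (TP + FN)."""
    return tp / (tp + fn)


# 100 spam emails caught, 10 spam emails missed.
spam_recall = recall(tp=100, fn=10)
print(round(spam_recall, 3))  # 0.909
```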

6. Which technique primarily uses a set of if-else decision rules to categorize
data? (1 Mark)

Answer: A) Decision trees

7. How does the test data variation contribute to the errors in predicting Y
values? (1 Mark)

Answer: B) It causes the irreducible error due to randomness.


8. What does classifier accuracy represent in classification tasks? (1 Mark)

Answer: A) The percentage of test set tuples correctly classified by the classifier.

9. In classification, what does the term "reducible error" primarily refer to? (1 Mark)

Answer: C) Error that can be minimized by better learning techniques

10. In a medical study evaluating a diagnostic test for a certain disease, 150 patients
were tested. Of these, 90 patients were diagnosed with the disease, while 60 patients
did not have the disease. The model predictions are as follows:

Choose the correct option that represents the error rate of the diagnostic test based on
the provided classification outcomes. (2 Marks)

Ans: B) 0.2

11. Overfitting occurs when a classifier incorporates anomalies of the training data that
are not present in the general dataset. (True/False) (1 Mark)

Answer: True

12. In unsupervised learning, for every observation i = 1,..., n, we observe a vector of
measurements xi but no associated response yi. (True/False) (1 Mark)

Answer: True

13. What is the lift obtained by a marketing team if, without data mining, they achieve a
15% response rate by randomly selecting 20% of potential customers, while with
predictive analytics, they target 20% of likely customers and achieve a response rate
of 25%? (2 Marks)

Answer: B) 1.67
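
Lift here is the ratio of the response rate achieved with targeting to the response rate of random selection. Both campaigns contact the same 20% of customers, so that share cancels out. A quick check, with an illustrative function name:

```python
def lift(targeted_rate, baseline_rate):
    """Lift = response rate with the model / response rate without it."""
    return targeted_rate / baseline_rate


# 25% response with predictive analytics versus 15% with random selection.
campaign_lift = lift(targeted_rate=0.25, baseline_rate=0.15)
print(round(campaign_lift, 2))  # 1.67
```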

14. Choose the correct answer:

1. K-nearest neighbours or KNN is an unsupervised classification algorithm

2. K-means Clustering is a supervised classification algorithm.

a) 1 and 2 are correct

b) Only 1 is correct

c) Only 2 is correct

d) Both are wrong


Ans: d) Both are wrong

15. Which of the following is NOT a commonly used classification technique? (1 Mark)

Answer: D. Principal component analysis (PCA)


BUSINESS INTELLIGENCE AND ANALYTICS

ASSIGNMENT WEEK 7:

Total marks = 17 Marks

14 Qns * 1 marks = 14 marks

1 Qns * 3 marks = 3 marks

1. What is a key advantage of decision trees in knowledge representation? (1 Mark)

Answer: C) They represent acquired knowledge in an intuitive tree form

2. What does an internal node in a decision tree represent? (1 Mark)

Answer: B) Outcome of a test on an attribute

3. Why might a decision tree, resulting from the described process, perform poorly on
a test set? (1 Mark)

Answer: C) It is likely to have too many splits, causing overfitting

4. What might a smaller tree with fewer splits achieve in terms of variance and bias?
(1 Mark)

Answer: B) It reduces variance but possibly increases bias

5. How does a classification tree differ from a regression tree? (1 Mark)

Answer: B) It predicts a qualitative response, not a quantitative one.

6. Which of the following is an advantage of decision trees compared to linear
regression? (1 Mark)

Answer: B) Decision trees are easier to explain and interpret.

7. Ensemble methods are used to improve prediction performance of decision
trees. (T/F) (1 Mark)

Answer: True

8. Bagging primarily addresses which issue within statistical learning methods
like decision trees? (1 Mark)

Answer: C) Deals with high variance and improves prediction accuracy.

9. Which technique involves averaging predictions from multiple models built
on bootstrapped training sets? (1 Mark)

Answer: b) Bootstrap aggregation (bagging)

10. How does bagging handle classification problems? (1 Mark)

Answer: c) By taking a majority vote from predictions of multiple trees


11. In Bagging, each individual tree is independent of the others because they consider
different subsets of features and samples. (T/F) (1 Mark)

Ans: True

12. What are some common techniques for handling imbalanced data in
classification tasks? (1 Mark)

a) Oversampling the minority class to create a more balanced dataset.
b) Undersampling the majority class to reduce its dominance.
c) Only a is correct
d) Both a and b are correct

Ans: d) Both a and b are correct

13. In Random Forest you can generate hundreds of trees (say T1, T2, ..., Tn) and then
aggregate the results of these trees. Which of the following is true about an individual
tree (Tk) in Random Forest? (1 Mark)

1. Individual tree is built on a subset of the features
2. Individual tree is built on all the features
3. Individual tree is built on a subset of observations
4. Individual tree is built on full set of observations

Solution: A) 1 and 3

14. Consider a dataset with a binary target variable (0 or 1) and a split based on
a feature resulting in two child nodes after the split.

- Node 1 (left child): Out of 40 samples, 30 belong to class 0 and 10 belong to class 1.
- Node 2 (right child): Out of 60 samples, 20 belong to class 0 and 40 belong to class 1.

Which option has the correct Gini indices of the child nodes? (3 Marks)

Solution: b) Gini index for Node 1: 0.375, Gini index for Node 2: 0.444
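
Both values follow from the Gini index of a node, 1 minus the sum of squared class proportions. A short sketch that reproduces them from the class counts in the question (the function name is illustrative):

```python
def gini_index(class_counts):
    """Gini index of a node = 1 - sum of squared class proportions."""
    total = sum(class_counts)
    return 1.0 - sum((count / total) ** 2 for count in class_counts)


node1 = gini_index([30, 10])  # 30 of class 0, 10 of class 1
node2 = gini_index([20, 40])  # 20 of class 0, 40 of class 1
print(round(node1, 3), round(node2, 3))  # 0.375 0.444
```

A pure node (all samples in one class) gives 0, and a 50/50 binary node gives the maximum of 0.5.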

15. How does Random Forest aim to reduce correlation among trees? (1 Mark)

Answer: b) By growing trees independently with a random subset of
predictors at each split.
BUSINESS INTELLIGENCE AND ANALYTICS

ASSIGNMENT WEEK 8:

Total marks = 15 Marks

15 Qns * 1 marks = 15 marks

1. Which of the following is a common method for splitting nodes in a decision tree?
(1 Mark)

Answer: A) Gini impurity

2. What is the purpose of pruning in decision trees? (1 Mark)

Answer: A) To reduce the depth of the tree and prevent overfitting

3. Which of the following is a popular algorithm for constructing decision trees? (1 Mark)

Answer: A) ID3

4. What is the main difference between classification and regression trees in a
CART algorithm? (1 Mark)

Answer: A) Classification trees predict categorical variables, while
regression trees predict continuous variables

5. Consider the following statements:

a) Bagging is less susceptible to model overfitting as compared to Boosting

Which of the above statements are correct? (1 Mark)

Ans: B) Only b

6. What is entropy in the context of decision trees? (1 Mark)

Answer: A) A measure of disorder or impurity in a node

7. Which of the following is a common stopping criterion for growing a decision tree?
(1 Mark)

A. Reaching a maximum depth

B. Achieving a minimum information gain

Answer: D) Both A and B

8. For decision trees, what purpose does "one-hot encoding" serve? (1 Mark)

Answer: B) Transform categorical data into numerical format that algorithms
can process

9. What's the primary drawback of utilizing a substantial maximum depth for a decision
tree? (1 Mark)

Answer: A) It leads to overfitting


10. Which strategy is effective in mitigating overfitting in decision trees? (1 Mark)

A. Pruning

B. Bagging

C. Boosting

D. All of the above

Answer: D

11. Which of the following is NOT commonly associated with the use of decision trees?
(1 Mark)

Answer: D) Image classification

12. How can decision trees be made more robust to noise in the data? (1 Mark)

Answer: C) By using ensemble techniques like bagging or boosting

13. What role do leaf nodes play in a decision tree? (1 Mark)

Answer: D) To denote the class label or predicted value

14. If the true positive value is 10 and the false positive value is 15, what is the precision score
for the classification model? (1 Mark)

Answer: B) 0.4
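
Precision is TP / (TP + FP), the share of predicted positives that are truly positive. A quick check of the keyed value, with an illustrative function name:

```python
def precision(tp, fp):
    """Precision = true positives / all predicted positives (TP + FP)."""
    return tp / (tp + fp)


score = precision(tp=10, fp=15)
print(score)  # 0.4
```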

15. Which of the following definitions describes false negatives? (1 Mark)

Answer: A) Predicted negatives that are actually positives


BUSINESS INTELLIGENCE AND ANALYTICS

ASSIGNMENT WEEK 9:

Total marks = 15 Marks

15 Qns * 1 marks = 15 marks

1. What is a key challenge associated with unsupervised learning? (1 Mark)

Answer: B) Subjectivity and absence of a clear analysis goal

2. For clustering, we do not require- (1 Mark)

ANSWER: A) Labeled data

3. Which of the following is an example of an unsupervised learning algorithm? (1 Mark)

Ans: C. K-means clustering

4. What distinguishes K-means clustering from hierarchical clustering? (1 Mark)

Answer: B) K-means clustering requires knowing the number of clusters
beforehand, while hierarchical clustering does not.

5. What does a dendrogram represent in hierarchical clustering? (1 Mark)

Answer: C) A tree-like structure showing clustering at various levels

6. Which of the following is a method of choosing the optimal number of clusters for
k-means? (1 Mark)

A. Shadow method

B. the silhouette method

C. the elbow method

D. B and C

ANSWER: D) B and C

7. Which of the following statements best describes the goal of SMOTE preprocessing
technique? (1 Mark)

Ans: b) Balance the class distribution in imbalanced datasets

8. What defines a good clustering according to the K-means approach? (1 Mark)

Answer: C) Minimizing the sum of squared distances within each cluster

9. Which of the following is a limitation of K-means clustering? (1 Mark)

A. Sensitivity to the initial placement of cluster centroids

B. Inability to handle missing data


C. Inability to handle categorical data

D. All of the above

Answer: D) All of the above

10. Which of the following statements about distance between clusters is true? (1 Mark)

Answer: D) Complete linkage considers the maximum distance between
objects in different clusters.

11. In a 3-dimensional space represented by coordinates (x, y, z), two cluster centroids,
A and B, have coordinates A(2, 4, 6) and B(5, 1, 3) respectively. What is the precise
Euclidean distance between these centroids, denoting their dissimilarity in the
cluster space? (1 Mark)

Answer: A) 5.20 units
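
The distance is the square root of the summed squared coordinate differences: sqrt(3^2 + 3^2 + 3^2) = sqrt(27). The sketch below checks this with `math.dist` from the standard library (available from Python 3.8):

```python
from math import dist

centroid_a = (2, 4, 6)
centroid_b = (5, 1, 3)

# Euclidean distance between the two centroids.
distance = dist(centroid_a, centroid_b)
print(round(distance, 2))  # 5.2
```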

12. In K-means clustering, what is the purpose of the "elbow method"? (1 Mark)

ANSWER: A) To determine the optimal number of clusters

13. Suppose that a customer transaction table contains 9 items and 3 customers. What
is the Jaccard coefficient (similarity measure for asymmetric binary variables) for
C1 and C2? (1 Mark)

Ans: b. 0.25

14. In the figure below, if you draw a horizontal line on the y-axis for y=2. What will
be the number of clusters formed? (1 Mark)
Solution: (B) 2

15. Assume you want to cluster 7 observations into 3 clusters using the K-Means
clustering algorithm. After first iteration, clusters C1, C2, C3 have following
observations:

C1: {(2,2), (4,4), (6,6)}

C2: {(0,4), (4,0)}

C3: {(5,5), (9,9)}

What will be the Manhattan distance for observation (9, 9) from cluster centroid C1 in the
second iteration? (1 Mark)

Solution: (A) 10
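
The answer can be reproduced in two steps: the centroid of C1 used in the second iteration is the mean of its three points, (4, 4), and the Manhattan distance from (9, 9) is |9 - 4| + |9 - 4|. A sketch with illustrative helper names:

```python
def centroid(points):
    """Coordinate-wise mean of a list of points."""
    n = len(points)
    return tuple(sum(coords) / n for coords in zip(*points))


def manhattan(p, q):
    """Sum of absolute coordinate differences."""
    return sum(abs(a - b) for a, b in zip(p, q))


c1 = centroid([(2, 2), (4, 4), (6, 6)])
distance_to_c1 = manhattan((9, 9), c1)
print(c1, distance_to_c1)  # (4.0, 4.0) 10.0
```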


BUSINESS INTELLIGENCE AND ANALYTICS

ASSIGNMENT WEEK 10:

Total marks = 15 Marks

15 Qns * 1 marks = 15 marks

1. What does RFM stand for in customer segmentation strategy? (1 Mark)

Answer: B) Recency, Frequency, Monetary value

2. Which of the following is NOT true about RFM analysis? (1 Mark)

Answer: A) It requires detailed demographic data.

3. How are Butterflies categorized in RFM segmentation? (1 Mark)

Answer: B) Profitable but disloyal customers

4. Which customer group is likely to need incentives to increase their
spending and engagement? (1 Mark)

Answer: C) Barnacles

5. What does CLV stand for in RFM analyses? (1 Mark)

Answer: A) Customer Lifetime Value

6. How are R, F, and M typically combined to create composite scores in some
methods? (1 Mark)

Answer: B) Multiplying R by 5, F by 2, and M by 1

7. What SQL function is used for RFM analysis to scale RFM into a
predefined range? (1 Mark)

Answer: C) NTILE()

8. In RFM analysis what does "Recency" refer to? (1 Mark)

Answer: C) The time elapsed since a customer's last purchase

9. What is the purpose of the fit_predict method in scikit-learn's
K-means implementation? (1 Mark)

Answer: B) Predicting cluster labels

10. Which Python package provides functionality for visualizing K-means
clustering results using 2D and 3D plots?

Answer: B) matplotlib

11. How is Recency (R) scaled after grouping Days since last order into 10
deciles? (1 Mark)
Answer: B) It is reversed, with the most recent customer receiving the
highest R value
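
A rough illustration of the reversal, assuming the deciles are assigned the way SQL's NTILE(10) would assign them over "days since last order" in ascending order, and that the R score is then taken as 11 minus the decile. Both the scoring rule and the sample data are assumptions made for this sketch, not something the assignment spells out.

```python
def ntile(row_count, n_buckets):
    """Bucket number (1-based) for each sorted row position, as in SQL NTILE."""
    base, extra = divmod(row_count, n_buckets)
    buckets = []
    for bucket in range(1, n_buckets + 1):
        # The first `extra` buckets receive one additional row.
        size = base + (1 if bucket <= extra else 0)
        buckets.extend([bucket] * size)
    return buckets


def recency_scores(days_since_last_order, n_buckets=10):
    """Map each customer to an R score; fewer days gives a higher score."""
    order = sorted(range(len(days_since_last_order)),
                   key=lambda i: days_since_last_order[i])
    tiles = ntile(len(order), n_buckets)
    scores = [0] * len(order)
    for position, customer in enumerate(order):
        # Tile 1 holds the most recent buyers, so reverse the scale.
        scores[customer] = n_buckets + 1 - tiles[position]
    return scores


hypothetical_days = [3, 40, 7, 365, 90, 1, 200, 14, 60, 30]
r_scores = recency_scores(hypothetical_days)
print(r_scores)  # [9, 5, 8, 1, 3, 10, 2, 7, 4, 6]
```

The customer who ordered 1 day ago receives R = 10 and the customer who last ordered 365 days ago receives R = 1.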

12. Which clustering algorithm assigns data points to the nearest cluster centroid? (1 Mark)

Answer: a. K-Means

13. A retail company wants to segment its customers for targeted marketing
campaigns. They have data on customer demographics (age, gender,
income), purchase history (amount, frequency, categories), and online
behaviour (website visits, clicks). Which features are most suitable for
k-means clustering in this scenario? (1 Mark)

Ans: d) A combination of all features

14. True or False: In K-means clustering, each cluster is represented by its center
(centroid) which corresponds to the median of points assigned to the cluster.

Ans: False

15. Out of the reasons listed below, what would be a major reason for you not
to choose K-means for clustering analysis? (1 Mark)

Ans: A) It is sensitive to noise and outlier data points and also sensitive to the
initial placement of its cluster centers (centroids).
BUSINESS INTELLIGENCE AND ANALYTICS

ASSIGNMENT WEEK 11:

Total marks = 15 Marks


15 Qns * 1 marks = 15 marks

1. Which of the following is not a type of layer in a neural network? (1 Mark)


A. Input layer
B. Hidden layer
C. Output layer
D. Support layer

Answer: D) Support layer

2. What is the process of adjusting control parameters to optimize a neural network's
performance called? (1 Mark)

Answer: B) Hyperparameter tuning

3. What is the purpose of the learning rate in a neural network? (1 Mark)

Answer: A) To control the speed of weight updates

4. What is the purpose of the loss function in a neural network? (1 Mark)

Answer: D) To measure the difference between predicted output and actual
output

5. What does the term 'backpropagation' refer to in neural networks? (1 Mark)

Answer: B) Fine-tuning the weights by propagating errors backward

6. Which algorithm is commonly used for updating weights in backpropagation?
(1 Mark)

Answer: A) Gradient Descent

7. What does the term 'epoch' refer to in neural network training? (1 Mark)

Answer: C) One complete cycle of training data through the network

8. What is a perceptron? (1 Mark)

Ans: A) A single-layer feed-forward neural network

9. Which of the following best defines cross-sectional data? (1 Mark)


Answer: B) Data collected from a single point in time from different subjects

10. If a neural network has 16 input neurons and 4 output neurons, how many neurons
would be recommended for the hidden layer according to thumb rule? (1 Mark)

Answer: A) 8 neurons
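
The question does not say which rule of thumb it has in mind. One commonly quoted heuristic that happens to reproduce the keyed answer is the geometric mean of the input and output layer sizes, sqrt(inputs * outputs). The sketch below assumes that heuristic; other rules of thumb give different numbers.

```python
from math import sqrt


def hidden_neurons_geometric_mean(n_inputs, n_outputs):
    """Assumed heuristic: hidden size = sqrt(inputs * outputs), rounded."""
    return round(sqrt(n_inputs * n_outputs))


suggested = hidden_neurons_geometric_mean(16, 4)
print(suggested)  # 8
```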

11. If you increase the number of hidden layers in a multi-layer perceptron,
the classification error of test data always decreases. True/False (1 Mark)
Answer: False

12. There is a feedback loop in the final stage of a back propagation algorithm. T/F
(1 Mark)
Answer: False

13. In time series analysis, which component represents the long-term movement or
the general direction of the data? (1 Mark)

Answer: C) Trend

14. What defines panel data in econometric studies? (1 Mark)

Answer: A) Data that involves repeated multi-dimensional observations of the
same subjects over different periods of time

15. What differentiates a feedforward neural network from other types of neural networks
like recurrent neural networks (RNNs) or convolutional neural networks (CNNs)? (1 Mark)

Answer: C) Information flows in a single direction, without loops or cycles.


BUSINESS INTELLIGENCE AND ANALYTICS

ASSIGNMENT WEEK 12:

Total marks = 15 Marks


15 Qns * 1 marks = 15 marks

1. What is a token in text mining? (1 Mark)

Answer: b) A string of contiguous alphanumeric characters
with space on either side

2. What does stemming or lemmatization aim to achieve in text processing? (1 Mark)

Answer: c) Normalization by reducing different word forms into a single stem


3. Which of the following defines a document in text mining? (1 Mark)

Answer: b) A sequence of N words


4. What does the term "Corpus" refer to in text mining? (1 Mark)

Answer: C) A collection of M documents

5. Which type of words are typically considered for removal or stop word
lists in text mining? (1 Mark)

Answer: D) Commonly used words like "the," "is," "of," etc.

6. How is the inverse document frequency (idf) calculated for a given
term? (1 Mark)

Answer: A) idf(term) = log(number of documents / number of documents containing the term)


7. True or False: The statistic tf-idf is intended to measure how important a
word is to a document in a collection (or corpus) of documents

Answer: True

8. How can bigrams be beneficial in text analysis compared to individual words?


(1 Mark)

Answer: B) Bigrams offer more context and capture structural relationships
between words

9. What does a higher Phi coefficient value indicate regarding word co-
occurrence? (1 Mark)

a) Given words occur independently of each other
b) Given words always appear together in the same documents
c) Given words rarely appear together in documents
d) Given words are more likely to co-occur compared to appearing separately

Answer: D) Given words are more likely to co-occur compared to
appearing separately

10. In a text corpus comprising 200 documents, neither "forest" nor "wildlife"
appears in 120 documents. Both "forest" and "wildlife" co-occur in 50 documents.
Furthermore, "forest" without "wildlife" appears in 10 documents, and "wildlife" without
"forest" appears in 20 documents. What is the Phi coefficient to measure the correlation
between the appearance of the words "forest" and "wildlife" in this dataset?

ANSWER: B) 0.66
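
The keyed value can be verified with the standard Phi coefficient for a 2x2 table: (n11 * n00 - n10 * n01) divided by the square root of the product of the four marginal totals. In the sketch below n11 is the number of documents containing both words, n00 the number containing neither, and n10 / n01 the number containing only one of them; the function name is illustrative.

```python
from math import sqrt


def phi_coefficient(n11, n10, n01, n00):
    """Phi coefficient of a 2x2 co-occurrence table."""
    row1, row0 = n11 + n10, n01 + n00  # documents with / without word X
    col1, col0 = n11 + n01, n10 + n00  # documents with / without word Y
    return (n11 * n00 - n10 * n01) / sqrt(row1 * row0 * col1 * col0)


# X = "forest", Y = "wildlife", counts taken from the question.
phi = phi_coefficient(n11=50, n10=10, n01=20, n00=120)
print(round(phi, 2))  # 0.66
```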

11. Which of the following datasets provides a polarity score ranging from
-5 to +5 for words in sentiment analysis? (1 Mark)

Answer: C. AFINN sentiment dataset

12. What does a cosine similarity value of 0 indicate when comparing
term frequency vectors of two documents? (1 Mark)

Answer: A. The documents have no common words
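
Term-frequency vectors have no negative entries, so their dot product is zero only when no word has a non-zero count in both documents. A small sketch with a made-up four-word vocabulary (all vectors here are hypothetical):

```python
from math import sqrt


def cosine_similarity(u, v):
    """Cosine of the angle between two term-frequency vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = sqrt(sum(a * a for a in u))
    norm_v = sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


# Hypothetical vocabulary: ["forest", "wildlife", "bank", "loan"]
doc_a = [2, 1, 0, 0]
doc_b = [0, 0, 3, 1]  # shares no words with doc_a
doc_c = [1, 1, 0, 0]  # shares both of its words with doc_a

sim_ab = cosine_similarity(doc_a, doc_b)
sim_ac = cosine_similarity(doc_a, doc_c)
print(sim_ab, round(sim_ac, 3))  # 0.0 0.949
```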

13. In TF-IDF analysis, what does the term frequency (tf) measure for
a word in a document? (1 Mark)

Answer: A) The count of the word in a document divided by the total
words in that document
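
Combining this definition of tf with the idf formula from question 6 gives tf-idf = tf * idf. The sketch below uses a tiny made-up corpus and assumes the natural logarithm for "log" (the assignment does not state the base); the function names are illustrative.

```python
from math import log


def tf(term, document):
    """Count of the term divided by the total words in the document."""
    return document.count(term) / len(document)


def idf(term, corpus):
    """log(number of documents / number of documents containing the term).

    Assumes the term occurs in at least one document of the corpus.
    """
    containing = sum(1 for document in corpus if term in document)
    return log(len(corpus) / containing)


hypothetical_corpus = [
    ["the", "forest", "is", "green"],
    ["the", "wildlife", "in", "the", "forest"],
    ["the", "bank", "is", "closed"],
]
doc = hypothetical_corpus[1]

tf_idf_the = tf("the", doc) * idf("the", hypothetical_corpus)
tf_idf_wildlife = tf("wildlife", doc) * idf("wildlife", hypothetical_corpus)
print(tf_idf_the, round(tf_idf_wildlife, 3))  # 0.0 0.22
```

A word that appears in every document, such as "the" here, gets an idf of 0 and therefore a tf-idf of 0 no matter how often it occurs.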

14. A bag of words model uses- (1 Mark)


A. A vocabulary of known words
B. A measure of the presence of known words
C. Both A and B
D. None
ANSWER: C

15. Which of these techniques is used for normalization in text mining? (1 Mark)

A. Stemming
B. Stop words removal
C. Lemmatization
D. All of the above

ANSWER: D
