Professional Documents
Culture Documents
MC4311 Set 4
MC4311 Set 4
Third Semester
(Regulations 2021)
2. a). Write an R script for Reading different types of data sets (.txt, .csv) from Web and disk
and writing in file in specific disk location and also reading Excel data sheet in R. (80)
b).Result/inference/observations (10)
c). Viva- Voce (10)
6. a. Download PIMA Indians diabetes data set then perform univariate analysis for the
diabetes dataset using R (80)
b.Result/inference/observations (10)
c. Viva- Voce (10)
Page 1 of 3
7. a. Perform Bivariate analysis for PIMA Indians diabetes data set using R (80)
b.Result/inference/observations (10)
c. Viva- Voce (10)
8. a. Download iris dataset then perform multiple regression analysis for this dataset using R
(80)
b.Result/inference/observations (10)
c. Viva- Voce (10)
9. a. Create dataset data.csv containing the attributes such as Country, Age, Salary, and
Purchased using Microsoft Excel. Importing the dataset. Replace the missing Age data with
the average of the feature in which the data is missing, and Encoding categorical data using
R. (80)
b.Result/inference/observations (10)
c. Viva- Voce (10)
10. a. Implement data preprocessing techniques on real time dataset using R (80)
b. Result/inference/observations (10)
c. Viva- Voce (10)
11. a. Create confusion matrix and demonstrate how do you measure the performance of the
machine learning model such as accuracy, precision, recall, and F-score (80)
b. Result/inference/observations (10)
c. Viva- Voce (10)
12. a. Apply multiple regressions, if data have a continuous Independent variable. Apply on
diabetes dataset. (80)
b. Result/inference/observations (10)
c. Viva- Voce (10)
13. a. Apply EM algorithm to cluster a set of data stored in a .CSV file using python. (80)
b. Result/inference/observations (10)
c. Viva- Voce (10)
14. a. Write a program to implement the naïve Bayesian classifier for a sample training data set.
(80)
b. Result/inference/observations (10)
c. Viva- Voce (10)
15. a. Write a python program to construct a Bayesian network considering medical data. Use
this model to demonstrate the diagnosis of heart patients using standard Heart Disease Data
Set. (80)
b. Result/inference/observations (10)
c. Viva- Voce (10)
Page 2 of 3
16. a. Write a program to implement the k-Nearest Neighbour algorithm to classify the iris data
set. Print both correct and wrong predictions. Java/Python ML library classes can be used for
this problem. (80)
b. Result/inference/observations (10)
c. Viva- Voce (10)
17. a. Apply the technique of pruning for a noisy data monk2 data, and derive the decision tree
from this data. Analyze the results by comparing the structure of pruned and unpruned tree
using Python. (80)
b. Result/inference/observations (10)
c. Viva- Voce (10)
18. a. Build an Artificial Neural Network by implementing the Backpropagation algorithm and test
the same using appropriate data sets (80)
b. Result/inference/observations (10)
c. Viva- Voce (10)
19. a. Implement Support Vector Classification for linear kernels using Python. (80)
b. Result/inference/observations (10)
c. Viva- Voce (10)
20. a. Perform Logistic Regression analysis on iris data set and plot the results (80)
b. Result/inference/observations (10)
c. Viva- Voce (10)
Page 3 of 3