You are on page 1of 1

MODEL EXAMINATIONS

MC5306 – DATA SCIENCE


CLASS : II MCA(2020-2022) DATE : 05-01-2022
SEMESTER :III TIME : 01.00 PM TO 04.00 PM
PART – A (10 x 2 = 20 Marks)
1. Define data science. [CO1,L1]
2. What do you mean by statistical inference? [CO1,L2]
3. Compare Linear and Logistic Regression. [CO2,L1]
4. How Two Way Cross Tabulation is used? [CO2,L3]
5. What is kernel? [CO3,L1]
6. Define Sharding. [CO3,L2]
7. Differentiate Hadoop and RDBMS. [CO3,L1]
8. What is HLog? [CO3,L2]
9. What is streaming? When it is useful? [CO4,L1]
10. How decaying window is used? [CO4,L2]

PART – B (5 x 13 = 65 Marks)
11. a. Explain in detail about Exploratory Data Analysis? [CO1,L2]
(Or)
b. Describe the evolution of analytical scalability? [CO1,L2]
12. a. i. Explain the various measures of Univariate analysis? (8)
a. ii. Write about regression modeling? (5) [CO2,L21]
(Or)
b. Explain how the analysis is represented graphically using R? [CO2,L3]
13. a. Explain how Support Vectors and Kernel machines are used in data modeling? [CO3,L2]
(Or)
b. Describe about i. CAP Theorem (6) ii. CRUD Operations (7) [CO3,L2]
14. a. Write about HDFS Components? Explain Block Replication process in HDFS? [CO3,L2]
(Or)
b. Explain in detail about Map Reduce? [CO5,L3]
15. a. Describe the Stream Data Model and Architecture? [CO4,L2]
(Or)
b. Write about i. Filtering Streams (6) ii. Counting Distinct Elements (7) [CO4,L2]

PART – C (1 x 15 = 15 Marks)
16. Explain in detail about HBase Architecture and its processing model? [CO3,L2]
--------------------

CO1 : Convert real world problems to hypothesis and perform statistical testing.
CO2 : Perform data analysis using R.
CO3 :Design efficient modeling of very large data and work with big data platforms..
CO4 :Implement suitable data analysis for stream data.
CO5 :Write efficient MapReduce programs for small problem solving methods.
L1 : Remember ; L2 : Understand ; L3 : Apply

You might also like