Professional Documents
Culture Documents
Page 1of 2
1000ITT401122201
12 a) With a diagram, explain the various phases of Data Analytics Lifecycle. (14)
Module II
13 a) Explain how is cloud computing related to big data. (4)
b) Identify the various phases involved in big data acquisition and explain the (10)
functionalities of each phase.
OR
14 a) Illustrate the functionalities of five popular data analytics tools and identify their (7)
application areas
b) Explain how MongoDB can be applied to create, update, and delete documents. (7)
Module III
15 a) Illustrate the anatomy of a YARN application with necessary diagram. (10)
b) Explain the benefits and features of Apache Pig. (4)
OR
16 a) Draw the HDFS architecture and describe the HDFS framework and interface. (8)
b) Illustrate the architecture of HIVE using suitable diagram. (6)
Module IV
17 a) Explain Exploratory Data Analysis and its characteristics. (4)
b) With suitable example describe the five commonly used ‘dplyr’ key functions. (10)
OR
18 a) List out the various data structures in R. Represent each type using example. (8)
b) Write R code for the following with ggplot2 using diamonds data set (6)
i) Create a histogram of "carat" with a border colour and fill colour
Set the bin width of the histogram to 0.01
ii) Make a scatterplot: carat vs price and Facet it by clarity
iii) Show carat vs cut, make a violin and a boxplot.
Module V
19 a) Describe the five main techniques used in recommender systems. Also specify (14)
the advantages and disadvantages of each technique.
OR
20 a) Analyze Facebook data to do a case study on citizen centric public services. (7)
b) Illustrate uplift modelling with an appropriate example. (7)
****
Page 2of 2