
SRI VENKATESWARA COLLEGE OF ENGINEERING

(Autonomous)
Karakambadi Road, TIRUPATI – 517507

Question Bank - 2 Marks and 10 Marks Questions

Name of the subject III B.Tech II Sem Regular 2023


Subject Big Data Analytics
Subject Code CS20APE602
Year & Sem III Year & II Sem

2 Marks Questions

Unit – 1
1 What is the need for Big Data?
2 What are the major technological challenges in managing Big Data?
3 List the technologies available to manage Big Data.
4 Discuss why Big Data analytics is important.
5 Define Big Data.
Unit – 2
1 Discuss in brief the term HDFS in the Hadoop environment.
2 What is the key-value pair format? How is it different from other data
structures?
3 What is the JobTracker program? How does it differ from the TaskTracker
program?
4 What is the default replication factor in HDFS?
5 What is YARN?
Unit – 3
1 Discover the steps involved in running a MapReduce application.
2 List the types of MapReduce applications.
3 What is the task of the Mapper?
4 How does the Secondary NameNode differ from the NameNode in HDFS?
5 How much memory does a NameNode need?
Unit – 4
1 Explain the core components of Hadoop.
2 List out the benefits of Pig.
3 Specify the role of Pig in Hadoop.
4 How is security provided in Hadoop?
5 List the data processing operators in Pig.
Unit – 5
1 What are the differences between Pig and Hive?
2 What is Hive? List any four main features of Hive.
3 List the main features of Spark.
4 What are the advantages of HBase?
5 What do you mean by windowing in HiveQL?
10 Marks Questions
Unit – 1    Bloom's Level    COs
1 What is Big Data? Explain the characteristics of Big Data. L1 CO1
2 List various applications of big data. What are the challenges in improving business for a superstore? L2 CO1
3 Describe the structure of HDFS in a Hadoop ecosystem using a diagram. L2 CO1
4 Compare Big Data with conventional data and indicate the importance of Big Data analysis. L2 CO1
5 Explain how to analyze data with Hadoop with suitable diagrams and an example. L1 CO1
6 What are the major sources of big data? Describe a source of each type. L2 CO1
7 Describe the architecture of Hadoop Technology. L2 CO1
8 List and explain the advantages of big data analytics. L1 CO1
Unit – 2
1 Explain the design of HDFS and HDFS concepts. L1 CO3
2 Explain Blocks, Namenodes, Datanodes and Block Caching concepts in HDFS. L2 CO2
3 Discuss the architecture of Hadoop Distributed File System. L2 CO3
4 Explain how to set up the development environment for MapReduce. L3 CO2
5 Briefly describe the API for the MapReduce framework. L2 CO2
6 Enumerate the steps to create a word count application using MapReduce. L2 CO2
7 Explain the importance of the command-line interface in Hadoop. L1 CO2
8 Describe the working of MapReduce with a relevant example. L2 CO2
9 Briefly explore the features of MapReduce. L1 CO2
Unit – 3
1 Discuss in brief the implementation of the MapReduce concept with a suitable example. L2 CO3
2 Explain in brief MapReduce types, input formats, and output formats. L2 CO3
3 Explain the framework of MapReduce. L1 CO3
4 List and explain the features of the MapReduce programming model. How does a MapReduce program enable parallel processing? L2 CO3
5 Describe the anatomy of a MapReduce job. L2 CO3
6 With a neat sketch, briefly discuss the anatomy of a MapReduce job run. L2 CO3
7 Discuss how security is provided in Hadoop. L2 CO4
8 How is a map task implemented using key-value pairs in an input file? What are the uses of shuffle in processing the aggregates? L3 CO4
Unit – 4
1 Derive the core components of the Hadoop cluster. L2 CO5
2 Write in detail about the network topology of Hadoop cluster architecture. L2 CO5
3 Explain in detail the Master and Slave components of the Hadoop cluster. L2 CO5
4 What is a Hadoop cluster? Write the steps to configure a Hadoop cluster. L2 CO5
5 Write in detail about user-defined functions in Pig. L1 CO6
6 Describe Pig data types and operators: Group, Join, Filter, Order by, Sort and Split. L2 CO4
7 Discuss how security is provided in Hadoop. L1 CO5
8 Explain the data processing operators in Pig. L1 CO6
Unit – 5
1 Explain with suitable examples the built-in functions in Hive. L2 CO6
2 Compare Hive with traditional databases. L1 CO6
3 Describe the Hive architecture components. Why is HiveQL used for big data? L2 CO6
4 What is HBase? Give a detailed description of the features of HBase. L2 CO6
5 Discuss in detail how Spark runs a job. L2 CO6
6 Explain the concept of Resilient Distributed Datasets in Spark. L1 CO6
7 What is HBase? Differentiate between HBase and Hive. L1 CO6
8 Construct an online query application using HBase. L2 CO6

Signature of the Faculty Signature of the HOD
