
ASSIGNMENT - 1

Module – 1
Q. No.   Description of Questions   Marks   RBT Level
BATCH 1
What is big data? Explain classification of data.
1 5 L2
What are characteristics of big data? Explain big data types.
2 5 L2
Explain how big data is used in a chocolate marketing company.
3 5 L2
Explain how to implement vertical scalability in big data analytics.
4 5 L2
Draw and explain layers and functions in data processing architectures.
5 5 L2

BATCH 2
Explain how big data is used in a weather data recording, monitoring and prediction organization.
1 5 L2
List and explain parameters of good quality data.
2 5 L2
Explain different activities in data pre-processing.
3 5 L2
Draw and explain data store export to cloud.
4 5 L2
Draw and explain the Google Cloud Platform for the BigQuery cloud service.
5 5 L2
BATCH 3
Explain how a data store is used with structured and semi-structured data.
1 5 L2
List and explain different big data storages.
2 5 L2
Write different phases in analytics.
3 5 L2
Explain how big data is used in automotive components and predictive maintenance services.
4 5 L2
Explain how to implement horizontal scalability in big data analytics.
5 5 L2
ASSIGNMENT - 2
Module – 2
Q. No.   Description of Questions   Marks   RBT Level
BATCH 1
Draw and explain Hadoop architecture.
1 5 L2
How does Hadoop work?
2 5 L2
Explain different components of Hadoop ecosystem.
3 5 L2
Explain components of Hadoop distributed file system.
4 5 L2
List and explain Hadoop user commands.
5 5 L2
BATCH 2
Draw and explain Hadoop MapReduce architecture.
1 5 L2
Explain the following terminologies related to Hadoop MapReduce architecture: 1) Payload 2) Mapper 3) Name Node 4) Data Node 5) Master Node
2 5 L2
Explain how YARN manages resources in Hadoop architecture.
3 5 L2
Explain the main components of YARN architecture.
4 5 L2
What is HBase? How does it store semi-structured data?
5 5 L2
BATCH 3
Explain two use cases of HBase.
1 5 L2
Draw Hive architecture and explain how it processes a query.
2 5 L2
Explain components of Apache Pig.
3 5 L2
Explain big data import and export using Sqoop.
4 5 L2
Explain how Apache Flume is used to process web log data.
5 5 L2
ASSIGNMENT - 3
Module – 3
Q. No.   Description of Questions   Marks   RBT Level
BATCH 1
Explain features of distributed computing architectures.
1 5 L2
Explain CAP theorem.
2 5 L2
Explain characteristics of the schema-less model.
3 5 L2
Draw and explain flexible NoSQL DB of students.
4 5 L2
Explain key-value store. Also explain advantages of key-value store.
5 5 L2

BATCH 2
Write features of document store.
1 5 L2
Explain CSV and JSON file format.
2 5 L2
With example explain XML document architecture pattern.
3 5 L2
Explain columnar data store with example.
4 5 L2
Write and explain characteristics of columnar family data store.
5 5 L2
BATCH 3
Explain features of BigDataTable.
1 5 L2
Draw and explain ORC file format.
2 5 L2
Explain characteristics of big data NoSQL solutions.
3 5 L2
Explain any two sharding models.
4 5 L2
Explain features of MongoDB.
5 5 L2
ASSIGNMENT - 4
Module – 4
Q. No.   Description of Questions   Marks   RBT Level
BATCH 1
Draw and explain the MapReduce process on client submitting a job.
1 5 L2
With sample code, explain map task.
2 5 L2
Explain the following terms: InputSplit, RecordReader, Combiner.
3 5 L2
Draw and explain MapReduce execution steps.
4 5 L2
Explain MapReduce for ACPAMS data analysis.
5 5 L2
BATCH 2
Explain how node failures are handled.
1 5 L2
Explain various operations of MapReduce.
2 5 L2
Explain composing of MapReduce for different types of calculations.
3 5 L2
Explain cascade steps for multiplication of two matrices.
4 5 L2
Explain main features of Hive.
5 5 L2
BATCH 3
Draw and explain architecture of Hive.
1 5 L2
Draw and explain Hive data flow sequences and workflow steps.
2 5 L2
Explain Hive data definition language.
3 5 L2
Write features of Pig.
4 5 L2
Draw and explain Pig architecture.
5 5 L2
ASSIGNMENT - 5
Module – 5
Q. No.   Description of Questions   Marks   RBT Level
BATCH 1
Explain linear and nonlinear relationships between data points.
1 5 L2
Explain the terms outliers and variance.
2 5 L2
Explain how to calculate standard deviation and standard error.
3 5 L2
Explain simple linear regression.
4 5 L2

List and explain examples of modeling using regression.
5 5 L2


BATCH 2
Explain applications of text mining.
1 5 L2
Draw and explain the text mining process.
2 5 L2
Explain feature generation phase in text mining.
3 5 L2
Explain Naive Bayes analysis.
4 5 L2
Explain how support vector machine can be used for classification.
5 5 L2
BATCH 3
Draw and explain taxonomy of web mining.
1 5 L2
Explain different tasks in web content analysis.
2 5 L2
Draw and explain web usage mining.
3 5 L2
Explain the PageRank algorithm using in-degrees.
4 5 L2
Explain centralities, ranking and anomaly detection in a social network graph.
5 5 L2
