You are on page 1of 1

KLS Gogte Institute of Technology, Belgaum

Department of Computer Science and Engineering


Academic Year: 2014-15

Assignment

Subject: Managing the Big Data


M.Tech 2nd SEM

Code: 14SCS21 Semester:

Answer the following questions


1. Define Aggregate and explain aggregate data models.
2. Differentiate between key value and document data models.
3. Explain the relationships and graph database with an example.
4. Discuss Schemaless databases and materialized views.
5. What do you mean by sharding? Explain in detail.
6. Differentiate master slave replication and peer to peer replication.
7. Define consistency and explain relaxing consistency.
8. Explain need of version stamps for business and system transaction.
9. Explain a two stage Map Reduce with an example.
10. Write the unit tests with MR unit for the mapper and the reducer
with an example of Max temperature.
11. Discuss how job is run on a local job runner with an example.
12. Explain how Hadoop runs a Map Reduce job using a classic
framework.
13. Discuss failures in classic Map Reduce in YARN.
14. Explain the following:
i) Job Scheduling

ii) Shuffle and Sort.

15. Describe the following in detail:


i) Task Environment Properties ii) Speculative Execution.
16. Explain in brief all Input and Output formats that Hadoop provides
to process data.

You might also like