1. Define Aggregate and explain aggregate data models. 2. Differentiate between key value and document data models. 3. Explain the relationships and graph database with an example. 4. Discuss Schemaless databases and materialized views. 5. What do you mean by sharding? Explain in detail. 6. Differentiate master slave replication and peer to peer replication. 7. Define consistency and explain relaxing consistency. 8. Explain need of version stamps for business and system transaction. 9. Explain a two stage Map Reduce with an example. 10. Write the unit tests with MR unit for the mapper and the reducer with an example of Max temperature. 11. Discuss how job is run on a local job runner with an example. 12. Explain how Hadoop runs a Map Reduce job using a classic framework. 13. Discuss failures in classic Map Reduce in YARN. 14. Explain the following: i) Job Scheduling
ii) Shuffle and Sort.
15. Describe the following in detail:
i) Task Environment Properties ii) Speculative Execution. 16. Explain in brief all Input and Output formats that Hadoop provides to process data.