Big Data Analytics

Assignment_1

1. Explain the HDFS design features and important aspects of the Hadoop Distributed
File System.
2. What do you understand by HDFS? Explain its components with a neat diagram.
3. Explain the various system roles in an HDFS deployment.
4. Explain the concept of HDFS block replication, with an example.
5. Briefly explain HDFS NameNode Federation, NFS Gateway, snapshots, checkpoints,
and backups.
6. Explain NameNode High Availability with a neat diagram.
7. Explain the commands needed to perform basic operations within HDFS with an
example.
8. How does the Hadoop MapReduce data flow work for a word count program? Give
an example.
9. Explain the TeraSort test.
10. Explain the process of running the TestDFSIO benchmark.
11. Explain Apache Hadoop Parallel MapReduce data flow.
12. Explain how to compile and run the WordCount example.
13. Explain the Apache Sqoop import and export methods with neat diagrams.
14. Explain, with a neat diagram, the Apache Oozie workflow for Hadoop architecture.
15. Explain the following operations in HBase.
i) Create Database. ii) Inspect the Database. iii) Get Table Cells. iv) Scripting input.
