Pollachi. (An Autonomous Institution affiliated to Anna University) Question Paper Code : OS16533 Regulation : 2014 MCA. DEGREE EXAMINATION, NOVEMBER / DECEMBER 2016 Third Semester – Master of Computer Application 140CA0503 – Big Data Analytics Duration: Three hours Answer ALL questions Maximum: 100 marks
PART – A (10 x 2 = 20 marks)
1. List the types of Big Data.
2. Define Big data stack. 3. Differentiate between “scale up” and “scale out”. 4. Write the significance of Sqoop. 5. Define HDFS. 6. Describe serialization. 7. Mention the core Components of Hadoop. 8. What does „Mapper‟? 9. What is Pig? 10. Compare „Pig‟ and „Hive‟
PART – B (1 x 16 = 16 marks)
11. (i) Discuss the characteristics of Big data (4)
(ii) Explain Big Data Management architecture. (8)
(iii) Write four Big Data analytics Applications in detail. (4)
PART – C (4 x 16 = 64 marks)
12.(a) Write short notes on
(i) Hadoop Ecosystem (12) (ii) Hadoop Releases (4) Or 12.(b) Write short notes on the following (16) i) Hadoop streaming ii) Hadoop Pipes
13.(a) What is Distributed file system? Explain the (16)
architecture of Hadoop Distributed File System. Or 13.(b) Explain the following in Hadoop: (16) i) Data integrity ii) Compression iii) Archives iv) File-based Data structures
14.(a) Analyze the weather dataset to determine (16)
maximum temperature using Map Reduce program. Or 14.(b) (i) Discuss the types of input and output formats in (8) Hadoop. (ii) Write a note on Map Reduce library classes. (8)
15.(a) (i) Explain different relational operations in “Pig ( 8)
Latin” with an example. (ii) Write a short note on Hive. (8) Or 15.(b) (i) Detail in Hbase cluster. (8) (ii) Differentiate between Hbase with RDBMS (8)