1. What is the primary purpose of ZooKeeper according to the slides?
A) Manages storage and serves read/write requests.
B) Manages file metadata and regulates data access.
C) Exposes primitives for synchronization, configuration maintenance,
and groups/naming.
D) Stores metadata information.

2. Which of the following is NOT a module that HDFS consists of
according to the slides?
A) DataNodes
B) Hadoop YARN
C) Hadoop Common
D) NameNode

3. Which of the following is NOT a component of YARN according to the
slides?
A) Beeline
B) Client
C) Resource Manager
D) Node Manager

4. According to the slides, what is the primary function of Apache
Spark?
A) Fully managed wide-column and key-value NoSQL database service
in Google Cloud.
B) Distributed, open-source coordination service for distributed
applications.
C) Multi-language engine for executing data engineering, data science,
and machine learning on single-node machines or clusters.
D) Distributed event store and stream-processing platform.

5. Which stage in MapReduce shuffles the map outputs to reduce
nodes?
A) Reduce
B) Partition
C) Shuffle
D) Map

6. Which of the following is NOT a key module of cloud security
according to the slides?
A) Network Security
B) Identity and Access Management
C) Data Encryption
D) Load Balancer

7. What is the primary purpose of the NameNode according to the
HDFS architecture described in the slides?
A) Replicate data for fault tolerance.
B) Stores files in fixed blocks.
C) Serve read and write requests from clients.
D) Manages file metadata and regulates data access.

8. Which of the following is NOT a component of the YARN architecture
according to the slides?
A) Resource Manager
B) Job Tracker
C) Client
D) Node Manager

9. Which of the following uses enhanced reliability techniques
like RAID?
A) DataNodes
B) HDFS
C) Cloud storage providers
D) NameNode

10. Which of the following is NOT a characteristic of effective data
pipelines according to the slides?
A) High availability
B) Support for both batch processing and data stream processing
C) Scalable performance
D) Extensible based on standards

11. What does NFS stand for according to the slides?
A) Network File System
B) Network Foundational Services
C) Node File System
D) No File Structure

12. Which component of YARN is responsible for negotiating resources
with the Resource Manager?
A) Node Manager
B) Resource Manager
C) Client
D) Application Master

13. Which Hadoop module is responsible for job scheduling and cluster
resource management according to the slides?
A) Hadoop Common
B) Hadoop MapReduce
C) HDFS
D) Hadoop YARN

14. Which of the following is NOT an advantage of using Apache Pig
according to the slides?
A) Ease of Programming
B) Extensibility
C) Optimization Opportunities
D) Complex Programming

15. Which component of HDFS stores metadata information according
to the slides?
A) NameNode
B) Client
C) ResourceManager
D) DataNodes

16. According to the slides, what does GFS stand for?
A) General File Storage
B) Google File System
C) General File System
D) Global File Structure

17. Which technique allows dynamic allocation of resources and
scheduling for efficient large-volume data processing according to
YARN?
A) Compatibility
B) Scalability
C) Efficiency
D) Resource Management

18. Which cloud file system is automatically grown and shrunk with file
additions and removals according to the slides?
A) Amazon S3
B) Amazon EFS
C) Google File System
D) NFS

19. What does S3 stand for according to the slides?
A) Simple Storage Service
B) Structured Storage Service
C) Special Storage Solution
D) Secure Storage System

20. Which component of YARN is responsible for monitoring progress of
a single application?
A) Node Manager
B) Resource Manager
C) Client
D) Application Master

21. According to the slides, what is the primary storage location for
Hive?
A) /hdfs/hive
B) /hive/data
C) /user/hive/warehouse
D) /hive/warehouse

22. Which technique allows handling failures at the application layer for
high availability according to Hadoop?
A) Runs on Hadoop infrastructure using commodity hardware
B) Detects and handles failures at the application layer
C) Provides a client for connectivity
D) Supports SQL syntax

Answer Sheet
1. C
2. B
3. A
4. C
5. C
6. D
7. D
8. B
9. C
10. A
11. A
12. D
13. D
14. D
15. A
16. B
17. C
18. B
19. A
20. D
21. C
22. B
