Final BD #1
CONTROLLER OF EXAMINATIONS
NOTE: This admit card is valid subject to fulfillment of Rules & Regulations as per BRAINWARE UNIVERSITY Act regarding Examinations.
YOU ARE ELIGIBLE TO APPEAR FOR THE FOLLOWING SUBJECTS AS MENTIONED IN EXAMINATION FORM
AND VERIFIED BY UNIVERSITY
Instructions:
1. HDFS (Hadoop Distributed File System) is the primary storage system used by Hadoop, a distributed processing framework for Big Data. It breaks large datasets into smaller blocks and distributes them across multiple nodes in a cluster for parallel processing. HDFS provides fault tolerance, scalability, and high-throughput access to data.
2. YARN (Yet Another Resource Negotiator) is the resource management layer in Hadoop. It separates job scheduling and resource management from data processing, which allows multiple applications to share resources in a Hadoop cluster. YARN enables more flexible and efficient resource utilization than the earlier version of Hadoop, where MapReduce handled both processing and resource management.
3. MapReduce is a programming model and processing engine used for distributed computing on large datasets. It divides tasks into two phases, Map and Reduce. The Map phase processes and filters the input data, while the Reduce phase aggregates and summarizes the results. MapReduce is a key component of the Hadoop ecosystem.
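The ideas above can be illustrated with a minimal single-machine sketch. It is not Hadoop code and uses no Hadoop API. The function names (split_into_blocks, map_phase, shuffle, reduce_phase, word_count) and the tiny block size are assumptions chosen for illustration. The input is cut into blocks, loosely as HDFS does with a large file. Each block is mapped to (word, 1) pairs, the pairs are grouped by word, and the Reduce step sums each group.

```python
from collections import defaultdict


def split_into_blocks(lines, block_size):
    """Cut the input into fixed-size blocks (a loose stand-in for HDFS blocks)."""
    return [lines[i:i + block_size] for i in range(0, len(lines), block_size)]


def map_phase(block):
    """Map: emit a (word, 1) pair for every word in the block."""
    return [(word, 1) for line in block for word in line.lower().split()]


def shuffle(pairs):
    """Group the emitted values by key so each word reaches one reducer."""
    grouped = defaultdict(list)
    for word, count in pairs:
        grouped[word].append(count)
    return grouped


def reduce_phase(grouped):
    """Reduce: aggregate the values collected for each word."""
    return {word: sum(counts) for word, counts in grouped.items()}


def word_count(lines, block_size=2):
    """Run the whole pipeline sequentially; a real cluster would map blocks in parallel."""
    pairs = []
    for block in split_into_blocks(lines, block_size):
        pairs.extend(map_phase(block))
    return reduce_phase(shuffle(pairs))


if __name__ == "__main__":
    sample = ["big data big", "data lake", "big cluster"]
    print(word_count(sample))
```

In a real Hadoop cluster the blocks would live on different nodes, the Map tasks would run in parallel close to the data, and YARN would allocate the resources for them. Here everything runs in one process, only to show the data flow.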
The five V’s of Big Data are:
a. Volume: Refers to the vast amount of data generated and processed.
b. Velocity: Describes the speed at which data is generated, processed, and analyzed, often in real time.
c. Variety: Encompasses the diverse types of data, including structured, semi-structured, and unstructured.
d. Veracity: Focuses on the accuracy and reliability of data.
e. Value: Stresses the importance of extracting meaningful insights and value from Big Data.
(Reported Against) case: -
Download date from student self-service: Dec 27, 2023 at 07:24 PM