
FILLED BY THE STUDENT:

Student’s Name: Tarandeep singh

Student’s ID Number: 5354120

Date: 2022-01-03

Mid-Term Exam (30%)

PROFESSOR: Oussama Derbel


SECTION: 11112

EXAM RULES:

• All students must have an ID to confirm their identity.
• No student will be allowed to enter the evaluation room 20 minutes after the evaluation has started.
• Students may not leave the evaluation room during the exam period for any reason.
• Any student who arrives late will not be given any extra time to complete his or her evaluation.
• Students may be assigned a specific desk/location by the teacher.
• Students may not bring any food or drink other than water into the evaluation room.
• All communication devices, including but not limited to cell phones, smart phones, smart watches, iPods, pagers and Web-accessible electronic devices, must be turned off and left at a place designated by the teacher. Failure to do so may lead to the removal of the evaluation.
• Cheating attempts or any assistance offered to others will merit a mark of zero on the evaluation. This includes, but is not limited to, speaking or looking around the evaluation room. In this case, the teacher will seize the evaluation documents and submit a written report to the Program Coordinator.

FILLED BY THE PROFESSOR:

Evaluated Competencies: Use Hadoop components
Time Allowed: 1h30
Materials Allowed: Yes
Total Mark: 100
Mark Obtained:
Big Data
420-BZ2-GX
STUDENT’S NAME:_____________ _____________________________________________________________________________________________________________

This Exam paper should be uploaded on Omnivox via Lea (No Mio)

Exercise 1 (40%):

a- What is Data?
Ans- On a computer, data is information that has been translated into a form
that is efficient for movement or processing. With regard to modern computers
and transmission media, data is information converted into binary digital
form. The word "data" may be used as either a singular or a plural subject.
Raw data is the term for data in its most basic digital format.
b- What is Big Data?
Ans- Big data refers to large, diverse sets of information that grow at
ever-increasing rates. It covers the volume of information, the velocity or
speed at which it is created and collected, and the variety or scope of the
data points being covered.
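c- What is information?
Ans- Information is data that has been processed, organized, or interpreted
so that it carries meaning in a given context. In the Big Data setting, the
data sets are so large and complex that traditional data-processing software
cannot capture, filter, manage, and process them within a reasonable amount
of time; analyzing them yields information that can be used, for example, to
predict and analyze user behavior.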
d- What is Hadoop?
Ans- Apache Hadoop is an open source framework used to store and
process large data sets ranging from gigabytes to petabytes of data. Instead
of using a single large computer to store and process data, Hadoop allows
a cluster of multiple computers to analyze large data sets in parallel, and
therefore faster.
e- List the 5 components of the Hadoop ecosystem and briefly describe the
functionality of each component:
Ans- The following components collectively form the Hadoop ecosystem:
• HDFS (Hadoop Distributed File System): distributed storage of data across the nodes of the cluster.
• YARN (Yet Another Resource Negotiator): cluster resource management and job scheduling.
• MapReduce: programming-based data processing.
• Spark: in-memory data processing.
• Pig, Hive: query-based processing of data.


Exercise 3 (70%):

Describe the three operations (Map, Combine, Reduce) to count occurrences of
each word across these five files:

File 1: I’m Indian student.

File 2: I live in Montreal.

File 3: I’m learning Big data subject.

File 4: I love India.

File 5: I love Canada.
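
The three operations can be illustrated with a minimal single-process sketch in plain Python. It only simulates the data flow and is not Hadoop code: the function names (`map_phase`, `combine_phase`, `shuffle`, `reduce_phase`) and the tokenization rule (lowercase, punctuation dropped, "I’m" kept as one word) are assumptions made for illustration. An explicit shuffle step is included because the reducer needs all counts for a word grouped together.

```python
import re
from collections import Counter, defaultdict

# The five input files from the exercise, one string per file.
FILES = {
    "File 1": "I’m Indian student.",
    "File 2": "I live in Montreal.",
    "File 3": "I’m learning Big data subject.",
    "File 4": "I love India.",
    "File 5": "I love Canada.",
}

def tokenize(text):
    # Assumption: words are lowercased; the typographic apostrophe is
    # normalized so that "I’m" becomes the single token "i'm".
    return re.findall(r"[a-z']+", text.lower().replace("’", "'"))

def map_phase(text):
    # Map: emit one (word, 1) pair per word occurrence in a file.
    return [(word, 1) for word in tokenize(text)]

def combine_phase(pairs):
    # Combine: pre-aggregate the pairs of ONE mapper locally, which
    # reduces the amount of data sent to the reducers.
    local = Counter()
    for word, count in pairs:
        local[word] += count
    return list(local.items())

def shuffle(all_pairs):
    # Shuffle: group the partial counts from all mappers by word.
    grouped = defaultdict(list)
    for word, count in all_pairs:
        grouped[word].append(count)
    return grouped

def reduce_phase(grouped):
    # Reduce: sum the partial counts of each word.
    return {word: sum(counts) for word, counts in grouped.items()}

def word_count(files):
    combined = []
    for text in files.values():
        combined.extend(combine_phase(map_phase(text)))
    return reduce_phase(shuffle(combined))

counts = word_count(FILES)
for word in sorted(counts):
    print(word, counts[word])
```

Under these tokenization assumptions, "i" is counted 3 times, "i'm" and "love" twice each, and every other word once. Since no word repeats within a single file here, the Combine step leaves each mapper's output unchanged; it only pays off when a file contains the same word several times.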
