Professional Documents
Culture Documents
1. MapReduce
2. HDFS
3. YARN
4. All of these
2. What are the five V’s of Big Data?
1. Volume
2. velocity
3. Variety
4. All of the above
3. All of the following accurately describe Hadoop, EXCEPT:
1. Open source
2. Real-time
3. Java-based
4. Distributed computing approach
4. Hadoop is good for:
a) Processing transactions (random access)
b) Massive amounts of data through parallelism
c) Processing lots of small files
d) Intensive calculations with little data
e) Low latency data access.
5. ………….. is data whose scale, distribution, diversity, and/or timeliness require the use of new
technical architectures and analytics to enable insights that unlock new sources of business value.
a)Big data (b) Mapreduce (c) Data mining (d) Hadoop
6. …………………. Charactistics at which Big data is collected and created in various formats and
sources.
1. Volume
2. velocity
3. Variety
4. All of the above
7. …………………. Is the speed or frequency at which data is collected in various forms and from
different sources for processing
1. Volume
2. velocity
3. Variety
4. All of the above
8. ………………. refers to the humungous amounts of data generated each second from social
media, cell phones, cars, credit cards, M2M sensors, photographs, video, etc.
1. Volume
2. velocity
3. Variety
4. Veracity
5. All of the above
12…………… Data that can be stored and processed in a fixed format, aka schema
1. Structured
2. Semi-structured
3. Unstructured.
4. other
13. ……………… Data that does not have a formal structure of a data model, but nevertheless it
has some organizational properties like tags and other markers t
1. Structured
2. Semi-structured
3. Unstructured.
4. Other.
14………………….. The data which have unknown form and cannot be stored in RDBMS and
cannot be analyzed unless it is transformed into a structured format.
1. Structured
2. Semi-structured
3. Unstructured.
4. Other.
15…………………. Apache open source software framework for reliable, scalable, distributed
computing of massive amount of data
(a) Big data (b) Mapreduce (c) Data mining (d) Hadoop
35. True or False? Ambari is backed by RESTful APIs for developers to easily integrate
with their own applications. True
36. Which Hadoop functionalities does Ambari provide?
Provision, manage, monitor and integrate
37. Which page from the Ambari UI allows you to check the versions of the software
installed on your cluster?
The Admin > Manage Ambari page
38. True or False? Creating users through the Ambari UI will also create the user on the
HDFS. False.
39. True or False? You can use the CURL commands to issue commands to Ambari.
True.
4. True or False? One of the driving principal of Hadoop is that the data
is brought to the program.
=>False. The program is brought to the data, to eliminate the need to move large
amounts of data.