Professional Documents
Culture Documents
KB (N)
Facebook (N)
Google (N)
3 Satellite gives
Minimum data (N)
10^15 (Y)
10^17 (N)
4 Petabyte=
10^18 (N)
10^19 (N)
10^15 (N)
10^17 (N)
5 Exabyte=
10^18 (Y)
10^19 (N)
10^15 (N)
10^17 (N)
6 Zettabyte=
10^18 (N)
10^21 (Y)
10^21 (N)
10^22 (N)
7 Yottabyte=
10^23 (N)
10^24 (Y)
10^21 (N)
10^22 (N)
8 Padma=
10^23 (N)
10^32 (Y)
9 2.5 petabytes =
downloaded documents of 1 day (N)
10 4.75 exabytes=
genome sequences of 9 billion people (N)
11 422 exabytes=
Total digital data created in 2010 (N)
structured (N)
unstructured (N)
14 Structured data
variable form (N)
structured (N)
Un structured (Y)
16 Unstructured data is
fixed format (N)
Filter (N)
Reliable (N)
19 Veracity means
manage data (N)
sensors (N)
Hardware (N)
CPU (N)
processor (N)
memory (Y)
29 Storage provides
demanding data (N)
pipes (Y)
30 Network provides
demanding data (N)
local (N)
close (N)
35 HDFS provides
Both easier and difficult access (N)
36 HDFS is highly
Both fault tolerant and designed using low-cost hardware (Y)
permissions (N)
authentication (N)
64MB (Y)
64 GB (N)
read-write operations (Y)
39 Datanodes perform
write operations (N)
computation (N)
analysis (N)
parallel (Y)
sequential (N)
MapReduce (N)
HDFS (N)
volume (N)
velocity (N)
Facebook (Y)
apple (N)
Structured (N)
Velocity (N)
variety (N)
Volume (N)
HDFS (N)
Hadoop (Y)
Cloud (N)
Validation (Y)
Verification (N)
MAPPER (N)
REDUCER (Y)
PARTITIONER (N)
Graphs (N)
A is a decision support tool that uses a tree-
like graph or model of decisions and their
53
possible consequences, including chance event
outcomes, resource costs, and utility. Trees (N)
Disks (N)
Squares (Y)
54 Decision Nodes are represented by
Circles (N)
Triangles (N)
Disks (N)
Squares (N)
Triangles (N)
Disks (N)
Squares (N)
Triangles (Y)
58 classification is
clustering process (N)
Dependant (N)
In the context of KDD and data mining, this refers to random errors in
a database table. (N)
62 Node is
One of the defining aspects of a data warehouse (N)
Assumes that all the features in a dataset are equally important (N)
Scalability (N)
66 Requirements of cluster analysis Minimal requirements for domain knowledge to determine input
parameters (N)
Google(Y)
Youtube(N)
TB(N)
YB(N)
EB(N)
Google(N)
NetFlix(N)
CERN(N)
Open-Source(N)
Scalability(N)
None of these(N)
possible(Y)
impossible(N)
None of these(N)
1(N)
2(Y)
None of these(N)
hybrid system(N)
itemset filtering(N)
none of these(N)
Mapper(Y)
Reducer(N)
task(N)
output(N)
none(N)
structured(N)
unstructured(Y)
None of above(N)
Cassandra(N)
Scylla(N)
PostgreSQL(Y)
Uses JSON(Y)
Needs a schema(N)
Network(N)
Distributed(N)
Object-oriented(N)
Field(Y)
Database(N)
Document(N)
High availability(Y)
Low availability(N)
None of above(N)
Scalability(N)
Relational data(Y)
Document databases.(N)
Key-value stores(N)
89
Key-value(Y)
Document(N)
ALWAYS True(N)
ALWAYS False(N)
Analytics(N)
Data mining(N)
Data Warehouse(N)
Weather forecasting(N)
Marketing(N)
Check below the best answer to “which industries
95 employ the use of so-called “Big Data” in their
day to day operations? Healthcare(N)
It is a distributed framework(N)
Data Node(N)
NameNode(Y)
Replication(N)
Hive(N)
Imphala(N)
Scala(N)
Data Node(N)
NameNode(Y)
Replication(N)
unstructured(Y)
structured(N)
Creation of a record(N)
Modification of a record(N)
sequence of data items that arrive in some order and may be seen only
once.(Y)
sequence of data items that arrive in some order and may be seen
twice.(N)
A Bloom filter always returns TRUE when testing for a previously added
element(Y)
master-slave fashion(Y)
slave-master fashion(N)
web traffic(N)
internet(N)
None of these(Y)
financial applications(N)
network monitoring(N)
web application(N)
document(N)
key-value(Y)
simple(N)
mapped, reduce(N)
mapping, Reduction(N)
Map, Reduce(Y)
continuous queries(N)
none of these(N)
MongoDB(Y)
Oracle (N)
Not SQL(N)
No usage of SQL(N)
Google (N)
NetFlix(N)
None of these(N)
Twitter(Y)
Facebook(N)
WhatsAPP(N)
column based(N)
119 MongoDB is
document based(Y)
graph based(N)
Local file(N)
HDFS(N)
high in size(N)
speed of data(N)
data in certain(Y)
Cassandra(Y)
Riak(N)
Redis(N)
Larry Page(N)
Bill Gates(N)
poor results(N)
poor data(N)
All of these(Y)
content(N)
collaborative(N)
All of these(Y)
associations(N)
All of these(Y)
cross-marketing(N)
All of these(Y)
coherent signals(N)
packets of data(N)
All of these(Y)
analyzes data(N)
correlates data(N)
All of these(Y)
unbounded in size(N)
All of these(Y)
likely structured(N)
Security applications(N)
All of these(Y)
All of these(Y)