
UNIT 1

SNO | QUESTION | OPTION1 | OPTION2 | OPTION3 | OPTION4 | ANSWER


1 | What are the main components of big data? | HDFS | MapReduce | YARN | All of the above | All of the above
2 | Data whose size is measured in ____bytes is called big data | Meta | Giga | Tera | Peta | Peta
3 | Bank transaction data is a type of ____ | Unstructured data | Structured data | Both a and b | None of the above | Structured data
4 | The total number of forms of big data is ____ | 1 | 2 | 3 | 4 | 3
5 | Identify the incorrect big data technology. | Apache Pytorch | Apache Kafka | Apache Hadoop | Apache Spark | Apache Pytorch
6 | ____ is a collection of data that is huge in volume, yet growing exponentially with time | Big Database | Big DBMS | Big Datafile | Big Data | Big Data
7 | Identify which of the options below is a general-purpose computing model and runtime system for distributed data analytics. | HDFS | MapReduce | Oozie | All of the above | MapReduce
8 | Choose the primary characteristics of big data among the following | Value | Variety | Volume | All of the above | All of the above
9 | What does a simple linear regression analysis examine? | The relationship between only two variables | The relationship between one dependent and one independent variable | The relationship between many variables | The relationship between two dependent and one independent variable | The relationship between only two variables
10 | What does a multiple linear regression analysis examine? | The relationship between more than one dependent and only one independent variable | The relationship between one or more than one dependent and only one independent variable | The relationship between more than one independent variables | The relationship between one dependent and more than one independent variables | The relationship between one dependent and more than one independent variables
11 | The correlation coefficient is used to determine: | A specific value of the y-variable given a specific value of the x-variable | A specific value of the x-variable given a specific value of the y-variable | The strength of the relationship between the x and y variables | All of the above | The strength of the relationship between the x and y variables
12 | Which of the following functions is used by logistic regression to map its output to a probability in the range [0, 1]? | Sigmoid | Mode | Square | All of the above | Sigmoid
13 | To test for a linear relationship between the continuous variables y (dependent) and x (independent), which of the following plots is best suited? | Scatter plot | Bar chart | Histogram | All of the above | Scatter plot
14 | Which of the following indicates a fairly strong relationship between X and Y? | Correlation coefficient = 0.9 | The p-value for the null hypothesis Beta coefficient = 0 is 0.0001 | The t-statistic for the null hypothesis Beta coefficient = 0 is 30 | None of the above | Correlation coefficient = 0.9
15 | In a simple linear regression model (one independent variable), if we change the input variable by 1 unit, by how much will the output variable change? | By 1 | No change | By its slope | None of the above | By its slope
16 | “Velocity” in Big Data means | Speed of input data generation | Speed of individual machine processors | Speed of ONLY storing data | Speed of storing and processing data | Speed of storing and processing data
17 | The term Big Data first originated from: | Stock Markets Domain | Banking and Finance Domain | Genomics and Astronomy Domain | Social Media Domain | Genomics and Astronomy Domain
18 | ____ are example(s) of Real Time Big Data Processing | Complex Event Processing (CEP) platforms & Bank fraud transactions detection | Stock market data analysis | transactions detection | Stock transactions | Complex Event Processing (CEP) platforms & Bank fraud transactions detection
19 | Sliding window operations typically fall in the category of ____. | OLTP Transactions | Big Data Batch Processing | Big Data Real Time Processing | Small Batch Processing | Big Data Real Time Processing
20 | Big data analysis does all of the following except: | Collects data | Spreads data | Organizes data | Analyzes data | Spreads data
21 | The examination of large amounts of data to see what patterns or other useful information can be found is known as | Data examination | Information analysis | Big data analytics | Data analysis | Big data analytics
22 | Which of the following characteristics of big data is of relatively more concern to data science? | Velocity | Variety | Volume | None of the mentioned | Variety
23 | Which of the following steps is performed by a data scientist after acquiring the data? | Data Cleansing | Data Integration | Data Replication | All of the mentioned | Data Cleansing
24 | In regression analysis, the variable that is being predicted is: | the independent variable | the dependent variable | usually denoted by x | usually denoted by r | the dependent variable
25 | If the slope of the regression equation y = b0 + b1x is positive, then: | as x increases y decreases | as x increases so does y | Either a or b is correct | as x decreases y increases | as x increases so does y
26 | In the regression equation y = b0 + b1x, b0 is the: | slope of the line | independent variable | y intercept | coefficient of determination | y intercept

27 | What is the process of storing and managing data in a way that allows for efficient retrieval and analysis? | Data Warehousing | Data Mining | Data Integration | Data Processing | Data Warehousing
28 | Which of the following is a popular NoSQL database used for Big Data processing? | MySQL | PostgreSQL | Oracle | MongoDB | MongoDB
29 | What is the process of combining data from multiple sources into a single, unified view? | Data Mining | Data Warehousing | Data Integration | Data Processing | Data Integration
30 | What is the term used for the ability of a system to handle increasing amounts of data and traffic without compromising performance? | Scalability | Reliability | Availability | Security | Scalability
31 | What is the process of cleaning and transforming data before it is used for analysis? | Data Mining | Data Warehousing | Data Integration | Data Processing | Data Processing
