Professional Documents
Culture Documents
S-MATH001 Data Analytics for Engineering CEE14 T06 2nd Sem SY 2021-2022
SUMMATIVE ASSESSMENT (FINAL EXAM) DATA ANALYSIS FOR
ENGINEERING
End of quiz
You are at the end; press Finished to complete and grade the quiz.
You can review your answers below and click Edit if you want to change any.
Finished
Question 1
Which of the following values of correlation coefficient r show moderate correlation?
Response: -0.54
Question 2
Correlation analysis is a measure of causal relationship between two variables.
Response: FALSE
Question 3
If the coefficient of determination is 0.81, the correlation coefficient is
Question 4
Data collected to study the relationship between child obesity and parental
Child
Obese Non-Obese
Parent
Non-obese 16 21
What is the decision if the computer output revealed:
χ² Tests
Value df p
χ² 1.073 1 0.3004
N 100
Question 5
Regression modeling is a statistical framework for developing a mathematical equation that
describes how
Response: one response and one or more explanatory variables are related
Question 6
The relation between y = ideal weight (lbs) and x =actual weight, based on data from n = 119 women, resulted in the
regression line .
Which choice is not an appropriate term for the x variable in a regression equation?
Response: dependent variable
Question 7
When the F test is used to test for the equality of a set of population means, if the null
hypothesis is rejected then all of the population means are declared to differ from one
another.
Response: False
Question 8
A study was conducted to test if there is a difference between the blood sugar
level of factory workers and bank managers. The result are as follows;
Response:
Since the P-value is less than 0.05, reject the null hypothesis and conclude that
there is a significant difference between the blood sugar level of factory workers and
bank managers.
Question 9
Which of the following is true about scatter plot?
Question 10
The relation between y = ideal weight (lbs) and x =actual weight, based on data from n = 119 women, resulted in the
regression line .
The interpretation of the value 0.70 in the regression equation for this question is
Response: The estimated increase in average ideal weight for an increase of one pound in actual weight.
Question 11
What should be the value/s of r to have a perfect correlation?
Response: 1 and -1
Question 12
The relation between y = ideal weight (lbs) and x =actual weight, based on data from n = 119 women, resulted in the
regression line .
The estimated ideal weight for a women who weighs 118 pounds is
Response: 127.6
Question 13
Eight tomato plants, of the same variety were selected at random and treated, weekly, with a solution in
which x grams of fertilizer was dissolved in a fixed quantity of water. The yield y kilograms of tomatoes
were recorded.
If 8.57 kilograms of tomatoes were yielded, how much fertilizer was used?
Response: 12.51
Question 14
Data collected to study the relationship between child obesity and parental
Child
Obese Non-Obese
χ² Tests
Value df p
χ² 1.073 1 0.3004
χ² Tests
Value df p
N 100
Response: There is no significant relationship between the obesity of the children and their parents.
Question 15
The Bieber Manufacturing Co. operates 24 hours a day, five days a week. The workers
rotate shifts each week. Todd Bieber, the owner, is interested in whether there is a
difference in the number of units produced when the employees work on various shifts. A
sample of five workers is selected and their output recorded on each shift.
Is there a significant difference in the number of produce when employees work on various
shifts?
Output:
Descriptives
Std.
N Mean
Deviation
ANOVA
Sum of
df Mean Square F Sig.
Squares
Between
62.533 2 31.267 4.860 .028
Groups
Within
77.200 12 6.433
Groups
Total 139.733 14
What is the computed value of test statistic and the appropriate decision rule?
Question 16
Five sets of identical twins were selected at random from a population of
identical twins. One child was selected at random from each pair to form an "experimental
group." These five children were sent to school. The other five children were kept at home
as a control group. At the end of the school year the following IQ scores were obtained.
Does this evidence justify the conclusion that lack of school experience has a depressing
effect on IQ scores? Analyze the data with the Wilcoxon Signed-Rank Test
Experimental Control
1 110 112
2 125 120
3 139 128
4 142 135
5 127 126
If the W statistic is 2.0 with a p-value of 0.138 then what is the decision?
Response: the experimental (school) group tends to have lower IQs than the control (home)group
Question 17
The effectiveness of advertising for two rival products (Brand X and
was carried out, with the participants being shown adverts for two rival
brands of coffee, which they then rated on the overall likelihood of them
buying the product (out of 10, with 10 being "definitely going to buy the
product"). Half of the participants gave ratings for one of the products,
Question 18
Data collected to study the relationship between child obesity and parental
Child
Obese Non-Obese
Parent
Response: the proportion of obese children is the same for obese and nonobese parents
Question 19
If the coefficient of determination is a positive value very near to one , then the regression equation
Question 20
Suppose we developed the following simple linear regression equation: y = 3.5 + 2.1x, which of the following
statement is correct?
Question 21
In regression analysis, the variable that is being predicted is the
Question 22
/files/978293/AutoMPG.xlsx
Used AutoMPG.xlxs
Set Cylinder, displacement, horsepower, weight, and age in years as independent variables.
INTERPRET
Response:
1. The R2 value of 0.809 tells us that 80.9% of the variation in MPG (which is the dependent variable)
is best explained by the set of independent variables such as Cylinder, displacement, horsepower, weight, and age
in years.
2. Result of the ANOVA indicates that the regression model predicts the dependent variable significantly well.
The resulting significance / p -value of the model is less than 0.05, and indicates that, overall,
the regression model statistically significantly predicts the outcome variable is a good fit for data
Question 23
In regression, the equation that describes how the response variable (y) is related to the
Question 24
Eight tomato plants, of the same variety were selected at random and treated, weekly, with a solution in
which x grams of fertilizer was dissolved in a fixed quantity of water. The yield y kilograms of tomatoes
were recorded.
Response: 0.944
Question 25
Data collected to study the relationship between child obesity and parental
Child
Obese Non-Obese
Parent
Response: 50
Question 26
The relation between y = ideal weight (lbs) and x =actual weight, based on data from n = 119 women, resulted in the
regression line .
Response: 45
Question 27
The manager of ABC apparels wanted to know the relationship between TV spot ads per week and the
number of stores that made orders from the company. Compiled data are as follows:
Tv ads/week (X) 14 19 14 20 22 24 27 42 40
Question 28
The relation between y = ideal weight (lbs) and x =actual weight, based on data from n = 119 women, resulted in the
regression line .
Response: 0.70
Question 29
The Bieber Manufacturing Co. operates 24 hours a day, five days a week. The workers
rotate shifts each week. Todd Bieber, the owner, is interested in whether there is a
difference in the number of units produced when the employees work on various shifts. A
sample of five workers is selected and their output recorded on each shift.
Is there a significant difference in the number of produce when employees work on various
shifts?
Output:
Descriptives
Std.
N Mean
Deviation
ANOVA
Sum of
df Mean Square F Sig.
Squares
Between
62.533 2 31.267 4.860 .028
Groups
Within
77.200 12 6.433
Groups
Total 139.733 14
Question 30
If the correlation coefficient is 0.8, the percentage of variation in the response variable explained
Response: 64%
Question 31
The effectiveness of advertising for two rival products (Brand X and
was carried out, with the participants being shown adverts for two rival
brands of coffee, which they then rated on the overall likelihood of them
buying the product (out of 10, with 10 being "definitely going to buy the
product"). Half of the participants gave ratings for one of the products,
If another brand is added, BRAND Z, and another group of respondents was added to rate BRAND Z, then what
statistical tool should be used?.
Which choice is most likely to be approximately the value of r2, the proportion of variation in y explained by x?
Response: 99.5%
Question 33
In a one-way ANOVA, if the computed F value exceeds the critical F value, what decision can be made about the
null hypothesis?
Response: Reject H0 since there is evidence that is a significant difference among means
Question 34
Which
1. of the following scatter diagrams indicate strong positive correlation?
Response:
Question 35
Which of the following values of correlation coefficient r show strong correlation?
Response: -0.91
Question 36
Which of the following kinds of data can be analyzed with Nonparametric procedures?
Response: rank