
Problem 5:

Problem Statement:
Zingaro stone printing is a company that specializes in printing images or patterns on polished or
unpolished stones. However, for the optimum level of printing of the image the stone surface has to
have a Brinell's hardness index of at least 150. Recently, Zingaro has received a batch of polished and
unpolished stones from its clients. Use the data provided to answer the following (assuming a 5%
significance level):
Dataset - Link

Assumption & Solution:


5.1 Earlier experience of Zingaro with this particular client is favourable as the stone
surface was found to be of adequate hardness. However, Zingaro has reason to
believe now that the unpolished stones may not be suitable for printing. Do you think
Zingaro is justified in thinking so?
Step 1: Define the null and alternative hypotheses

Let µ be the mean Brinell hardness index of the unpolished stones.

- Ho: µ >= 150 (the unpolished stones have adequate hardness)
- Ha: µ < 150 (the unpolished stones are not suitable for printing)

Step 2: Decide the significance level

Here we select α = 0.05 as given in the problem statement

Step 3: Identify the test statistic

- The population standard deviation is not known and there is a single sample, so we use a one-sample t-test.
- The sample size is n = 75, so the degrees of freedom are n - 1 = 74.

Step 4: Calculate the test statistics and p-value

t-statistic = -4.165, one-tailed p-value = 0.0 (rounded; the exact value is well below 0.05)
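
As a cross-check, the test statistic can be reproduced from summary figures alone. The sketch below assumes the sample mean (134.11) and sample standard deviation (33.04) that appear in the confidence interval calculation for the unpolished stones. The function name `one_sample_t_lower` is illustrative and not part of the original analysis.

```python
import math
from scipy import stats

def one_sample_t_lower(sample_mean, sample_sd, n, mu0):
    """Lower-tailed one-sample t-test computed from summary statistics."""
    se = sample_sd / math.sqrt(n)            # standard error of the mean
    t_stat = (sample_mean - mu0) / se
    p_value = stats.t.cdf(t_stat, df=n - 1)  # P(T <= t) if Ho is true
    return t_stat, p_value

# Summary figures assumed from the confidence interval calculation
t_stat, p_value = one_sample_t_lower(134.11, 33.04, 75, 150)
print(f"t = {t_stat:.3f}, one-tailed p = {p_value:.6f}")
```

With the raw data available, `scipy.stats.ttest_1samp` on the unpolished-stone column would give the same statistic directly.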

Step 5: Decide whether to reject or fail to reject the null hypothesis

As the p-value is less than α = 0.05, we reject the null hypothesis and conclude that the mean hardness
of the unpolished stones is less than 150, so they are not suitable for printing.

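Also, an approximate 95% confidence interval for the population mean hardness of the unpolished stones
can be calculated from the sample mean (134.11), the sample standard deviation (33.04) and n = 75 as

= 134.11 ± 1.96*(33.04/sqrt(75)) ≈ 134.11 ± 7.48, i.e. roughly (126.63, 141.59).
This confidence interval does not contain 150, which is the minimum value of Brinell's hardness
index required for optimum printing.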
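
The interval arithmetic can be checked with a short sketch that uses only the Python standard library. It follows the normal-approximation multiplier (about 1.96) used in the calculation above; the function name `mean_confidence_interval` is an illustrative assumption.

```python
import math
from statistics import NormalDist

def mean_confidence_interval(sample_mean, sample_sd, n, confidence=0.95):
    """Normal-approximation confidence interval for a population mean."""
    z = NormalDist().inv_cdf(0.5 + confidence / 2)  # about 1.96 for 95%
    margin = z * sample_sd / math.sqrt(n)
    return sample_mean - margin, sample_mean + margin

low, high = mean_confidence_interval(134.11, 33.04, 75)
print(f"95% CI: ({low:.2f}, {high:.2f})")
```

A t-based multiplier with 74 degrees of freedom would make the interval slightly wider, but with n = 75 the difference is small.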

Yes, Zingaro is justified in thinking that the unpolished stones are not adequate for optimum printing.

5.2 Is the mean hardness of the polished and unpolished stones the same?
Step 1: Define the null and alternative hypotheses
- Ho: Mean hardness of unpolished stones = Mean hardness of polished stones
- Ha: Mean hardness of unpolished stones ≠ Mean hardness of polished stones
Step 2: Decide the significance level
Here we select α = 0.05 as given in the problem statement
Step 3: Identify the test statistic
- The population standard deviations are not known and we have two independent samples, so we perform a
two-sample t-test.
- The sample sizes are n1 = 75 and n2 = 75, so the degrees of freedom are n1 + n2 - 2 = 148.
- A two-sample t-test is a statistical test used to compare the means of two groups. In this
case, we are interested in whether the mean hardness of polished and unpolished stones is
the same.
Step 4: Calculate the test statistics and p-value
t-statistic = 3.242, two-tailed p-value = 0.0015
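
The dataset is not reproduced here, so the following sketch only recomputes the two-tailed p-value from the reported t-statistic and the degrees of freedom. With the raw data, `scipy.stats.ttest_ind` applied to the two hardness columns would return both values directly.

```python
from scipy import stats

t_stat = 3.242        # reported t-statistic
df = 75 + 75 - 2      # n1 + n2 - 2 = 148

# Two-tailed p-value: probability of |T| being at least this large under Ho
p_two_tailed = 2 * stats.t.sf(abs(t_stat), df)
print(f"two-tailed p = {p_two_tailed:.4f}")
```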
Step 5: Decide whether to reject or fail to reject the null hypothesis
As the p-value is less than α = 0.05, we reject the null hypothesis that the mean hardness of unpolished
and polished stones is equal.
The p-value is the probability of observing a difference between the two group means at least as large
as the one in the sample, if the null hypothesis is true.
In this case, the p-value is 0.0015. This is a very small probability, so the observed difference is
unlikely to be due to chance alone: there is a statistically significant difference between the mean
hardness of polished and unpolished stones.
Hence, we can conclude that,
The mean hardness of the polished and unpolished stones is not the same.
Problem 6:
Problem Statement:
Aquarius health club, one of the largest and most popular cross-fit gyms in the country has been
advertising a rigorous program for body conditioning. The program is considered successful if the
candidate is able to do more than 5 push-ups, as compared to when he/she enrolled in the program.
Using the sample data provided can you conclude whether the program is successful? (Consider the
level of Significance as 5%)
Note that this is a problem of the paired-t-test. Since the claim is that the training will make a
difference of more than 5, the null and alternative hypotheses must be formed accordingly.
Dataset - Link

Assumption & Solution:


Step 1: Define the null and alternative hypotheses

Let

- µb represent the mean number of push-ups of the candidates before training
- µa represent the mean number of push-ups of the candidates after the training program

- Ho: µa - µb <= 5 (the mean improvement is at most 5 push-ups)
- Ha: µa - µb > 5 (the mean improvement is more than 5 push-ups)

The test is carried out at the 5% significance level (95% confidence).

Step 2: Decide the significance level

Here we select α = 0.05 as given in the problem statement

Step 3: Identify the test statistic

- The sample sizes for both samples are the same.
- We have two paired samples and we do not know the population standard deviation, so a paired t-test
is appropriate.
- The sample size is n = 100 pairs, so the degrees of freedom are n - 1 = 99.

Step 4: Calculate the test statistics and p-value

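From the data gathered, the test statistic is calculated as follows.

The figures below were obtained with scipy.stats.ttest_ind, which calculates the t-test for the means of
two independent samples given the two sets of observations. This function returns the t-statistic and
the two-tailed p-value.

It is a two-sided test of the null hypothesis that two independent samples have identical average
(expected) values, and by default it assumes that the populations have identical variances. Note that it
treats the before and after measurements as independent and tests for a difference of 0. For the paired
design and the "more than 5" claim described above, the matching procedure is a paired t-test on the
after-minus-before differences against 5 (for example scipy.stats.ttest_1samp applied to the differences
with popmean = 5).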

T-statistic = -19.32261

P-value for a single tail = 1.14602E-35
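
For comparison, a paired test of the claim in the problem note (a mean gain of more than 5 push-ups) could look like the sketch below. The `before` and `after` values are made-up illustration data, not the club's dataset, and the helper name `paired_t_greater` is hypothetical.

```python
import math
from statistics import mean, stdev
from scipy import stats

def paired_t_greater(before, after, min_gain):
    """Upper-tailed paired t-test of Ho: mean(after - before) <= min_gain."""
    diffs = [a - b for a, b in zip(after, before)]
    n = len(diffs)
    se = stdev(diffs) / math.sqrt(n)          # standard error of the mean difference
    t_stat = (mean(diffs) - min_gain) / se
    p_value = stats.t.sf(t_stat, df=n - 1)    # P(T >= t) under Ho
    return t_stat, p_value

# Hypothetical push-up counts for six candidates
before = [10, 12, 8, 15, 9, 11]
after = [17, 18, 15, 22, 14, 19]
t_stat, p_value = paired_t_greater(before, after, min_gain=5)
print(f"t = {t_stat:.3f}, one-tailed p = {p_value:.4f}")
```

Working on the per-candidate differences is what makes the test paired: each candidate acts as his or her own control, so variation between candidates does not inflate the standard error.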

Step 5: Decide whether to reject or fail to reject the null hypothesis

Since the p-value < 0.05, we reject the null hypothesis (Ho) stated in Step 1.

The program is successful if the candidate is able to do more than 5 push-ups, as compared to when
he/she enrolled in the program. From the analysis, we can see that the average number of push-ups
that the candidates could do when they enrolled in the program is around ±3.5.
Hence, we conclude that the program conducted by Aquarius health club was a success.
Problem 7:
Problem Statement:
Dental implant data: The hardness of metal implant in dental cavities depends on multiple factors,
such as the method of implant, the temperature at which the metal is treated, the alloy used as well
as on the dentists who may favour one method above another and may work better in his/her
favourite method. The response is the variable of interest.
Dataset - Link

Assumption & Solution:


7.1 Test whether there is any difference among the dentists on the implant hardness. State the
null and alternative hypotheses. Note that both types of alloys cannot be considered
together. You must state the null and alternative hypotheses separately for the two
types of alloys.
As per the problem statement, many factors govern the hardness of a dental implant. These include the
method of implant, the temperature at which the metal is treated, the alloy used, and the dentist, who
may favour one method over another.

To test whether there is a difference among dentists in terms of implant hardness, we compare the mean
hardness of the implants (the response) across the dentists who performed the procedure, separately
for each alloy.

If the analysis finds a significant difference, we can conclude that the mean response differs among
the dentists.

Hypothesis 1 (Alloy 1)

- Ho1: The mean hardness of the dental implant is the same for all 5 dentists when Alloy 1 is used
- Ha1: For at least one dentist, the mean hardness of the dental implant is different when Alloy 1 is used

Hypothesis 2 (Alloy 2)

- Ho2: The mean hardness of the dental implant is the same for all 5 dentists when Alloy 2 is used
- Ha2: For at least one dentist, the mean hardness of the dental implant is different when Alloy 2 is used

7.2 Before the hypotheses may be tested, state the required assumptions. Are the assumptions
fulfilled? Comment separately on both alloy types.
Looking at the structure of the test data, the hypotheses can be tested using ANOVA (a 2-way ANOVA
when two factors are considered together).

An ANOVA test compares the variation between the groups to the variation within the groups to
make inferences about the means of the groups.

The assumptions for this test to be accurate are

- The data within each group are normally distributed.
- The variances among the groups are equal.

If the variances are not equal or the data are not normally distributed, then the results of the test
may not be accurate.

For a 2-way ANOVA, we additionally need

- Two categorical factors of interest, and a dependent variable measured at the continuous level.
- Observations for each combination of the levels of the two factors, so that their interaction can
be assessed.
- No significant outliers.
- Independent random samples.
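
The normality and equal-variance assumptions are commonly checked with the Shapiro-Wilk and Levene tests. The sketch below is a minimal illustration on made-up hardness values for three dentists; the group labels, the numbers and the helper `check_anova_assumptions` are assumptions, and in practice the check would be run once per alloy.

```python
from scipy import stats

def check_anova_assumptions(groups):
    """groups: dict mapping a group label to a list of response values."""
    normality_p = {}
    for label, values in groups.items():
        _, p = stats.shapiro(values)           # Ho: the group is normally distributed
        normality_p[label] = float(p)
    _, levene_p = stats.levene(*groups.values())  # Ho: all group variances are equal
    return normality_p, float(levene_p)

# Hypothetical hardness readings, not taken from the dataset
groups = {
    "Dentist 1": [792, 803, 811, 798, 807, 789],
    "Dentist 2": [780, 795, 810, 802, 788, 799],
    "Dentist 3": [801, 785, 793, 808, 797, 790],
}
normality_p, levene_p = check_anova_assumptions(groups)
print(normality_p, levene_p)
```

A p-value below 0.05 in either test would indicate that the corresponding assumption is doubtful for that alloy.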

7.3 Irrespective of your conclusion in 2, we will continue with the testing procedure. What do
you conclude regarding whether implant hardness depends on dentists? Clearly state
your conclusion. If the null hypothesis is rejected, is it possible to identify which pairs
of dentists differ?

For both hypotheses (Alloy 1 and Alloy 2), the p-value is greater than 0.05, so we fail to reject the
null hypothesis. Hence, the data do not provide evidence that implant hardness depends on the dentist.

Because the null hypothesis is not rejected, there are no pairs of dentists to identify as different.
Had it been rejected, the ANOVA result alone would only tell us that there is a difference among the
groups, not which groups are different; a post-hoc pairwise comparison such as Tukey's HSD would be
needed for that.
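
A one-way ANOVA run separately for each alloy can be sketched as follows. The dataset is not reproduced here, so the records and the field names `Alloy`, `Dentist` and `Response` are assumptions made for illustration, as is the helper `anova_by_alloy`.

```python
from collections import defaultdict
from scipy import stats

def anova_by_alloy(records, factor):
    """Run a one-way ANOVA of Response on `factor`, separately per alloy."""
    by_alloy = defaultdict(lambda: defaultdict(list))
    for r in records:
        by_alloy[r["Alloy"]][r[factor]].append(r["Response"])
    results = {}
    for alloy, groups in by_alloy.items():
        f_stat, p_value = stats.f_oneway(*groups.values())
        results[alloy] = (float(f_stat), float(p_value))
    return results

# Hypothetical records: (alloy, dentist, hardness)
rows = [
    (1, 1, 800), (1, 1, 810), (1, 1, 790),
    (1, 2, 700), (1, 2, 710), (1, 2, 690),
    (2, 1, 750), (2, 1, 760), (2, 1, 740),
    (2, 2, 752), (2, 2, 760), (2, 2, 741),
]
records = [{"Alloy": a, "Dentist": d, "Response": y} for a, d, y in rows]
results = anova_by_alloy(records, "Dentist")
for alloy, (f_stat, p_value) in results.items():
    print(f"Alloy {alloy}: F = {f_stat:.2f}, p = {p_value:.4f}")
```

The same helper can be reused for the other factors by passing a different field name as `factor`.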

Analysis based on Alloy 1:

Analysis based on Alloy 2:

Plot based on the Dentists:


7.4 Now test whether there is any difference among the methods on the hardness of dental
implant, separately for the two types of alloys. What are your conclusions? If the null
hypothesis is rejected, is it possible to identify which pairs of methods differ?

For both hypotheses (Alloy 1 and Alloy 2), the p-value is less than 0.05, so we reject the null
hypothesis. Hence, we conclude that implant hardness depends on the method used.

The ANOVA result alone does not identify which pairs of methods differ, because the test only tells us
that there is a difference among the groups, not which groups are different. A post-hoc pairwise
comparison such as Tukey's HSD is needed to identify the differing pairs.
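
When an ANOVA rejects the null hypothesis, a post-hoc pairwise comparison can show which groups differ. The sketch below uses `scipy.stats.tukey_hsd` (assuming SciPy 1.8 or later) on made-up hardness values for three methods; the numbers are illustrative only and are not taken from the dataset.

```python
from scipy import stats

# Hypothetical hardness readings for three implant methods
method_1 = [800, 802, 798, 801, 799]
method_2 = [801, 799, 800, 802, 798]
method_3 = [700, 702, 698, 701, 699]

result = stats.tukey_hsd(method_1, method_2, method_3)

# result.pvalue[i, j] is the adjusted p-value for comparing group i with group j
for i in range(3):
    for j in range(i + 1, 3):
        print(f"Method {i + 1} vs Method {j + 1}: p = {result.pvalue[i, j]:.4f}")
```

In this illustration only the comparisons involving the third method come out as significant, which is the kind of pair-level statement the ANOVA F-test by itself cannot give.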

Analysis based on Alloy 1:

Analysis based on Alloy 2:

Plot based on the Method used:


7.5 Now test whether there is any difference among the temperature levels on the hardness of
dental implant, separately for the two types of alloys. What are your conclusions? If
the null hypothesis is rejected, is it possible to identify which levels of temperatures
differ?

The null hypothesis is that there is no difference in the mean hardness of the dental implant among
the temperature levels. The alternative hypothesis is that the mean hardness differs for at least one
temperature level.

For both hypotheses (Alloy 1 and Alloy 2), the p-value is greater than 0.05, so we fail to reject the
null hypothesis. Hence, the data do not provide evidence that implant hardness depends on temperature.

Since the null hypothesis is not rejected, there is no need to determine which temperature levels
differ from one another. In any case, the ANOVA result alone could not identify them; a post-hoc
pairwise comparison such as Tukey's HSD would be required.
Analysis based on Alloy 1:

Analysis based on Alloy 2:


Plot based on the temperature levels:

7.6 Consider the interaction effect of dentist and method and comment on the interaction
plot, separately for the two types of alloys?
Based on the interaction plot, the hardness of the implants appears to differ across the combinations
of dentist and method, for both alloys. A plot by itself does not establish statistical significance,
so this reading is indicative rather than a formal test result.

The plots suggest that dentist and method both influence the hardness of the final implant. The plot
does not identify which specific pairs of dentists and methods differ significantly; that requires a
formal test followed by a post-hoc pairwise comparison.
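
An interaction plot is drawn from the mean response in each dentist-method cell. As a minimal sketch using only the standard library, those cell means can be computed as below; the records, the field names and the helper `cell_means` are assumptions for illustration, and the calculation would be repeated for each alloy.

```python
from collections import defaultdict
from statistics import mean

def cell_means(records, factor_a, factor_b, response):
    """Mean response for every (factor_a, factor_b) combination."""
    cells = defaultdict(list)
    for r in records:
        cells[(r[factor_a], r[factor_b])].append(r[response])
    return {key: mean(values) for key, values in cells.items()}

# Hypothetical records: (dentist, method, hardness)
rows = [
    (1, 1, 790), (1, 1, 800), (1, 2, 780), (1, 2, 770),
    (2, 1, 760), (2, 1, 770), (2, 2, 805), (2, 2, 795),
]
records = [{"Dentist": d, "Method": m, "Response": y} for d, m, y in rows]
means = cell_means(records, "Dentist", "Method", "Response")
for (dentist, method), value in sorted(means.items()):
    print(f"Dentist {dentist}, Method {method}: mean hardness = {value}")
```

If the lines joining these means across methods are roughly parallel for all dentists, there is little sign of interaction; crossing or diverging lines, as in this made-up example, point to one.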

Plot based on Alloy 1:


Plot based on Alloy 2:

7.7 Now consider the effect of both factors, dentist, and method, separately on each alloy.
What do you conclude? Is it possible to identify which dentists are different, which
methods are different, and which interaction levels are different?

Based on the results of the 2-way ANOVA test, we conclude that dentist, method, and their interaction
have an effect on the hardness of the final implant.
This implies that we reject the null hypothesis in favour of the alternative hypothesis.
However, the ANOVA results alone do not identify which dentists are different, which methods are
different, and which interaction levels are different.
The hypothesis test only tells us that there is a difference among the groups, not which groups are
different; a post-hoc pairwise comparison such as Tukey's HSD is needed for that.

Analysis based on Alloy 1:


Analysis based on Alloy 2:
