Professional Documents
Culture Documents
Amit Singh - Ssjcet20024 - Business Statistic Assignment
Amit Singh - Ssjcet20024 - Business Statistic Assignment
S JHONDHALE COLLEGE
SSJCET20024
BUSINESS STATISTICS
AMIT SINGH/MBA MMS/SSJCET20024/S. S JHONDHALE COLLEGE
BUSINESS STATISTICS
• Median: The middle value in a dataset that is arranged in ascending order (from the
smallest value to the largest value). If a dataset contains an even number of values,
the median of the dataset is the mean of the two middle values.
• Mode: Defines the most frequently occurring value in a dataset. In some cases, a
dataset may contain multiple modes while some datasets may not have any mode at
all.
4.Under what conditions is the median more suitable than other measures of central
tendency?
• The median is usually preferred to other measures of central tendency when your
data set is skewed (i.e., forms a skewed distribution) or you are dealing with ordinal
data. However, the mode can also be appropriate in these situations, but is
not as commonly used as the median.
5."Every average has its own peculiar characteristics. It is difficult to say which average is
the best." Explain with examples.
• yes it's difficult.
• for ex:- I and u got 90 percentage and this is the average marks.so to see who is the
best among us mam said to give one answer again and she thought anyone would
answer but we both replied and it was correct so as a conclusion we can say that
"every best".
6.What do you mean by dispersion? What are the different measures of dispersion?
• Dispersion is the state of getting dispersed or spread. Statistical dispersion means the
extent to which a numerical data is likely to vary about an average value. In other
words, dispersion helps to understand the distribution of the data.
• Types of Measures of Dispersion
AMIT SINGH/MBA MMS/SSJCET20024/S. S JHONDHALE COLLEGE
There are two main types of dispersion methods in statistics which are:
1. Range: It is simply the difference between the maximum value and the minimum value
given in a data set. Example: 1, 3,5, 6, 7 => Range = 7 -1= 6
2. Variance: Deduct the mean from each data in the set then squaring each of them and
adding each square and finally dividing them by the total no of values in the data set
is the variance. Variance (σ2)=∑(X−μ)2/N
3. Standard Deviation: The square root of the variance is known as the standard
deviation i.e. S.D. = √σ.
4. Quartiles and Quartile Deviation: The quartiles are values that divide a list of numbers
into quarters. The quartile deviation is half of the distance between the third and the
first quartile.
5. Mean and Mean Deviation: The average of numbers is known as the mean and the
arithmetic mean of the absolute deviations of the observations from a measure of
central tendency is known as the mean deviation (also called mean absolute
deviation).
AMIT SINGH/MBA MMS/SSJCET20024/S. S JHONDHALE COLLEGE
7.Why is the standard deviation the most widely used measure of dispersion? Explain.
• Standard deviation is considered to be the best measure of dispersion and is
therefore, the most widely used measure of dispersion.
• It is based on all values and thus, provides information about the complete series.
Because of this reason, a change in even one value affects the value of standard
deviation.
• It is independent of origin but not of scale.
• It is useful in advance statistical calculations like comparison of variability in two data
sets.
• It can be used in testing of hypothesis.
• It is capable of further algebraic treatment.
10.“Correlation and Regression are two sides of the same coin”. Explain?
• correlation and regression are two sides of the same statistical coin. When you
measure the linear correlation of two variables, what you are in effect doing is laying
out a straight line that best fits the average "together-movement" of
these two variables.
12.What is a scatter diagram? How does it help in studying correlation between two
variables, in respect of both its nature and extent?
• The scatter diagram is a technique used to examine the relationship between both the
axis (X and Y) with one variable. In the graph, if the variables are correlated, the point
will drop along a curve or line. A scatter diagram or scatter plot, is used to give an idea
of the nature of relationship.
• Perfect Positive Correlation: Perfect Negative.
• Low Degree of Positive Correlation: Low Degree
• High Degree of Positive Correlation: High Degree.
Now, Median is the value of N/2 =143/ 2 = 71.5 item, which lies in the class (1,200-
1,400). Thus (1,200-1,400) is the median class. For determining the median in this class,
we use interpolation formula as follow:
=RS 1393.2
AMIT SINGH/MBA MMS/SSJCET20024/S. S JHONDHALE COLLEGE
18.What is a Test Statistic? What are the commonly used test statistics in hypotheses
testing?
• A test statistic is a statistic (a quantity derived from the sample) used in statistical
hypothesis testing. A hypothesis test is typically specified in terms of a test statistic,
considered as a numerical summary of a data-set that reduces the data to one
value that can be used to perform the hypothesis test.
• Concept of null hypothesis- A classic use of a statistical test occurs in process control
studies. For example, suppose that we are interested in ensuring that photomasks in
a production process have mean linewidths of 500 micrometers. The null hypothesis,
in this case, is that the mean linewidth is 500 micrometers. Implicit in this statement
is the need to flag photomasks which have mean linewidths that are either much
greater or much less than 500 micrometers. This translates into the alternative
hypothesis that the mean linewidths are not equal to 500 micrometers. This is a two-
sided alternative because it guards against alternatives in opposite directions; namely,
AMIT SINGH/MBA MMS/SSJCET20024/S. S JHONDHALE COLLEGE
that the linewidths are too small or too large.The testing procedure works this way.
Linewidths at random positions on the photomask are measured using a scanning
electron microscope. A test statistic is computed from the data and tested against pre-
determined upper and lower critical values. If the test statistic is greater than the
upper critical value or less than the lower critical value, the null hypothesis is rejected
because there is evidence that the mean linewidth is not 500 micrometers.
• One-sided tests of hypothesis- Null and alternative hypotheses can also be one-sided.
For example, to ensure that a lot of light bulbs has a mean lifetime of at least 500
hours, a testing program is implemented. The null hypothesis, in this case, is that the
mean lifetime is greater than or equal to 500 hours. The complement or alternative
hypothesis that is being guarded against is that the mean lifetime is less than 500
hours. The test statistic is compared with a lower critical value, and if it is less than
this limit, the null hypothesis is rejected.
• Thus, a statistical test requires a pair of hypotheses; namely,
• H0: a null hypothesis
• Ha: an alternative hypothesis.
• Significance levels- The null hypothesis is a statement about a belief. We may doubt
that the null hypothesis is true, which might be why we are "testing" it. The alternative
hypothesis might, in fact, be what we believe to be true. The test procedure is
constructed so that the risk of rejecting the null hypothesis, when it is in fact true, is
small. This risk, α, is often referred to as the significance level of the test. By having a
test with a small value of α, we feel that we have actually "proved" something when
we reject the null hypothesis.
AMIT SINGH/MBA MMS/SSJCET20024/S. S JHONDHALE COLLEGE
19.Distinguish between a One-tailed and Two-tailed test, give a diagram and an example
in each case.
• One tailed test- If you are using a significance level of .05, a one-tailed test allots all
of your alpha to testing the statistical significance in the one direction of interest. This
means that .05 is in one tail of the distribution of your test statistic. When using a one-
tailed test, you are testing for the possibility of the relationship in one direction and
completely disregarding the possibility of a relationship in the other direction. Let’s
return to our example comparing the mean of a sample to a given value x using a t-
test. Our null hypothesis is that the mean is equal to x. A one-tailed test will test either
if the mean is significantly greater than x or if the mean is significantly less than x, but
not both. Then, depending on the chosen tail, the mean is significantly greater than or
less than x if the test statistic is in the top 5% of its probability distribution or bottom
5% of its probability distribution, resulting in a p-value less than 0.05. The one-tailed
test provides more power to detect an effect in one direction by not testing the effect
in the other direction. A discussion of when this is an appropriate option follows.
• Two tailed test- If you are using a significance level of 0.05, a two-tailed test allots half
of your alpha to testing the statistical significance in one direction and half of your
alpha to testing statistical significance in the other direction. This means that .025 is
AMIT SINGH/MBA MMS/SSJCET20024/S. S JHONDHALE COLLEGE
in each tail of the distribution of your test statistic. When using a two-tailed test,
regardless of the direction of the relationship you hypothesize, you are testing for the
possibility of the relationship in both directions. For example, we may wish to
compare the mean of a sample to a given value x using a t-test. Our null hypothesis is
that the mean is equal to x. A two-tailed test will test both if the mean is significantly
greater than x and if the mean significantly less than x. The mean is considered
significantly different from x if the test statistic is in the top 2.5% or bottom 2.5% of
its probability distribution, resulting in a p-value less than 0.05.
AMIT SINGH/MBA MMS/SSJCET20024/S. S JHONDHALE COLLEGE