Professional Documents
Culture Documents
Covariance
• Covariance is a measure of direction of linear
relationship between two variables.
• Positive covariance shows positive relationship
between two variables while negative covariance
describes the negative relationship between two
variables.
• 0 covariance shows no linear relationship between
two variables.
Illustration
• The details of the number of commercials and sales
(Rs. Million) are given in the following table. Find the
covariance of the two variables.
• Week No. Of Commercials Sales (Rs. Million)
1 2 50
2 5 57
3 1 41
4 3 54
5 4 54
6 1 38
7 5 63
8 3 48
9 4 59
10 2 46
Calculations of Covariance
Correlation coefficient
• Correlation coefficients are similar to covariance, the
only difference is that correlation coefficient not only
describes the nature of linear relationship between
two variables but also show the strength of the
relationship.
• The correlation coefficient ranges from −1 to
+1. Values close to −1 or +1 indicate a strong
linear relationship. The closer the correlation
is to zero, the weaker the relationship.
Correlation coefficient -
Formula
Correlation coefficient
• The formulas return a value between -1 and 1,
where:
• 1 indicates a strong positive relationship.
• -1 indicates a strong negative relationship.
• zero indicates no relationship at all.
Illustration
• Calculate the correlation co-efficient from the
following table:
Date Stock price (X) Stock price (X)
6/7/2021 1103.99 345.50
6/8/2021 1086.78 352.75
6/9/2021 1076.90 343.35
6/10/2021 1090.06 344.75
6/11/2021 1133.00 350.75
6/14/2021 1139.30 355.95
6/15/2021 1148.60 352.70
6/16/2021 1117.15 349.35
6/17/2021 1103.75 345.65
6/18/2021 1092.30 337.40
6/21/2021 1106.15 334.30
Normal distribution
• It is a distribution of a continuous random variable.
• Due to the large contribution of eighteenth-century
mathematician–astronomer Karl Gauss, it is also
known as Gaussian distribution.
• It is a very important distribution in statistics because it
comes close to fitting the actual observed frequency
distribution of many phenomenon, including human
characteristics (weights, heights, and IQs) test scores,
scientific measurements, amounts of rainfall, and other
similar values.
• The form, or shape, of the normal distribution is
illustrated by the bell-shaped normal curve.
Characteristics of the Normal
Probability Distribution
• The curve has a single peak;
it is unimodal.
• The mean lies at the centre
of the curve.
• Mean, median, and mode
have same value.
• Two tails of NPD never
touches the horizontal axis.
Normal Probability Distribution
The Standard Normal
Distribution
= 78-(4*3*6)/44-(4*3^2)
= 78-72/44-36
= 6/8
= 0.75
= 3.75 + 0.75.X
Standard error of estimates
• To measure the reliability of the estimating equation,
statisticians have developed the standard error of estimate.
• This standard error is symbolized se and is similar to the
standard deviation.
• The standard deviation is used to measure the dispersion of a
set of observations about the mean.
• The standard error of estimate, on the other hand, measures
the variability, or scatter, of the observed values around the
regression line.
𝑌−𝑌 2
• 𝑆𝑒 =
𝑛−2
Standard error of estimates
Correlation Regression
• Signifies the degree of • Indicates the causal
relationship between the relationship between
two variables. variables.
• It is limited between the • It can be more than two
two variables only.
variables.
• The variables are
interchangeable, which is • The independent and
symmetrical. dependent variables can
• Can not be helpful in not be interchangeable.
prediction. • Used for prediction.
Business Statistics
Books for business statistics
• Numerically expression
• Definite purpose
• Variables
• Data
• Population
• Sample
• Parameter
Variables
• Mean = 396/10=39.6
• The original mean, 39.6 minutes, had a middle, or central, position
among the data values: 5 of the times were less than that mean
and 5 were greater than that mean.
Day 1 2 3 4 5 6 7 8 9 10
Time (Minutes) 39 29 103 52 39 44 40 31 44 35
• New Mean = 456/10 = 45.6
• In contrast, the mean using the extreme value is greater than 9 of
the10 times, making the new mean a poor measure of central
tendency.
Median
• Mean = 45.6
• Median = 10+1/2 = 5.5th
Mode
• The mode is the value that appears most frequently. Like the
median and unlike the mean, extreme values do not affect the
mode. For a particular variable, there can be several modes or no
mode at all.
• For example, for the sample of 10 times to get ready in the
morning:
29 31 35 39 39 40 43 44 44 52
• There are two modes, 39 minutes and 44 minutes, because each of
these values occurs twice.
• However, for this sample of 14 smartphone prices offered by a
cellphone provider (stored in Smartphones ):
56 71 73 74 90 179 213 217 219 225 240 250 500 513
• There is no mode. None of the values is “most typical” because
each value appears the same number of times (once) in the data
set.
Geometric Mean
• The Geometric Mean (GM) is the average value or
mean which signifies the central tendency of the set of
numbers by finding the product of their values.
Basically, we multiply the numbers altogether and take
the nth root of the multiplied numbers, where n is the
total number of data values. For example: for a given
set of two numbers such as 3 and 1, the geometric
mean is equal to √(3×1) = √3 = 1.732.
Plot-A Plot-B
• Where
• Y = sample values of dependent variable
• ŷ = corresponding estimated values from the
regression equation
• N = number of data points
• K = number of independent variables
Standard Error of Estimates