Professional Documents
Culture Documents
(SBA)
sridhar.v@imthyderabad.edu.in
sridhar.vaithianathan@gmail.com Mobile: 99899 04245
Recap
Population and Sample,
Datasets: Elements, Variables and Observations.
Scales of Measurements
(Nominal, Ordinal, Interval and Ratio)
Understanding DATA
Tabular Summary
Graphical Representation
Graphical Summaries
Bar Chart Categorical
Pie Chart Variables
Histogram Numerical
Box Plot Variables
Retails Case Dataset
Graphical Summaries
Frequency of ITEM Types
Item Type Count of Item_Type
Baking Goods 647
Breads 251
Breakfast 110
Canned 649
Dairy 676
Frozen Foods 860
Fruits and Vegetables 1232
Hard Drinks 214
Health and Hygiene 278
Household 637
Meat 425
Others 161
Seafood 64
Snack Foods 1199
Soft Drinks 449
Starchy Foods 148
Grand Total 8000
BAR Chart - Frequency of ITEM Types
1232
1199
860
676
649
647
637
Total
449
425
278
251
214
161
148
110
64
s s st ry s s ks e ld t s d s ks s
od
d ed d le n ea er o d d
a
kf
a n ai o n e o o o n o
o re n D Fo ab ri gi eh M th af Fo ri Fo
g
G B ea Ca n et D
H
y
u
s O
Se k D y
in Br ze eg ar
d
d o ac ft ch
ak ro
V H an
H
Sn So r
B F d ta
an lt
h S
s
it ea
u H
Fr
PIE CHART - Item FAT Content Type
Re
gul
ar Lo
[PE w
RC Fat
EN [PE
TA RC
GE EN
] TA
GE
]
Low Fat 4993
Regular 3007
Grand Total 8000
BINS (10) Frequency
From (Rounded) To (Rounded)
$ 33 $ 1,339 3104
$ 1,339 $ 2,644 2301
$ 2,644 $ 3,949 1418
$ 3,949 $ 5,225 677
$ 5,255 $ 6,560 334
$ 6,560 $ 7,865 111
$ 7,865 $ 9,171 35
$ 9,171 $ 10,476 16
$ 10,476 $ 11,782 2
$ 11,782 $ 13,087 2
8000
Box Plot - Outlet Sales ($)
Salaries($) of Twelve Senior Managers in an IT Firm.
Find outliers using Box plot method?
Salary ($)
Descriptive Statistics
Measures of Dispersion :
Quiz
Measures of
Measures of Variability
Central Tendency Range
Median Variance
Mode Standard Deviation
Co-efficient of Variation
Mean
Other
summary
measures:
Skewness
Kurtosis Dr. Sridhar Vaithianathan IMT
Hyderabad 15
e…
i m
T
i z
Q u
Central Tendency & Dispersion
In computing the mean of a sample, the value
of sum of xi’s are divided by
a. n
b. n-1
c. n+1
d. n-2
Central Tendency & Dispersion
The most frequently occurring value of a data
set is called the
a. range
b. mode
c. mean
d. median
Central Tendency & Dispersion
The standard deviation of a sample of 100
observations equals 64. The variance of the
sample equals
a. 8
b. 10
c. 6400
d. 4,096
Central Tendency & Dispersion
The variance of a sample of 81 observations
equals 64. The standard deviation of the
sample equals
a. 9
b. 4096
c. 8
d. 6561
Central Tendency & Dispersion
The descriptive measure of dispersion that is
based on the concept of a deviation about the
mean is
a. the range
b. the interquartile range
c. the absolute value of the range
d. the standard deviation
Central Tendency & Dispersion
The measure of location which is the most
likely to be influenced by extreme values in
the data set is the
a. range
b. median
c. mode
d. mean
Central Tendency & Dispersion
The coefficient of variation is
times 100
c. the square of the standard deviation
d. the mean divided by the standard deviation
Central Tendency & Dispersion
The heights (in inches) of 25 individuals were recorded
and the following statistics were calculated
mean = 70 range = 20
mode = 73 variance = 784
median = 74
population mean
b. is always larger than the true value of the
population mean
c. is always equal to the true value of the
population mean
d. could be larger, equal to, or smaller than
Hint : CoV
Sri’s AQs
Sri’s CQs
Sri’s CQs
Covariance :
◦ Sxy =Covar (x,y) = √ ( ∑(x- x ) * (y – y) / n-1)
Correlatiion
◦ Γxy = Correl (x,y) = Sxy
(Rho) S x * Sy
e…
i m
T
i z
Q u
Relationship between Two Variables
A numerical measure
of linear association
between two variables
is the
a. variance
b. covariance
c. standard deviation
d. coefficient of
variation
Sri’s CQs
a. a positive variance of
the x values
b. a positive variance of
the y values
c. the standard deviation
is positive
d. positive relation
between the independent
and the dependent
variables
Sri’s CQs
Sri’s CQs
Measures Sep
Oct
12.0 10.84
7.0 4.82
12.0
7.0
9.13
7.26
12.0
7.0
8.15
6.42
8.0
8.0
5.56
7.91
Nov 5.0 5.68 5.0 4.74 5.0 5.73 8.0 6.89
Average
Variance
Average price is the same. (9) Average sales is the same too (7.5).