Professional Documents
Culture Documents
Aser Unit I Basics of Statistics
Aser Unit I Basics of Statistics
RANDOM VARIABLE
A variable which can take different values is called a random
PART 1 variable. It is generally denoted by capital letters: A, B, ……X,Y, Z,
………
Values taken by X are either numbers (numeric/ quantitative data)
BASICS OF STATISTICS or non – numeric (Qualitative data)
Examples of Numeric Data
E.g. X= time taken to deliver a particular order,
Y = Shipping record of time of receipt of an order to delivery.
Z = Scores received by employees in performance test conducted.
A = In a test conducted for the mother board the time to failure
B= The Business per employee in the public sector bank
X= Return on assets in a private banks.
DATA_SCIENCE_2019_20 1 DATA_SCIENCE_2019_20 2
1 2
Work
Examples of Non – Numeric Data No. CGPA UG Qualification Specialisation Experience Age (in years)
E.g. X= gender of an employee, 1 3.24 B.Com. Finance 0 23
Y = Graduate stream of a candidate. 2 3.14 B.Sc. HR 1 21
Z = Specialization offered by MMS students 3 3.72 BAF Finance 2 23
A = Sector to which a particular industry belongs 4 3.06 B.E. Systems 4 21
B= Names of states in a government data 5 3.14 BMS HR 7 22
X= Names of car – models in an automobile industry 6 3.14 CA Finance 2 23
…………………………………….. Etc. 7 3.06 B.A. Economics Operations 0 22
8 3.17 B.Sc.(IT) Systems 3 21
NOTE:
9 2.97 BCA Systems 2 22
In a given data there can be a combination of numeric and
non – numeric data. 10 3.14 B.Com. Finance 0 23
11 3.69 BMS Marketing 3 24
For example:
12 3.85 B.E. Operations 0 25
DATA_SCIENCE_2019_20 3 13 3.92 BCA DATA_SCIENCE_2019_20
Systems 0 23 4
3 4
26-09-2023
Name
Ratan Tata
Wealth in Crores
125674.12
Sector
Large diversified
Types of Data
Two types of data:
P. R. S. Oberoi 183739.00 Hospitality
(1) Ungrouped data is a data given in the form scattered values:
Azim H. Premji 64855.27 Software
X x1 x2 x3 x4 ---- --- ---- ---- xn
Mukesh Ambani 56414.35 Petrochemicals
(2) Grouped data is a data consisting of values or class intervals along with their
Sunil Mittal & Family 35558.22 Telecom frequencies: (Here frequency = number of times a particular value repeats)
Anil Ambani 34993.98 Large diversified
X x1 x2 x3 x4 x5 x6 ...... ……. xn
Tulsi R. Tanti & Family 26139.69 Wind energy
Anil Agarwal 18108.75 Metals OR F(frequency) f1 f2 f3 f4 f5 f6 …... ……. fn
Shiv Nadar 16698.47 Software & hardware (Here frequency = Number of observations/ data points/ values falling in a
Kumarmangalam Birla 16643.04 Large diversified particular interval)
Rahul Bajaj 12455.99 Auto Class Intervals 0 – 10 10 – 20 20 – 30 30 – 40 40 – 50 50 – 60 60 – 70
Dilip S. Shanghvi 10584.49 Pharmaceuticals F( Frequency) 23 12 5 67 52 34 19
Baba Kalyani 7857.83
DATA_SCIENCE_2019_20
Auto components 5 DATA_SCIENCE_2019_20 6
5 6
7 8
26-09-2023
9 10
11 12
26-09-2023
DATA_SCIENCE_2019_20 13 DATA_SCIENCE_2019_20 14
13 14
15 16
26-09-2023
19 50
20
17
40
15
12
cumulative frequency
10 10 30
10 8 Series1
7
5 5 20
5 3 3
1 10
0
0
0 2 4 6 8 10 12 2.5 - 2.7 2.7 - 2.9 2.9 - 3.1 3.1 - 3.3 3.3 - 3.5 3.5 - 3.7 3.7 - 3.9
X
DATA_SCIENCE_2019_20 17 DATA_SCIENCE_2019_20 18
17 18
19 20
26-09-2023
21 22
23 24
26-09-2023
25 26
27 28