Professional Documents
Culture Documents
Engineering Statistics
Third year class
2022-2023
Lecture 1-page-1
Syllabus for Engineering Statistics:
Chapter one: Introduction
Chapter two: Describing Data
Chapter three: The probability
Chapter four: Discrete probability distributions
Chapter five: Continuous probability distributions
Chapter six: Sampling distributions
Chapter seven: Estimation theory
Chapter eight: Tests of hypothesis
Chapter nine: Regression and correlation analysis
Lecture 1-page-2
Chapter 1 Chapter One: Introduction
Types of Statistics:
Types of Variables:
A- Discrete Variables: Can assume only certain values and there are usually gaps between
the values. Examples: number of rooms in a house, number of cars arriving at the exit
of the road over an hour, number of students in each section, etc.
Lecture 1-page-3
B- Continuous Variable: Can assume any value within a specific range resulting from
measuring something. Examples: the time it took the examination is 3 hours: 30
minutes and 25 seconds depending on the accuracy of the timing device.
Levels of Measurements:
The level of measurements of the data dictates the calculations that can be done to
summarize and present the data. It will also determine the statistical tests that should be
performed.
C- Interval-Level Data: It includes all the characteristics of the ordinal level, but
in addition, the difference between values is a constant size.
Example: temperature (28, 31, 20) can easily be ranked and determine the
difference between temperatures. [0] degree does not represent the absence of
heat, just that it is cold. The properties of interval level are:
1- Data classifications are mutually exclusive and exhaustive.
2- Data classifications are scaled according to the amount of the characteristic
they possess.
3- Equal differences in the characteristic are represented by equal difference in
the measurement.
Lecture 1-page-4
D- Ratio-Level Data: Practically all quantitative data are ratio level of
measurement. It is the highest level of measurement. Here the [0] point is
meaningful and the ratio between two numbers is meaningful. Examples: wages,
weight, changes in stock prices, height, etc.
Money is a good example; if I earn $30000 per year and you earn $60000 per
year then you earn twice as much as I.
The properties of the ratio level are:
1- Data classifications are mutually exclusive and exhaustive.
2- Data classifications are scaled according to the amount of the characteristic
they possess.
3- Equal differences in the characteristic are represented by equal difference in
the numbers assigned to the classifications.
4- The [0] point is the absence of the characteristic.
Lecture 1-page-5
Chapter two: Describing Data
Descriptive statistics organize data to show the general pattern of the data and
where values tend to concentrate and to expose extreme or unusual data values. The first
procedure to organize and summarize a set of data is a frequency table.
Frequency Distribution: A grouping of data into mutually exclusive classes showing the
number of observations in each class.
How do we develop a frequency distribution? The first step is to tally the data into
a table showing the classes and the number of observations in each class. The steps in
constructing a frequency distribution are:
𝐻−𝐿
𝑖 ≥
𝐾
Lecture 1-page-6
Where:
i: class interval
K: number of classes
(H-L): range.
Class interval of the type {x: a ≤ x ˂ b} called exclusive that exclude the upper
limit of the class. Class interval of the type {x: a ≤ x ≤b} called inclusive that include the
upper limit of the class. In order to ensure continuity of class-limits, we convert inclusive
class intervals into exclusive class intervals after an adjustment to the upper limits by
using:
Adjustment = 0.5 (Lower-limit of the current class – Upper-limit of the preceding class)
Therefore, the common points between classes are called true limits.
Consider an example where the marks obtained by (45) students in a Statistics paper, the
maximum marks allotted is 30:
22 9 17 13 15 18 13 2 21 27 27 29
15 0 10 29 26 16 25 10 24 22 28 26
26 19 14 18 25 21 25 18 28 25 27 29
17 10 3 24 3 12 16 18 26
Here the variate is “number of marks”. The data in the original form as above are called
“raw or ungrouped data”. When arranged in ascending or descending order of
magnitude, the data are said to be arranged in an array. Counting the number of times
each variate value occurs, we get the following frequency table.
Lecture 1-page-7
Table 2.1: Frequency table of marks in Statistics of (45) students
Marks in Number of students Cumulative
Statistics (tally marks) frequency
0 I 1 1
2 I 1 2
3 II 2 4
9 I 1 5
10 III 3 8
12 I 1 9
13 II 2 11
14 I 1 12
15 II 2 14
16 II 2 16
17 II 2 18
18 IIII (IV) 4 22
19 I 1 23
21 II 2 25
22 II 2 27
24 II 2 29
25 IIII (IV) 4 33
26 IIII (IV) 4 37
27 III 3 40
28 II 2 42
29 III 3 45
To get a still better picture about the structure of the frequency distribution, we re-classify
the data into groups as follows:
Lecture 1-page-8
This type of representation of frequencies is called a grouped frequency distribution or
simply a grouped distribution.
Class limits: boundaries of the class. They are called lower limit and upper limit.
Class interval or class width ( i ): difference between the upper and lower limits of a
class.
Class frequency (fi): number of individuals or observations in each class.
Cumulative frequency ( Fx): total number of observations less than or equal to the
upper limit of the class.
Class length: difference between the upper- and lower-class limits.
Class mark or class mid-point (xi): 0.5 [(upper limit + lower limit) of the class].
𝐾
∑ 𝑓𝑖 = 𝑛
𝑖=1
fi : class frequency
Probability of a class (pi ): is the relative frequency of the class which represent the
ration of the class to the total number of observations.
𝐾
∑ 𝑝𝑖 = 1.00
𝑖=1
The following points should be kept in mind while tabulating or classifying an observed
frequency distribution:
Lecture 1-page-9
Graphic Presentation of Qualitative Data:
1- Bar Chart: A graph in which the classes are reported on the horizontal axis and the
class frequencies on the vertical axis. The class frequencies are proportional to the
heights of the bars.
2- Pie Chart: A chart that shows the proportion or percent that each class represent of
the total number of frequencies
Example: SkiLodges.com is test-marketing its new website and is interested in how easy
its Web page design is to navigate. It randomly selected 200 regular Internet users and
asked them to perform a search task on the Web page. Each person was asked to rate the
relative ease of navigation as poor, good, excellent, or awesome. The results are shown
in the following table:
Solution
Lecture 1-page-10
Graphic Presentation of a Frequency Distribution
1- Histogram: A graph in which the classes are marked on the horizontal axis and the
class frequencies on the vertical axis. The class frequencies are represented by the
heights of the bars and the bars are drawn adjacent to each other.
For a given grouped frequency distribution, we first mark off along the x-axis all
the class intervals on a suitable scale. Draw rectangles with the class intervals as bases,
areas proportional to the frequencies of the class intervals. For equal class intervals, the
heights of the rectangles will be proportional to the frequencies, unequal class intervals
the height will be proportional to the ratios of the frequencies to the widths of the classes.
Example 1: Refer to table 2.1 which shows the frequency distribution of selling prices at
a car company last month. Construct a histogram
Table 2.1
Selling Prices
Frequency
($ thousands)
15 up to 18 8
18 up to 21 23
21 up to 24 17
24 up to 27 18
27 up to 30 8
30 up to 33 4
33 up to 36 2
Total 80
Lecture 1-page-11
Example 2: Draw the histogram for the frequency distribution given in the table which
shows the number of boys on the registers at primary schools in a certain state.
Solution:
Rectangles
Age
Number of boys
(in years) base height
(in thousands)
2 -5 150 5-2=3 50
5 -11 3066 11-5=6 511
11-12 497 12-11=1 497
12 -13 477 13-12=1 477
13 -14 496 14-13=1 496
14-15 143 15-14=1 143
15-17 162 17-15=2 81
Lecture 1-page-12
Notes
For equal class intervals, the frequency polygon can be obtained by joining the middle
points of the upper sides of the adjacent rectangles of the histogram by straight lines.
Lecture 1-page-13
Example: For the frequency distribution shown: Selling Prices
Frequency
($ thousands)
15-18 8
18-21 23
Construct a frequency polygon 21-24 17
24-27 18
27-30 8
30-33 4
33-36 2
Solution
Lecture 1-page-14
3-Cumulative Frequency Polygon: To plot a cumulative frequency distribution, scale
the upper limit of each class along the X-axis and the corresponding cumulative
frequencies along the Y-axis.
Example: To find the variate value below which half the cars sold, we draw a horizontal
line from the (50%) mark on the right-hand vertical axis over to the polygon, then drop
down to the X-axis and read the selling price which is about 22.5, so we estimate that 50
percent of the vehicles sold for less than $22500
Lecture 1-page-15