Professional Documents
Culture Documents
Unit - 1 PDF
Unit - 1 PDF
Unit - 1 PDF
Contents Contents
Engineering methods and Collection of data
statistical Thinking
Collecting Engineering Data Classification and Tabulation of
data
• Basic Principles
Stem and Leaf diagram
• Retrospective Study
Frequency distribution and
• Observational Study Histogram
• Designed Experiments Box Plots
• Observing Processes over Time Sequence Plots
Time Probability Plots
Mechanical and Empirical Models
Course outcome
Example:-
• We are interested in the numeric average molecular weight (Mn) of a
polymer. Now we know that Mn is related to the viscosity of the material (V),
and it also depends on the amount of catalyst (C) and the temperature (T ) in
the polymerization reactor when the material is manufactured. The
relationship between Mn and these variables is
Mn = f(V,C,T)
where the form of the function f is unknown.
• We estimate the model from experimental data to be of the following form
where the b’s are unknown parameters.
Collection of Data
Mailed questionnaire
– Data is collected through questionnaire
[list of questions] mailed to the informant.
Questionnaire filled by enumerators
– Data is collected by trained enumerators
who fill questionnaires.
Telephonic interviews
– Data is collected through an interview over
the telephone with the interviewer.
Difference Between Census & Sampling
Method
Census Method
Every unit of population studied
Reliable and accurate results
Expensive method
Suitable when population is of homogenous nature
Sampling Method
Few units of population are studied
Less Reliable and accurate results
Less expensive method
Suitable when population is of heterogeneous nature
Advantages and Disadvantages -
Mailed Questionnaire Method:
Advantages Disadvantages
Advantages Disadvantages
Advantages Disadvantages
Published Source
- Government publications, Semi-government
publications etc.
Unpublished Source
- Census of India [They are collected by the
organizations for their own record]
Classification of Data
Chronological classification:
- In chronological classification the collected data are
arranged according to the order of time expressed in
years, months, weeks, etc.,
Eg:- The estimates of birth rates in India during 970 – 76 are
Geographical classification:
- The data are classified according to geographical
region or place.
Eg:- The production of paddy in different states in Iraq,
production of wheat in different countries etc.
Country America China Denmark France Iraq
Qualitative classification:
- Data are classified on the basis of same attributes or
quality like sex, literacy, religion, employment etc.,
Such attributes cannot be measured along with a
scale.
Eg:- If the population to be classified in respect to one attribute,
say sex, then we can classify them into two namely that of
males and females. Similarly, they can also be classified into
‘married or ‘ single’ on the basis of another attribute ‘marital
status’.
Types of Classification –Cont.
Quantitative classification:
- It refers to the classification of data according to
some characteristics that can be measured such as
height, weight, etc.,
Eg:-The group of a children may be classified according to weight
Table Number__________
Title ___________
[Head Note]
Stub caption Total[Rows]
Sub head Sub head
Column Column Column Column
head head head head
The columns & rows should be clearly separated with dark lines.
Demarcation should also be made between data of one class and
that of another.
Comparable data should be put side by side.
The figures in percentage should be approximated before
tabulation.
The alignment of the figures, symbols etc. should be properly aligned
and adequately spaced to enhance the readability of the same.
Abbreviations should be avoided.
Representation of Data
Ans:-
Problem-2
The following are the numbers of text messages sent last week by
the cellular phone users on one floor of a college dormitory.
Display the data in a stem-and-leaf plot. What can you conclude?
155 159 144 129 105 145 126 116 130 114 122 112 112 142 126 118 118
108 122 121 109 140 126 119 113 117 118 109 109 119 139 139 122 78
133 126 123 145 121 134 124 119 132 133 124 129 112 126 148 147
Problem-2 – Cont.
Ans:-
Interpretation :- From the display,you can conclude that more than 50% of the
cellular phone users sent between 110 and 130 text messages.
Frequency distribution
58 72 64 65 67 92 55 51 69 73 64 59 65 55 75 56
89 60 84 68 74 67 55 68 74 43 67 71 72 66 62 63
83 64 51 63 49 78 65 75
Problem-1-cont.
Histogram
Students
3
0
1- 10 11- 20 21- 30 31- 40
Number of Pages
Problem-1
58 72 64 65 67 92 55 51 69 73 64 59 65 55 75 56
89 60 84 68 74 67 55 68 74 43 67 71 72 66 62 63
83 64 51 63 49 78 65 75
Problem-1- Cont.
Frequency polygons
58 72 64 65 67 92 55 51 69 73 64 59 65 55 75 56
89 60 84 68 74 67 55 68 74 43 67 71 72 66 62 63
83 64 51 63 49 78 65 75
Problem-1- Cont.
Ans:-
Ogive (or) Cumulative frequency polygon
Quartiles split the data into four parts. For ungrouped data, arrange
the observations in order from smallest to largest.
The second quartile is the median: Q2 = x.
If n is even: The first quartile, Q1, is the median of the smallest n/2
observations; and the third quartile, Q3, is the median of the largest
n/2 observations.
If n is odd: The first quartile, Q1, is the median of the smallest (n +1)/2
observations; and the third quartile, Q3, is the median of the largest
(n+1)/2 observations.
Quartiles
73 67 75 81 67 75
85 69
Problem-1-Cont.
Step 1: Order the data from least to greatest. Then find the least
and greatest values, the median, and the lower and upper
quartiles.
Step 2: Draw a number line. Above the number line, plot points
for each value in Step 1.
Step 3: Draw a box from the lower to the upper quartile. Inside
the box, draw a vertical line through the median. Then draw the
“whiskers” from the box to the least and greatest values.
Problem-1-Cont.
Problem-1 –Cont.
Problem-2:
Comparing Box-and-Whisker Plots
Problem-2 – Cont.
Ans:-
Time Sequence Plots