
BASIC STATISTICS (3685)

SUSTAINABLE ENVIRONMENTAL DESIGN Supplementary Material


Autumn, 2020

ALLAMA IQBAL OPEN UNIVERSITY, ISLAMABAD


Department of Environmental Design, Health & Nutritional Sciences

Course: Basic Statistics (3685) Semester: Autumn, 2020


Level: M.Sc.

CONTENT LIST

This guideline for “Basic Statistics” includes the following items:

1. Name of Course Books
2. Course Outline (Units 1-9)
3. Tutor Guide
4. Assignments 1 & 2
5. Schedule for submitting the Assignments and Tutorial Meeting

Note: For any query, please contact at the following address:

Programme Coordinator
Sustainable Environmental Design Programme
Allama Iqbal Open University, Block 6,
H-8, Islamabad

STUDENT GUIDE
Dear student,
Assalam-o-Alaikum,
We welcome you to the MSc Sustainable Environmental Design course entitled “Basic
Statistics”. This course is part of the fourth semester of your MSc program. For an
introduction to the course, please go through the guidelines provided here; they will
help you complete it successfully.

This is a single-semester course that introduces students to a number of statistical
techniques that are needed in research.

Course Objectives

1. To enable the students to understand the main features of traditional statistics.
2. To enable the students to understand how to analyze statistical data properly.
3. To enhance students' critical thinking in domains involving judgments based on
data, and to stimulate the type of independent thinking that research beyond the
confines of the textbook requires.

COURSE OUTLINE
BASIC STATISTICS
Reference Book:
Statistics for Engineers: An Introduction, by S. J. Morrison

Unit 1: Introduction
1.1 Need to study Statistics
1.2 Nature of Variability
1.3 Variance
1.4 Covariance and Correlation

Unit 2: Basic Statistical Methods


2.1 Normal Distribution
2.2 Cumulative Frequency Distributions
2.3 Binomial & Poisson Distribution
2.4 Chi-Squared Distribution

Unit 3: Production
3.1 Sampling Inspection
3.2 Control Charts
3.3 Cusum Charts

Unit 4: Regression
4.1 Significance Tests
4.2 Analysis of Variance
4.3 Linear Regression

Unit 5: Engineering Design


5.1 Variance Synthesis
5.2 Factors of Safety
5.3 Tolerances
5.4 The Future

Unit 6: Research and Development-I


6.1 Design of Experiments
6.2 Evolutionary Operation

Unit 7: Research and Development-II


7.1 Multiple Regressions
7.2 More Statistical Methods

Unit 8: Quality Management-I


8.1 Measurement & Statistical Computing
8.2 Quality Planning
8.3 Quality Organization

Unit 9: Quality Management-II


9.1 Directing the Quality Function
9.2 Controlling the Quality Function
9.3 Statistical Engineering
9.4 Conclusion

TUTOR GUIDE
Dear tutor,
Students enrolled in the MSc Sustainable Environmental Design, offered through the blended
learning system, belong to the built environment field and have varied experience. These
students have very limited contact with their course mates and the part-time tutors. It is
therefore important to keep in mind that some of the distance-learning students have had
no links with education during the past few years after completing their formal education.
You are therefore requested to guide and help the students while keeping these issues in
mind. Some students may need help in developing professional attitudes towards research
and statistical analysis.

Study Center
The main purpose of establishing the study center for blended learning students is to
provide help and guidance with the difficulties students face while studying at
home. For this semester, students have to come to the Main Campus, Islamabad. The mode
of teaching will be face-to-face.

Assignments
In the blended learning system, studying the course units has its own importance, but
assignments and workshops are the major link between the tutor and the student.
It is therefore important to offer your comments through these assignments. Express
your views in such a way that the student is not discouraged, hurt, or left feeling
depressed after going through your comments.

You are also expected to give guidance on issues such as methods of solving assignments,
effective methods of studying, and ways to improve study habits and work hard.
It is anticipated that students will submit their assignments on time according to the
prescribed schedule.

You are therefore requested to mark the assignments within 15 days and return these with
detailed comments within the scheduled dates.

Marking guides are provided to you. You are expected to follow the instructions and
make full use of these guides while marking the assignments. The students are expected
to avoid giving unnecessary details and try to be brief and comprehensive. While marking
the assignments the tutor has to assess whether the students have followed the
instructions provided to them or not.

The students are given another assignment, which is tool-based and is intended to give
them a clear working knowledge of SPSS, after which they will present a report of their
work. The students are not required to submit this assignment to you before it is
presented; however, after the workshop the assignments will be submitted to the tutor and
will be marked by you.

Workshop
A three-day workshop will be arranged for the students for each course at their
respective study center. You will be informed in advance, as your presence at the
workshop is necessary.
During the workshop, experts will deliver lectures focusing on the main areas of the
subject.

Marking Guide
It is anticipated that the tutor will mark the assignments carefully and apply the same
marking standard to all students. For both assignments you are requested to follow
the division of marks indicated on the assignment, that is, five questions making a total
of one hundred marks for each assignment. Questions that are further divided into parts
(a), (b), etc. carry a further division of marks as well. For the marking of assignments
and the workshop, the allocation of marks is as follows:

S. No   Assignment Number   Component                      Pass Marks   Total Marks
01      01                  Theoretical Questions          40           100
02      02                  Theoretical Questions          40           100
03      Workshop            Participation and Attendance   Participation and 70% attendance is mandatory

Tutors are expected to follow the guidelines provided in order to maintain standardization
and uniformity.

ALLAMA IQBAL OPEN UNIVERSITY, ISLAMABAD
(Department of Environmental Design, Health & Nutritional Sciences)

WARNING
1. PLAGIARISM OR HIRING OF GHOST WRITER(S) FOR SOLVING
THE ASSIGNMENT(S) WILL DEBAR THE STUDENT FROM
AWARD OF DEGREE/CERTIFICATE, IF FOUND AT ANY STAGE.
2. SUBMITTING ASSIGNMENT(S) BORROWED OR STOLEN FROM
OTHER(S) AS ONE’S OWN WILL BE PENALIZED AS DEFINED IN
“AIOU PLAGIARISM POLICY”.

Course: Basic Statistics (3685) Semester: Autumn, 2020


Level: M.Sc Sustainable Environmental Design Total Marks: 100

Credit Hours: 3 (2+1) Pass Marks: 40


ASSIGNMENT No. 1

Q.1 (a) Explain the importance of data presentation and of measures of central tendency
in data analysis. (10)
(b) The following scores represent the final examination grades for an elementary
statistics course:
23 60 79 32 57 74 52 70 82 36 80 77 81 95 41 65 92 85 55 76 52
10 64 75 78 25 80 98 81 67 41 71 83 54 64 72 88 62 74 43 60 78
89 76 84 48 84 90 15 79 34 67 17 82 69 74 63 80 85 61
i. Find the mean, median, mode, quartiles and standard deviation.
ii. Make a grouped frequency distribution of these data and again calculate the mean,
median, mode, quartiles and standard deviation.
iii. Compare your results from part (i) and part (ii), and comment on them.
iv. Determine the skewness of this data set. (10)
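
The ungrouped calculations in part (i) can be cross-checked with a short script. What follows is only a minimal sketch using Python's standard `statistics` module (Python 3.8 or later); the variable names are illustrative, and the quartile convention (`method="inclusive"`) is an assumption, since textbooks differ on how quartiles are interpolated.

```python
import statistics

# Final examination grades from Q.1 (b), in the order given.
scores = [
    23, 60, 79, 32, 57, 74, 52, 70, 82, 36, 80, 77, 81, 95, 41, 65, 92, 85, 55, 76, 52,
    10, 64, 75, 78, 25, 80, 98, 81, 67, 41, 71, 83, 54, 64, 72, 88, 62, 74, 43, 60, 78,
    89, 76, 84, 48, 84, 90, 15, 79, 34, 67, 17, 82, 69, 74, 63, 80, 85, 61,
]

mean = statistics.mean(scores)
median = statistics.median(scores)
modes = statistics.multimode(scores)        # every value tied for most frequent

# Quartile cut points; "inclusive" is one of several conventions (assumption).
q1, q2, q3 = statistics.quantiles(scores, n=4, method="inclusive")

sample_sd = statistics.stdev(scores)        # divisor n - 1
population_sd = statistics.pstdev(scores)   # divisor n

print(f"n={len(scores)} mean={mean:.2f} median={median} modes={modes}")
print(f"Q1={q1} Q3={q3} s={sample_sd:.2f} sigma={population_sd:.2f}")
```

Results from a grouped frequency distribution (part ii) will generally differ slightly from these, because grouping replaces each score by its class midpoint.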

Q.2 (a) Explain the importance of moments and skewness in data analysis. (10)
(b) In the manufacturing of a certain scientific instrument great
importance is attached to the life of a particular critical component.
This component is obtained in bulk from two sources, A and B, and in
the course of inspection, the lives of 1000 of the components from
each source are determined. The following frequency tables are
obtained:-
Source A                              Source B
Life (hours)   No. of components      Life (hours)   No. of components
1000-1020      40                     1030-1040      339
1020-1040      96                     1040-1050      136
1040-1060      364                    1050-1060      25
1060-1080      372                    1060-1070      20
1080-1100      85                     1070-1080      130
1100-1120      43                     1080-1090      350
i. Find the median and the two quartiles for each source.
ii. Find the mean and standard deviation for each source and compare them.
iii. Which source do you think provides the better quality of components, and why?
Note: answer this part by considering the results for the mean and standard deviation.
iv. Calculate the skewness for each source and, through the skewness, comment on the
life of the components for each source. (10)
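
For grouped data such as the two frequency tables above, the mean and standard deviation are usually approximated by treating every observation as if it sat at its class midpoint. The sketch below illustrates that approximation; the function name `grouped_mean_sd` is hypothetical, and the use of the sample divisor (n - 1) is an assumption that a course text may or may not share.

```python
import math

def grouped_mean_sd(classes):
    """Return (mean, sample SD) from (lower, upper, frequency) triples,
    using class midpoints as stand-ins for the individual observations."""
    n = sum(freq for _, _, freq in classes)
    midpoints = [((lower + upper) / 2, freq) for lower, upper, freq in classes]
    mean = sum(mid * freq for mid, freq in midpoints) / n
    sum_sq = sum(freq * (mid - mean) ** 2 for mid, freq in midpoints)
    return mean, math.sqrt(sum_sq / (n - 1))

# Frequency tables from Q.2 (b): (lower limit, upper limit, number of components).
source_a = [(1000, 1020, 40), (1020, 1040, 96), (1040, 1060, 364),
            (1060, 1080, 372), (1080, 1100, 85), (1100, 1120, 43)]
source_b = [(1030, 1040, 339), (1040, 1050, 136), (1050, 1060, 25),
            (1060, 1070, 20), (1070, 1080, 130), (1080, 1090, 350)]

mean_a, sd_a = grouped_mean_sd(source_a)
mean_b, sd_b = grouped_mean_sd(source_b)
print(f"Source A: mean={mean_a:.2f} sd={sd_a:.2f}")
print(f"Source B: mean={mean_b:.2f} sd={sd_b:.2f}")
```

Two sources with similar means and standard deviations can still have very different shapes, which is why the question also asks for quartiles and skewness.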

Q.3 (a) Define the terms: Experiment, Outcome, Event, Sample Space, Simple and
Compound Events, Mutually Exclusive Events. (10)
(b) There are 20 computers in a store. Among them, 15 are brand new and 5 are
refurbished. Six computers are purchased for a student lab. From the first
look, they are indistinguishable, so the six computers are selected at random.
Compute the probability that, among the chosen computers, (i) two are
refurbished; (ii) at least one is refurbished. (10)
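
Selecting six computers at random without replacement from 15 new and 5 refurbished ones is a hypergeometric setting. The following sketch shows how such probabilities could be computed with `math.comb`; the helper `hypergeom_pmf` is a hypothetical name, and reading "two are refurbished" as "exactly two" is an assumption.

```python
from math import comb

def hypergeom_pmf(k, total, marked, drawn):
    """P(exactly k marked items) when `drawn` items are taken without
    replacement from `total` items, of which `marked` are marked."""
    return comb(marked, k) * comb(total - marked, drawn - k) / comb(total, drawn)

# 20 computers, 5 refurbished, 6 selected at random.
p_two = hypergeom_pmf(2, total=20, marked=5, drawn=6)

# "At least one" is the complement of "none refurbished".
p_at_least_one = 1 - hypergeom_pmf(0, total=20, marked=5, drawn=6)

print(f"P(two refurbished)          = {p_two:.4f}")
print(f"P(at least one refurbished) = {p_at_least_one:.4f}")
```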

Q.4 (a) Describe the properties of the normal probability distribution in detail. (10)
(b) A research scientist reports that mice will live an average of 40 months when
their diets are sharply restricted and then enriched with vitamins and
proteins. Assuming that the lifetimes of such mice are normally distributed
with a standard deviation of 6.3 months, find the probability that a given
mouse will live
(i) more than 32 months;
(ii) less than 28 months;
(iii) between 37 and 49 months. (10)
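
Normal probabilities of this kind are normally read from a table of the standard normal distribution after converting to z-scores. As a cross-check, the sketch below uses `statistics.NormalDist` from the Python standard library (3.8 or later); the variable names are illustrative only.

```python
from statistics import NormalDist

# Lifetimes assumed normal with mean 40 months and SD 6.3 months (from the question).
lifetime = NormalDist(mu=40, sigma=6.3)

p_more_than_32 = 1 - lifetime.cdf(32)                    # upper tail
p_less_than_28 = lifetime.cdf(28)                        # lower tail
p_between_37_and_49 = lifetime.cdf(49) - lifetime.cdf(37)

# The z-scores that would be looked up in a standard normal table.
z_32 = (32 - 40) / 6.3
z_28 = (28 - 40) / 6.3

print(f"P(X > 32)      = {p_more_than_32:.4f} (z = {z_32:.2f})")
print(f"P(X < 28)      = {p_less_than_28:.4f} (z = {z_28:.2f})")
print(f"P(37 < X < 49) = {p_between_37_and_49:.4f}")
```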

Q.5 (a) Describe the properties of the least-squares regression line in detail. (10)
(b) A study of the amount of rainfall and the quantity of air pollution removed
produced the following data: (10)
Daily Rainfall x (0.01 cm)    Particulate Removed y (µg/m3)
4.3 126
4.5 121
5.9 116
5.6 118
6.1 114
5.2 118
3.8 132
2.1 141
7.5 108
(i) Find the equation of the regression line to predict the particulate removed
from the amount of daily rainfall.
(ii) Estimate the amount of particulate removed when the daily rainfall is x = 4.8
units.
(iii) Interpret the regression coefficient.
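
The least-squares line y = a + bx has slope b = Sxy / Sxx and intercept a = ȳ - b·x̄. The sketch below applies these formulas to the rainfall data; the function name `least_squares_line` is hypothetical, and the script is meant as a check on hand calculation rather than a model answer.

```python
def least_squares_line(x, y):
    """Return (intercept, slope) of the least-squares line of y on x."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    s_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    s_xx = sum((xi - x_bar) ** 2 for xi in x)
    slope = s_xy / s_xx
    intercept = y_bar - slope * x_bar
    return intercept, slope

# Data from Q.5 (b): daily rainfall x (0.01 cm) and particulate removed y.
rainfall = [4.3, 4.5, 5.9, 5.6, 6.1, 5.2, 3.8, 2.1, 7.5]
removed = [126, 121, 116, 118, 114, 118, 132, 141, 108]

intercept, slope = least_squares_line(rainfall, removed)
predicted_at_4_8 = intercept + slope * 4.8

print(f"y-hat = {intercept:.3f} + ({slope:.3f}) x")
print(f"Prediction at x = 4.8: {predicted_at_4_8:.2f}")
```

The sign and size of the slope are what part (iii) asks you to interpret: it is the estimated change in y for a one-unit increase in x.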

ASSIGNMENT No. 2
Total Marks: 100
Pass Marks: 40
Q.1 (a) Define the following:-
i. Type-I and Type-II error
ii. Level of significance
iii. Acceptance and Rejection Regions (10)
(b) In the American Heart Association journal Hypertension, researchers report
that individuals who practice Transcendental Meditation (TM) lower their
blood pressure significantly. If a random sample of 225 male TM
practitioners meditate for 8.5 hours per week with a standard deviation of
2.25 hours, does that suggest that, on average, men who use TM meditate
more than 8 hours per week at 5% level of significance? (10)
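
With n = 225 the sample is large enough that a one-sided z-test on the mean is the usual approach. The sketch below sets up H0: μ = 8 against H1: μ > 8; treating the sample standard deviation as if it were the population value is a large-sample assumption, and the variable names are illustrative.

```python
from math import sqrt
from statistics import NormalDist

n = 225              # sample size
sample_mean = 8.5    # hours per week
sample_sd = 2.25     # hours
mu_0 = 8.0           # hypothesized mean under H0
alpha = 0.05

# Test statistic: standardized distance of the sample mean from mu_0.
z = (sample_mean - mu_0) / (sample_sd / sqrt(n))

# One-sided (upper-tail) P-value under the standard normal distribution.
p_value = 1 - NormalDist().cdf(z)
reject_h0 = p_value < alpha

print(f"z = {z:.3f}, P-value = {p_value:.5f}, reject H0: {reject_h0}")
```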

Q.2 (a) To find out whether a new serum will arrest leukemia, 9 mice, all with an
advanced stage of the disease, are selected. Five mice receive the treatment
and 4 do not. Survival times, in years, from the time the experiment
commenced are as follows:
Treatment: 2.1 5.3 1.4 4.6 0.9
No Treatment: 1.9 0.5 2.8 3.1
At the 0.05 level of significance, can the serum be said to be effective? Assume
the two populations to be normally distributed with equal variances. (10)
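
Under the equal-variance assumption stated in the question, the two samples are compared with a pooled two-sample t statistic on n1 + n2 - 2 degrees of freedom. The sketch below computes only the statistic; the critical value still has to come from a t table, because the Python standard library has no t distribution. The function name `pooled_t` is hypothetical.

```python
from math import sqrt
from statistics import mean, variance

def pooled_t(sample_1, sample_2):
    """Return (t statistic, degrees of freedom) for a two-sample t-test
    that assumes equal population variances."""
    n1, n2 = len(sample_1), len(sample_2)
    dof = n1 + n2 - 2
    # Pooled variance: sample variances weighted by their degrees of freedom.
    pooled_var = ((n1 - 1) * variance(sample_1)
                  + (n2 - 1) * variance(sample_2)) / dof
    t = (mean(sample_1) - mean(sample_2)) / sqrt(pooled_var * (1 / n1 + 1 / n2))
    return t, dof

# Survival times in years from Q.2 (a).
treatment = [2.1, 5.3, 1.4, 4.6, 0.9]
no_treatment = [1.9, 0.5, 2.8, 3.1]

t_stat, dof = pooled_t(treatment, no_treatment)
print(f"t = {t_stat:.3f} on {dof} degrees of freedom")
```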

(b) A criminologist conducted a survey to determine whether the incidence of
certain types of crime varied from one part of a large city to another. The
particular crimes of interest were assault, burglary, larceny and homicide.
The following table shows the numbers of crimes committed in four areas of
the city during the past year. Can we conclude from these data at the 0.01
level of significance that the occurrence of these types of crime is dependent
on the city district? (10)
Type of Crime
District Assault Burglary Larceny Homicide
1 162 118 451 18
2 310 196 996 25
3 258 193 458 10
4 280 175 390 19

Q.3 (a) A study was conducted to see if increasing the substrate concentration has an
appreciable effect on the velocity of a chemical reaction. With a substrate
concentration of 1.5 moles per liter, the reaction was run 15 times, with an
average velocity of 7.5 micromoles per 30 minutes and a standard deviation
of 1.5. With a substrate concentration of 2.0 moles per liter, 12 runs were
made, yielding an average velocity of 8.8 micromoles per 30 minutes and a
sample standard deviation of 1.2. Is there any reason to believe that this
increase in substrate concentration causes an increase in the mean velocity of
the reaction of more than 0.5 micromole per 30 minutes? Use a 0.01 level of
significance and assume the populations to be approximately normally
distributed with equal variances. (10)
(b) In a study conducted at Virginia Tech, the plasma ascorbic acid levels of
pregnant women were compared for smokers versus nonsmokers. Thirty-two
women in the last three months of pregnancy, free of major health disorders
and ranging in age from 15 to 32 years, were selected for the study. Prior to
the collection of 20 ml of blood, the participants were told to avoid breakfast,
forgo their vitamin supplements, and avoid foods high in ascorbic acid
content. From the blood samples, the following plasma ascorbic acid values
were determined, in milligrams per 100 milliliters:
Plasma Ascorbic Acid Values
Nonsmokers       Smokers
0.97   1.16      0.48
0.72   0.86      0.71
1.00   0.85      0.98
0.81   0.58      0.68
0.62   0.57      1.18
1.32   0.64      1.36
1.24   0.98      0.78
0.99   1.09      1.64
0.90   0.92
0.74   0.78
0.88   1.24
0.94   1.18
Is there sufficient evidence to conclude that there is a difference between
plasma ascorbic acid levels of smokers and nonsmokers? Assume that the
two sets of data came from normal populations with unequal variances. (10)

Q.4 (a) Define the chi-square test of independence and explain its importance in data
analysis. (10)
(b) In the experiment to study the dependence of hypertension on smoking
habits, the following data were taken on 180 individuals:
                   Non-Smokers   Moderate Smokers   Heavy Smokers
Hypertension       21            36                 30
No Hypertension    48            26                 19

Test the hypothesis that presence or absence of hypertension is independent of
smoking habits. Use a 0.05 level of significance. (10)
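
The chi-square test of independence compares each observed count with the count expected under independence, (row total × column total) / grand total, on (r - 1)(c - 1) degrees of freedom. The sketch below computes the statistic for the hypertension table; the function name `chi_square_independence` is hypothetical, and the result still has to be compared with a tabulated chi-square critical value.

```python
def chi_square_independence(table):
    """Return (chi-square statistic, degrees of freedom) for an r x c
    contingency table given as a list of rows of observed counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count if rows and columns were independent.
            expected = row_totals[i] * col_totals[j] / grand_total
            statistic += (observed - expected) ** 2 / expected
    dof = (len(table) - 1) * (len(table[0]) - 1)
    return statistic, dof

# Rows: hypertension, no hypertension.
# Columns: non-smokers, moderate smokers, heavy smokers.
observed = [[21, 36, 30],
            [48, 26, 19]]

chi2, dof = chi_square_independence(observed)
print(f"chi-square = {chi2:.2f} on {dof} degrees of freedom")
```

The same function could be applied to the 4 x 4 crime table in Q.2 (b), which would have 9 degrees of freedom.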

Q.5 (a) From the area planted in one variety of guayule, 25 plants were selected at
random. Of these plants, 13 were “Off types” and 12 were “Aberrant”. The
rubber percentages of these plants were:

Off types: 4.47 5.88 6.21 5.55 6.09 5.70 5.82 4.84 5.59 5.59 5.22 4.45 6.76
Aberrant:  6.48 6.36 4.28 7.71 6.40 7.06 5.51 8.93 7.71 7.20 7.37 5.91

Compute a 90% confidence interval for the difference of the two population
means and interpret your results. Also test the hypothesis that the two types of
plants have equal rubber production. (10)

(b) Eight pots, growing three barley plants each, were exposed to a high-tension
discharge, while nine similar pots were enclosed in an earthed wire cage. The
numbers of tillers (shoots) in each pot were as follows: (10)
Caged   Electrified
17      16
27      16
18      20
25      16
27      21
29      17
27      15
23      20
17

i. Test the hypothesis that electrification and caging have equal effects,
against the alternative that their effects are not equal, at the 5% level of
significance.
ii. Test the hypothesis that the two populations have the same variance.
iii. Find a 95% confidence interval for the true difference.
