You are on page 1of 41

Course

Course Name ACTS


Code

Statistics for
STAT2054 4,00
Engineers
Course Material
Textbook Probability & Statistics for Engineers & Scientists
(9th Edition), by R. E. Walpole, R. H. Myers, S. L.
Myers and K. Ye, Pearson Prentice Hall,2011.

References 1) Probability and Statistics for Engineering and the


(Books and Sciences (8th Edition), by J. L. Devore, Duxbury,
Periodicals) Addison Wesley, 2012
2) Miller and Freund's Probability and Statistics for
Engineers (8th Edition) , by Richard A. Johnson

Tools Excel, MATLAB, Python, Mathematica


Course Plan

Week 1 Introduction to Statistics and Data Analysis


Week 2 Probability
Week 3 Random Variables and Probability Distributions
Week 4 Mathematical Expectation
Week 5 Some Discrete Probability Distributions
Week 6 Some Continuous Probability Distributions.
Week 7 MIDTERM EXAM
Week 8 Functions of Random Variables (Optional).
Week 9 Fundamental Sampling Distributions and Data Descriptions
Week 10 One- and Two-Sample Estimation Problems.
Week 11 One- and Two-Sample Tests of Hypotheses
Week 12 Simple Linear Regression and Correlation
Week 13 Multiple Linear Regression and Certain Nonlinear Regression Models
Week 14 FINAL EXAM
Grading Weight (%)

Active Attendance 10
Homeworks 20
Midterm Exam 30
Final Exam 40
Example. Suppose that the prevalence of a disease is 1%. This
disease has no particular symptoms but can be screened by a
medical test that is 90% accurate. This means that the test
result is positive about 90% of the times when it is applied on
patients who have the disease and that the test result is negative
about 90% of the time when it is applied on patients who do not
have the disease.
• Suppose that you take the test and the test shows a positive
result.
• Then the burning question is: how likely is it that you have the
disease?
• Similarly, how likely is it that the patient is healthy if the test
result is negative?
Single Number Statistics
Single number statistics: means, medians, standard deviations,
Definitions and illustrations of the single-number statistics
and graphs,
 histograms,
 stem-and-leaf plots,
 scatter plots,
 dot plots,
 box plots,
Overview
• A batch process is a process in which the product comes out in groups and
not continuously.
• The data gathered may be discrete
or continuous, depending on the area of application.
• The product density will not always be the same.
• In a batch process rather than continuous, there will be
• batch-to-batch variation
• within-batch variation.
• Statistical methods are used to analyze data
• to gain more sense of where in the process changes may be made to
improve the quality of the process.
Toyota's strong market share gains of the last
decade
Overview: Statistical Inference, Samples,
Populations and the Role of Probability
Inference: a conclusion reached on the basis of evidence and reasoning.

• Beginning in the 1980s


• improvement of quality
• Use of Scientific Data,
• statistical methods in manufacturing
• gathering of data: done over a thousand years.
• Data: collected, summarized, reported, and stored.
• profound distinction between collection of scientific
information and inferential statistics.
Probability and Inferential Statistics
Statistical inference: making scientific judgments (drawing conclusions ) in
the face of uncertainty and variation, based on data.
Biased Sample
Biased: unfairly prejudiced for or against someone or something.
Variability in Scientific Data and Sampling
• Information is gathered in the form of samples, or collections
of observations gathered from populations .
Single-number Statistics or Descriptive Statistics

• These numbers give a sense of


• center of the location of the data,
• variability in the data,
• the general nature of the distribution of
• observations in the sample.
Measures of Location
• The mean is simply a numerical average.
Some Excel Applications
Measures of Location
• the sample mean is the centroid of the data in a sample.
• it is the fulcrum to balance a system of “weights” of the
individual data.
The Weighted Arithmetic Mean
Measures of Location
• Trimmed Means. “trimming away” a certain percent of both
the largest and the smallest set of values.
• For example, the 10% trimmed mean is found by eliminating
the largest 10% and smallest 10%
• and computing the average of the remaining values.
Measures of Location
sample median is to reflect the central tendency
of the sample uninfluenced by extreme values or
outliers
Some Excel Applications
Measures of Variability
• there are many measures of spread or variability.
• Perhaps the simplest one is the sample range Xmax − Xmin.
• The range can be very useful
• The sample measure of spread used most often is the sample
standard deviation.
Probability: Sample Space and Events
• Experiment: any process that generates a set of data.
• numerical data, representing counts or measurements,
• or categorical data, which can be classified according
to some criterion
• The set of all possible outcomes of a statistical experiment is called
the sample space and is represented by S.
Probability: Sample Space and Events
• An event is a subset of a sample space.
• (A’) The complement of an event A with respect to S is the subset of
all elements of S that are not in A.
• The intersection of two events A and B, denoted by the symbol A ∩ B,
is the event containing all elements that are common to A and B.
• Two events A and B are mutually exclusive, or disjoint, if A ∩ B = Ф,
that is, if A and B have no elements in common.

You might also like