Professional Documents
Culture Documents
1 - Introduction To Statistics - Jan-2, 2012 (Compatibility Mode) PDF
1 - Introduction To Statistics - Jan-2, 2012 (Compatibility Mode) PDF
Dedicated to
Professor. S. G. Deshmukh
Ramesh Anbanandam ramesh@nitc.ac.in Department of Mechanical Engineering, NIT Calicut Kerala, India -673 601.
ramesh.anbanandam@gmail.com 1
Lab/tutorial
The laboratory content will require prerequisite of working with Excel. There will be quizzes/assignments every week. The lab assignments are to be submitted on that day itself. Students will be also required to visit and consult useful web resources.
ramesh.anbanandam@gmail.com
ramesh.anbanandam@gmail.com
Reference
Meyer PL, Introductory Probability and Statistical Applications, Oxford and IBH Publishers Miller IR, Freund JE, Johnson R, Probability and Statistics for Engineers, Prentice-Hall (I) Ltd Walpole RE and Myers RH, Probability & Statistics for Engineers and Scientists, Macmillan Levin, R. I. and Rubin, D.S., Statistics for Management (Pearson Education ) Levine,David., Stephan,David., Krehbiel, Timothy and Berenson, Mark., Statistics for Managers using Microsoft Excel, Prentice Hall
5 ramesh.anbanandam@gmail.com 6
15 %
15 %
40 %
10%
10%
ramesh.anbanandam@gmail.com
Page 1
Statistics..
Plays an important role in many facets of human endeavour Occurs remarkably frequently in our everyday lives It is often incorrectly thought of as just a collection of data, graphs and diagrams
Statistics in Business
Accounting auditing and cost estimation Economics regional, national, and international economic performance Finance investments and portfolio management Management human resources, compensation, and quality management Management Information Systems (ERP): performance of systems which gather, summarize, and disseminate information to various managerial levels Marketing market analysis and consumer research International Business market and demographic analysis
7 ramesh.anbanandam@gmail.com 8
ramesh.anbanandam@gmail.com
What is Statistics?
Science of gathering, analyzing, interpreting, and presenting data Branch of mathematics Facts and figures Measurement taken on a sample Statistics is the scientific method that enables us to make decisions as responsibly as possible.
Statistics
The science of data to answer research questions
Formulate a research question(s) (hypothesis) Collect data Analyze and summarize data Draw conclusions to answer research questions
Statistical Inference
ramesh.anbanandam@gmail.com
ramesh.anbanandam@gmail.com
11
ramesh.anbanandam@gmail.com
12
Page 2
Census gathering data from the entire population Sample a portion of the whole
a subset of the population a part of the population from which we actually collect information, used to draw conclusions about the whole (statistical inference
13 ramesh.anbanandam@gmail.com 14
--Charles Grosvenor
Descriptive Statistics
Collect data
ex. Survey
Present data
ex. Tables and graphs
Characterize data
ex. Sample mean =
X
n
ramesh.anbanandam@gmail.com
15
Descriptive statistics..
Encompasses the following:
Graphical or pictorial display Condensation of large masses of data into a form such as tables Preparation of summary measures to give a concise description of complex information (e.g. an average figure) Exhibition of patterns that may be found in sets of information
ramesh.anbanandam@gmail.com 17
Inferential Statistics
Estimation
ex. Estimate the population mean weight using the sample mean weight
Hypothesis testing
ex. Test the claim that the population mean weight Drawing conclusions and/or making decisions is 120 pounds
Page 3
Inferential Statistics..
Especially relates to:
Determining whether characteristics of a situation are unusual or if they have happened by chance Estimating values of numerical quantities and determining the reliability of those estimates Using past occurrences to attempt to predict the future
Population
Sample x (statistic )
(parameter)
ramesh.anbanandam@gmail.com
19
ramesh.anbanandam@gmail.com
23
ramesh.anbanandam@gmail.com
24
Page 4
Types of Variables
Categorical (qualitative) variables have values that can only be placed into categories, such as yes and no. Numerical (quantitative) variables have values that represent quantities.
Types of Variables
Data
Categorical
Examples: Marital Status Political Party Eye Color (Defined categories)
Numerical
Discrete
Examples: Number of Children Defects per hour (Counted items)
Continuous
Examples: Weight Voltage (Measured characteristics)
Levels of Measurement
A nominal scale classifies data into distinct categories in which no ranking is implied.
Categorical Variables
Personal Computer Ownership Type of Stocks Owned Internet Provider
Categories
Yes / No Growth Value Other Microsoft Network / AOL
ramesh.anbanandam@gmail.com
27
Levels of Measurement
An ordinal scale classifies data into distinct categories in which ranking is implied
Categorical Variable Student class designation Product satisfaction Faculty rank Standard & Poors bond ratings Student Grades Ordered Categories Freshman, Sophomore, Junior, Senior Satisfied, Neutral, Unsatisfied Professor, Associate Professor, Assistant Professor, Instructor AAA, AA, A, BBB, BB, B, CCC, CC, C, DDD, DD, D A, B, C, D, F
Levels of Measurement
An interval scale is an ordered scale in which the difference between measurements is a meaningful quantity but the measurements do not have a true zero point. A ratio scale is an ordered scale in which the difference between the measurements is a meaningful quantity and the measurements have a true zero point.
Page 5
ramesh.anbanandam@gmail.com
32
Ratio
Parametric
Before presentation always check: the source of the data that the data has been accurately transcribed the figures are relevant to the problem
33 ramesh.anbanandam@gmail.com 34
ramesh.anbanandam@gmail.com
ramesh.anbanandam@gmail.com
35
ramesh.anbanandam@gmail.com
36
Page 6
3rd Qtr
2nd Qtr
1st Qtr
0
ramesh.anbanandam@gmail.com 37
20
40
60
80
100
38
ramesh.anbanandam@gmail.com
Frequency distributions
Frequency tables
Observation Table Frequency Cumulative Frequency 13 13 18 31 25 56 15 71 9 80
100 80 60 40 20 0 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr East
West
North
ramesh.anbanandam@gmail.com
39
ramesh.anbanandam@gmail.com
40
Frequency diagrams
Frequency 30 25 20 15 10 5 0 < 20 <40 <60 <80 <100 Frequency
have not been summarized in any way are also called raw data
Grouped data
<40 <60 <80 <100
ramesh.anbanandam@gmail.com
41
ramesh.anbanandam@gmail.com
42
Page 7
Frequency 6 18 11 11 3 1
ramesh.anbanandam@gmail.com 44
ramesh.anbanandam@gmail.com
Data Range
42 30 53 50 52 30 55 49 61 74 26 58 40 40 28 36 30 33 31 37 32 37 30 32 23 32 58 43 30 29 34 50 47 31 35 26 64 46 40 43 57 30 49 40 25 50 52 32 60 54
Smallest Largest
ramesh.anbanandam@gmail.com
45
ramesh.anbanandam@gmail.com
Class Midpoint
beginning class endpoint + ending class endpoint 2 30 + 40 = 2 = 35
1 class width 2
Relative Frequency
Class Interval 20-under 30 30-under 40 40-under 50 50-under 60 60-under 70 70-under 80 Total
47
Class Midpoint =
Frequency 6 18 11 11 3 1 50
ramesh.anbanandam@gmail.com
ramesh.anbanandam@gmail.com
Page 8
Cumulative Frequency
Cumulative Class Interval 20-under 30 30-under 40 40-under 50 50-under 60 60-under 70 70-under 80 Total Frequency 6 18 11 11 3 1 50 Frequency 6 18 + 6 24 11 + 24 35 46 49 50
Class IntervalFrequency Midpoint Frequency Frequency 20-under 30 6 25 .12 6 30-under 40 18 35 .36 24 40-under 50 11 45 .22 35 50-under 60 11 55 .22 46 60-under 70 3 65 .06 49 70-under 80 1 75 .02 50 Total 50 1.00
49 ramesh.anbanandam@gmail.com 50
ramesh.anbanandam@gmail.com
ramesh.anbanandam@gmail.com
52
Histogram
Class Interval Frequency 20-under 30 6 30-under 40 18 40-under 50 11 50-under 60 11 60-under 70 3 70-under 80 1
20
Histogram Construction
Class Interval Frequency 20-under 30 6 30-under 40 18 40-under 50 11 50-under 60 11 60-under 70 3 70-under 80 1
Frequency
Frequency
10
10 20 30 40 50 60 70 80 Years
0 0
10
20
10 20 30 40 50 60 70 80 Years
ramesh.anbanandam@gmail.com
53
ramesh.anbanandam@gmail.com
54
Page 9
Frequency Polygon
Class Interval Frequency 20-under 30 6 30-under 40 18 40-under 50 11 50-under 60 11 60-under 70 3 70-under 80 1
20
Ogive
Cumulative Class Interval Frequency 20-under 30 6 30-under 40 24 40-under 50 35 50-under 60 46 60-under 70 49 70-under 80 50
Fre quency 10
Frequency
0 0
20
40
60
10 20 30 40 50 60 70 80 Years
10
20
30
40 Years
50
60
70
80
ramesh.anbanandam@gmail.com
55
ramesh.anbanandam@gmail.com
56
Complaints by Passengers
COMPLAINT NUMBER 28,000 14,700 10,500 9,800 7,000 70,000 PROPORTION .40 .21 .15 .14 .10 1.00 DEGREES 144.0 75.6 50.4 50.6 36.0 360.0
58
1.00 0.90 0.80 0.70 0.60 0.50 0.40 0.30 0.20 0.10 0.00 0 10 20 30 40 Years 50 60 70 80
ramesh.anbanandam@gmail.com
57
ramesh.anbanandam@gmail.com
Complaints by Passengers
Company
Personnel 14% Equipment 15% Schedules, Etc. 10%
A B
C D E Totals
ramesh.anbanandam@gmail.com
ramesh.anbanandam@gmail.com
60
Page 10
39% 39%
C D
357,411 = 920,190
E Totals
ramesh.anbanandam@gmail.com
61
ramesh.anbanandam@gmail.com
Pareto Chart
100 90 80 70 100% 90% 80% 70% 60% 50% 40% 30% 20% 10% 0% Poor Wiring Short in Coil Defective Plug Other
15 9 15 7 5 Registered Vehicles (1000's)
Scatter Plot
Gasoline Sales (1000's of Gallons)
Gasoline Sales
Frequency
60 50 40 30 20 10 0
200
60 120 90
100
140 60
10 15 Registered Vehicles
20
ramesh.anbanandam@gmail.com
63
ramesh.anbanandam@gmail.com
64
Good Presentation
$
4 2
Minimum Wage
1980: $3.10
0
1990: $3.80
1960
1970
1980
1990
Page 11
Good Presentation
50 25 0 Q1 Q2 Q3 Q4
Good Presentations
$ Monthly Sales
Quarterly Sales
Quarterly Sales
39 36
Thank You
69
Page 12