/  256
 
http://www.netnam.vn/unescocourse/statistics/stat_frm.htm 
MODULE “STATISTICAL DATA ANALYSIS”
By Dr. Dang Quang A and Dr. Bui The Hong Institute of Information Technology Hoang Quoc Viet road, Cau giay, HANOI 
Preface
Statistics is the science of collecting, organizing and interpreting numerical and non-numericalfacts, which we call data.The collection and study of data is important in the work of many professions, so that training inthe science of statistics is valuable preparation for variety of careers, for example, economistsand financial advisors, businessmen, engineers and farmers.Knowledge of probability and statistical methods also are useful for informatics specialists invarious fields such as data mining, knowledge discovery, neural networks, and fuzzy systemsand so on.Whatever else it may be, statistics is first and foremost a collection of tools used for convertingraw data into information to help decision makers in their work.The science of data - statistics - is the subject of this course.Chapter 1 is an introduction into statistical analysis of data. Chapters 2 and 3 deal withstatistical methods for presenting and describing data. Chapters 4 and 5 introduce the basicconcepts of probability and probability distributions, which are the foundation for our study of statistical inference in later chapters. Sampling and sampling distributions is the subject of Chapter 6. The remaining seven chapters discuss statistical inference - methods for drawingconclusions from properly produced data. Chapter 7 deals with estimating characteristics of apopulation by observing the characteristic of a sample. Chapters 8 to 13 describe some of themost common methods of inference: for drawing conclusions about means, proportions andvariances from one and two samples, about relations in categorical data, regression andcorrelation and analysis of variance. In every chapter we include examples to illustrate theconcepts and methods presented. The use of computer packages such as SPSS andSTATGRAPHICS will be evolved.
 Audience
 
This tutorial as an introductory course to statistics is intended mainly for users such asengineers, economists and managers who need to use statistical methods in their work and for students. However, many aspects will be useful for computer trainers.
Objectives
Understanding statistical reasoningMastering basic statistical methods for analyzing data such as descriptive and inferentialmethodsAbility to use methods of statistics in practice with the help of computer software
Entry requirements
High school algebra course (+elements of calculus)Elementary computer skills
 
ii
CONTENTS
Chapter 1 Introduction....................................................................................................11.1 What is Statistics...................................................................................................11.2 Populations and samples......................................................................................21.3 Descriptive and inferential statistics......................................................................21.4 Brief history of statistics........................................................................................31.5 Computer softwares for statistical analysis...........................................................3Chapter 2 Data presentation..........................................................................................42.1 Introduction...........................................................................................................42.2 Types of data........................................................................................................42.3 Qualitative data presentation................................................................................52.4 Graphical description of qualitative data................................................................62.5 Graphical description of quantitative data: Stem and Leaf displays.....................72.6 Tabulating quantitative data: Relative frequency distributions..............................92.7 Graphical description of quantitative data: histogram and polygon......................112.8 Cumulative distributions and cumulative polygons..............................................122.9 Summary.............................................................................................................142.10 Exercises..........................................................................................................14Chapter 3 Data characteristics: descriptive summary statistics.....................................163.1 Introduction.........................................................................................................163.2 Types of numerical descriptive measures...........................................................163.3 Measures of location (or measures of central tendency).....................................173.4 Measures of data variation..................................................................................203.5 Measures of relative standing.............................................................................233.6 Shape.................................................................................................................263.7 Methods for detecting outlier...............................................................................283.8 Calculating some statistics from grouped data....................................................303.9 Computing descriptive summary statistics using computer softwares.................313.10 Summary...........................................................................................................323.11 Exercises..........................................................................................................33Chapter 4 Probability: Basic concepts..........................................................................354.1 Experiment, Events and Probability of an Event..................................................354.2 Approaches to probability.....................................................................................364.3 The field of events...............................................................................................364.4 Definitions of probability......................................................................................384.5 Conditional probability and independence...........................................................414.6 Rules for calculating probability...........................................................................434.7 Summary............................................................................................................464.8 Exercises............................................................................................................46
 
iiiChapter 5 Basic Probability distributions......................................................................485.1 Random variables................................................................................................485.2 The probability distribution for a discrete random variable...................................495.3 Numerical characteristics of a discrete random variable......................................515.4 The binomial probability distribution....................................................................535.5 The Poisson distribution.......................................................................................555.6 Continuous random variables: distribution function and density function..............575.7 Numerical characteristics of a continuous random variable...............................595.8 Normal probability distribution..............................................................................605.10 Exercises...........................................................................................................63Chapter 6. Sampling Distributions..............................................................................656.1 Why the method of sampling is important............................................................656.2 Obtaining a Random Sample...............................................................................676.3 Sampling Distribution...........................................................................................686.4 The sampling distribution of 
 x 
: the Central Limit Theorem.................................736.5 Summary.............................................................................................................766.6 Exercises.............................................................................................................76Chapter 7 Estimation...................................................................................................797.1 Introduction..........................................................................................................797.2 Estimation of a population mean: Large-sample case..........................................807.3 Estimation of a population mean: small sample case...........................................887.4 Estimation of a population proportion...................................................................907.5 Estimation of the difference between two population means................................927.6 Estimation of the difference between two population means: Matched pairs.......957.7 Estimation of the difference between two population proportions.........................977.8 Choosing the sample size....................................................................................997.9 Estimation of a population variance...................................................................1027.10 Summary.........................................................................................................1057.11 Exercises.........................................................................................................105Chapter 8 Hypothesis Testing..................................................................................1078.1 Introduction........................................................................................................1078.2 Formulating Hypotheses....................................................................................1078.3 Types of errors for a Hypothesis Test................................................................1098.4 Rejection Regions..............................................................................................1118.5 Summary...........................................................................................................1188.6 Exercises...........................................................................................................118Chapter 9 Applications of Hypothesis Testing...........................................................1199.1 Introduction........................................................................................................1199.2 Hypothesis test about a population mean..........................................................1199.3 Hypothesis tests of population proportions........................................................1259.4 Hypothesis tests about the difference between two population means...............1269.5 Hypothesis tests about the difference between two proportions.........................1319.6 Hypothesis test about a population variance......................................................1349.7 Hypothesis test about the ratio of two population variances..............................135

Share & Embed

More from this user

Recent Readcasters

Add a Comment

Characters: ...

luckynameleft a comment

muneer

Sinnerismleft a comment

Nice effort.