Professional Documents
Culture Documents
Statistics is a scientific body of knowledge that deals with the collection, organization or presentation,
analysis and interpretation of data.
APPLICATION OF STATISTICS
In Business
Statistics is used to summarize and describe data that a business firm collects or gathers
such as the amount of sales, expenditures, and productions to enable the management to
understand and determine the status of the firm.
In Education
Through statistical tools, a teacher can determine the effectiveness of a teaching method
by analyzing test scores obtained by students.
In Psychology
Psychologists can interpret meaningful aptitude test, IQ tests, and other psychological
test using statistical procedure or tools.
In Medicine
Statistics is used to determine the effectiveness of new drug in a disease.
In Agriculture
Through statistical tools, an agriculturist can determine the effectiveness of a new fertilizer in the growth
of plants
In Entertainment
● the most favorite actress and actor can be determined by using surveys
● Ratings of the members of the board of judges in a beauty contest are statistically analyzed
● Interviews are used to determine the most widely viewed television show
● The top grosser movie for this year is reported based on statistical records
In Everyday Life
● The number of cars passing through the streets are recorded to enable traffic enforcers to
manage efficiently.
● The number of pedestrians crossing the street, the number of people entering a department
store, the number of people engaged in video games involve the use of statistics.
In Ancient Judea
A census of the population was taken on several occasions, such as in 2030 B.C. when the population
was estimated at 3 800 000.
435 B.C.
The first Roman census was taken in the presence of the censors. Such census was repeated sixty-nine
times in the following 470 years of the Roman history.
Luke 2: 1-4
Now in those days was a degree went forth from Ceasar Augustus for all the inhabited earth to be
registered. This registration took place when Quirinus was a governor of Syria and all people went
travelling to be registered, each one to his own city. Of course, Joseph also went up from Galilee out of
the city of Nazareth into Judea to David’s city which is called Bethlehem, because of his being a member
of the house and the family of David.
MIDDLE AGES
When William the Conqueror took possession of England, he ordered that a survey be made of the lands
of England for purposes of taxation and military services. Taxes, military services and custom duties were
also recorded.
BEGINNING OF 16 TH CENTURY
A large number of Statistical handbooks were published. This type of descriptive statistics was referred to
as Die Tabellen Statistik. Captain John Graunt worked on the scientific analysis of publicly recorded
data. The registration of deaths was started by Henry VIII in 1532 and weekly bills of mortality were
instituted during the period of plague.
Karl Pearson, Marqauis de Laplace and Carl Freidrich Gauss independently worked on the methods
of finding correlation among several variables.
Sir Ronald Fisher discovered a unified theory for drawing conclusions from statistical data and
contributed to the theory of design experiments which provided a technique of collecting primary data.
Scales of Measurement
Why Is Level of Measurement Important?
● Helps you decide what statistical analysis is appropriate on the values that were assigned
● Helps you decide how to interpret the data from that variable
1. Nominal Scale- data that cannot be ordered nor can it be used in calculations
● The values “name” the attribute uniquely.
● The value does not imply any ordering of the cases
Examples:
● jersey numbers in football
● Categories
● Colors
● names
2. Ordinal Scale - data that can be ordered; the differences cannot be measured
● When attributes can be rank-ordered
● Distances between attributes do not have any meaning
Examples:
● finishing position in a race
● Educational Attainment as 0=less than H.S.; 1=some H.S.; 2=H.S. degree; 3=some college;
4=college degree; 5=post college
3. Ratio Scale- data with a starting point that can be ordered; the differences have meaning, and ratios
can be calculated.
● Has an absolute zero that is meaningful
● Can construct a meaningful ratio (fraction), for example, number of clients in past six months
● It is meaningful to say that “...we had twice as many clients in this period as we did in the
previous six months
Examples:
● Weight of 200 cancer patients in the past 5 months
● Height of 549 newborn babies
● Diameter of 150 donuts
4. Interval Scale - data with a definite ordering but no starting point; the differences can be measured,
but there is no such thing as a ratio.
● When distance between attributes has meaning
Example:
● Temperature scales like Celsius (C) and Fahrenheit (F) ,distance from 30-40 is same as distance
from 70-80
b. Quantitative data are data which are numerical in nature. These data are obtained from counting or
measuring. In addition meaningful arithmetic operations can be done with these type of data.
Examples:
● Test score
● Height and Weight
● Amount of money you have
● Number of students who take statistics
Continuous variable is one that can assume infinite values within a specified interval. The values of the
variable are obtained through measuring.
Examples:
● Height of children
● Weight of cars
● Time to wake up in the morning
● Speed of the train
*Discrete data is counted, Continuous data is measured
Secondary sources - build upon primary resources by analyzing, interpreting, synthesizing or discussing
them.
Examples:
● Books
● Encyclopedia
● Journals
● research or studies conducted by other individuals
Methods of Collecting Data
The Direct or Interview Method- It is one of the most effective method in collecting data. To obtain
accurate responses, the interview may be done by well – trained interviewers.
- It is a person-to-person encounter between the one soliciting the information and the interviewee
Examples:
● A business firm would interview residents of a certain barangay regarding their favorite brand of
toothpaste, soap or shoes.
● TV personnel would ask the tele viewers about their favorite noontime show.
The Indirect or Questionnaire Method- It is one of the easiest methods of gathering data. It takes time
to prepare because questionnaires need to be attractive. Its contents, especially the directions, must be
precise, clear, and self-explanatory. In this method, the researcher makes use of questionnaire which may
be distributed either by personal delivery or by mail.
Registration Method- Through this, the respondents provide information in compliance with certain
laws, policies, rules, regulations, decrees or standard practices.
Examples:
● marriage contracts
● birth certificates
● motor registrations
● license of firearms
Experimentation Method- An experiment is applied to collect data if the investigator wants to control the
factors that affecting the variable being studied.
Examples:
● How does the weather affect the growth of bacteria?
● Does the new brand of fertilizer increase the harvest of rice per hectare?
Focus groups- It uses open-ended questions. These questions ground the research in a state of mind
eliminating external interference.
Examples of open-ended questions:
● “How did you feel about the presentation?”
Oral histories- Is a precisely defined recording, preservation and interpretation of historical information
based on the experiences of people involved in the event. It is linked into a single phenomenon.
Example:
● A researcher studying the effect of a flood on a community.
Grouped data
● are data that are organized.
● are the which are categorized and put into different classes.
Textual Method- One of the ways on how to present data is through textual method or paragraph
Method. In this method, data are presented by enumerating the characteristics, features, and significant
figures of the data.
Stem-and-Leaf Plot is a table which sorts a data into a pattern, separating a number into two parts.
Frequency Distribution Table for Ungrouped Data – is an arrangement of data from lowest to highest
which shows the frequency of occurrence of each value in a set.
Frequency Distribution Table for Grouped Data – is an arrangement of data into different classes or
categories. It also involves counting the data fall into each class.
Class Boundaries
- known as real or true class limits.
- more accurate expressions of class limits by at least 0.5
Class Marks
- is the midpoint or middle value of a class interval.
Class Size
- refers to the difference between the upper class boundary and the lower class boundary of a class
interval.
Example: Class size of the class boundaries 4.5 and 9.5 is 5
Class Frequency
- means the number of observations belonging to a class interval.
Range
- refers to the difference between the highest and lowest scores.
2. Histogram– is a graph represented by vertical or horizontal rectangles whose bases are the class
marks and whose heights are the frequencies.
3. Frequency Polygon – is a line graph whose bases are the class marks and whose heights are the
frequencies.The class marks are plotted against the frequencies and the points are connected by a
smooth curve.
4. Pie Chart – is a circle graph showing the properties of each class through either the relative or
percentage frequency.
5. Ogive – is a line graph where the bases are the class boundaries and the heights are the <cf for less
them ogive and >cf greater than ogive.