Professional Documents
Culture Documents
Statistics Lecture Part 3
Statistics Lecture Part 3
TO RESEARCH
Presentation of Data
Textual Presentation
Tabular Presentation
Graphical Presentation
Textual Presentation
Definition:
• It
is a statement or claim regarding a
characteristic of one or more
populations.
• Itis a preconceived idea, assumed to be
true but has to be tested for its truth or
falsity.
Textual Presentation
Definition:
A frequency distribution list each
category of data and the number of
occurrences for each category of data.
Example:
Solution:
10 - 19 25
Sample Table 20 - 29 36
30 - 39 40
40 and over 12
Guidelines for Determining the Lower Class
Limit of the First Class and Class Width
✦ Choosing the Lower Class Limit of the First
Class
Choose the smallest observation in the data
set or a convenient number slightly lower
than the smallest observation in the data set.
For example, the smallest observation is
10.2. A convenient lower class limit of the
first class is 10.
Guidelines for Determining the Lower Class
Limit of the First Class and Class Width
✦ Determining the Class Width
• Decide on the number of classes. Generally, there
should be between 5 and 20 classes. The smaller
the data set, the fewer classes you should have.
• Determine the class width by computing
xmax − xmin
cw =
nc
cw is the class width
nc is the number of classes
Round this value up to a convenient number.
Remember!
Creating the classes for summarizing
continuous data is an art form. There is no
such thing as the correct frequency
distribution. However, there can be less
desirable frequency distributions. The larger
the class width, the fewer classes a
frequency distribution will have.
Example:
Final Output
Example:
Identify problems with the following table.
Answer:
✦ Useless Information – Don’t show decimals if they
are not needed.
✦ Poor Alignment – Make sure alignment makes
sense.
• Don’t center numbers, always right justify – try
to align decimal points.
• Consider the appropriate placement of row titles.
• Headers are left justified.
✦ Difficult to Read – Use commas used when the
number exceeds a thousand.
Exercises:
Review the table and consider questions such as the
following.
Needs
Origin / Rating Poor Satisfactory V Good Excellent Total
Improvement
External 0% 2% 12% 19% 9% 41%
Internal 4% 8% 15% 23% 9% 59%
Grand Total 4% 10% 27% 41% 17% 100%
1. What percentage of the employees originated from within the
organization?
2. What percentage of the employees are both internal and rated
‘Very Good’?
3. What percentage of the employees received ‘Needs Improvement’
or ‘Poor’?
4. What category contains the greatest number of employees?
5. Do you see any notable differences in the percentage by category?
Exercises:
Consider the above Frequency Distribution of
Salaries.
Salary Frequency Percentage
41,000 - 50,000 1 1%
51,000 - 60,000 20 13%
61,000 - 70,000 53 35%
71,000 - 80,000 43 29%
81,000 - 90,000 26 17%
91,000 - 100,000 6 4%
101,000 - 110,000 1 1%
Total 150 100%
1.What percentage of the employees earns less than
80,000?
2.What is the salary range of values?
3.What is a mid-range of salaries?
4.What salary category includes the most employees?
Graphical Presentation
Definition:
!It is the most effective way to present
results in a study since it shows the
statistical values and relationship in a
pictorial or diagram form.
Graphical Presentation
✦ It is considered an important medium of
communication because we are able to create
a pictorial representation of the numerical
figures.
✦ Suited when we need to show the results of
the study to nonprofessionals and or people
who dislike numbers and too lengthy texts.
Bar Graph
✦ It is constructed by labeling each category
of data on either the horizontal or vertical
axis and the frequency or relative frequency
of the category on the other axis. Rectangles
of equal width are drawn for each category.
The height of each rectangle represents the
category’s frequency or relative frequency.
✦ It is use to organize discrete data.
Example:
Bar Graph
Remember!
Bar graphs may also be drawn with
horizontal bars. Horizontal bars are
preferable when category names are
lengthy.
Remember!
In bar graphs, the order of the categories
does not usually matter. However, bar
graphs that have categories arranged in
decreasing order of frequency help prioritize
categories for decision-making purposes in
areas such as quality control, human
resources, and marketing.
Histogram
✦ It is constructed by drawing rectangles for
each class of data. The height of each
rectangle is the frequency or relative
frequency of the class. The width of each
rectangle is the same and the rectangles
touch each other.
✦ It is a graph used to present quantitative
data, is similar to the bar graph.
✦ It is use to organize continuous data.
Example:
Histogram
Example:
Grouped Column Chart
Example:
Subdivided Column Chart
Exercises:
Which one do you think is the most
informative? Why?
Pie Chart
✦ It is a circle divided into sectors. Each
sector represents a category of data.The
area of each sector is proportional to the
frequency of the category.
✦ Pie charts are typically used to present the
relative frequency of qualitative data.
Inmost cases the data are nominal, but
ordinal data can also be displayed in a pie
chart.
Example:
Pie Chart
When should a bar graph or a
pie chart be used?
✦ Pie charts are useful for showing the
division of all possible values of a
qualitative variable into its parts.
✦ Bar graphs are useful when we want to
compare the different parts, not necessarily
the parts to the whole.
Line Graph
✦ A graph that shows information that is
connected in some way (such as change
over time)
✦ Line segments are then drawn connecting
the points.It is use to organize continuous
data.
✦ Very useful in identifying trends in the data
over time.
Example:
Simple Line Graph
Example:
Multiple Line Graph
Guidelines for Constructing
Good Graphics
✦ Title and label the graphic axes clearly,
providing explanations if needed. Include units
of measurement and a data source when
appropriate.
✦ Avoid distortion. Never lie about the data.
✦ Minimize the amount of white space in the
graph. Use the available space to let the data
stand out. If you truncate the scales, clearly
indicate this to the reader.
Guidelines for Constructing
Good Graphics
✦ Avoid clutter, such as excessive gridlines and
unnecessary backgrounds or pictures.
✦ Don’t distract the reader.
✦ Avoid three dimensions. Three-dimensional
charts may look nice, but they distract the
reader and often lead to misinterpretation of
the graphic.
Guidelines for Constructing
Good Graphics
✦ Do not use more than one design in the same
graphic. Sometimes graphs use a different
design in one portion of the graph to draw
attention to that area. Don’t try to force the
reader to any specific part of the graph. Let
the data speak for themselves.
✦ Avoid relative graphs that are devoid of data
or scales.
Exercises:
What features of the ‘Good Presentation’
make it better than the ‘Bad Presentation’?
Exercises:
What features of the ‘Good Presentation’
make it better than the ‘Bad Presentation’?
Measures of Central Tendency
1. Mean
2. Median
3. Mode
Mean
1. Range
2. Standard Deviation
3. Variance
Based on the figure below, determine which
between the two scatter diagram illustrate larger
variability?
Figure 1 Figure 2
Since the data points in figure 2 is more
scattered than the data points in figure 1,
then the data set depicted in figure 2 is
more varied.
Range
Bottom
20%
Percentiles
Descriptive measures that split the
ordered data into 100 equal parts.
Bottom Top 1%
1%
Bottom 2% Top 2%
Interpretation
Example:
1. Jennifer just received the results of her SAT
exam. Her SAT Mathematics score of 600 is in
the 74th percentile. What does this mean?
A percentile rank of 74% means that 74%
of SAT Mathematics scores are less than
or equal to 600 and 26% of the scores are
greater. So 26% of the students who took
the exam scored better than Jennifer.
Interpretation
Example:
2. A test mark is calculated to be at the 84th
percentile, what does this mean?
84% of the people who wrote the test
got the same mark or less than the test
mark and 16% of the people who
wrote the test scored higher than the
test mark.
Interpretation
Example:
3. Time taken to finish a test is 35 minutes.
This time was the first quartile. What does this
mean?
25% of the learners finished the exam
in 35 minutes or less, and 75% of the
learners finished the exam in more
than 35 minutes.
Interpretation
Example:
2. If your percentile score on the GRE is 90
then you scored better than 90 Than 90% of
those taking the test and you scored lower
than 10% of those taking the test.
3. If 25% of the population is shorter than you,
then your height is said to be at the 70%
percentile.