Professional Documents
Culture Documents
1
CHAPTER FIVE
2
Quotes
• Data will talk to you if you are willing to listen. ‘Bergeson’
determine with what validity data can be said to indicate any conclusions”
Data analysis
• Data Analysis is the process of organizing, displaying, summarizing, and asking
questions about data.
Data analysis is the process of developing answers to questions through the examination and
interpretation of data.
identifying issues,
deciding on which methods are appropriate for answering the questions of interest,
• Data analysis
• An attempt by the researcher to summarize collected data.
• Data Interpretation
• Attempt to find meaning
Cont…….
Data can be analyzed manually or by using some soft wares
Example
2.Weka
4.Stata
5.Minitab
6. MATLAB
PROCESSING OPERATIONS
Editing is done to assure that the data are accurate, consistent (reliable) with
other facts gathered, uniformly entered, as completed as possible and have been
well arranged to facilitate coding and tabulation
2.Coding
2. Coding: Coding refers to the process of assigning numerals or other symbols to
answers so that responses can be put into a limited number of categories or classes
3. Classification: Most research studies result in a large volume of raw data which must
be reduced into homogeneous groups if we are to get meaningful relationships.
Tabulation is the process of summarizing raw data and displaying the same in
compact form (i.e., in the form of statistical tables) for further analysis
• Most people find ‘pictures’ much more helpful than ‘numbers’ in the sense
that, in their opinion, they present data more meaningfully.
Various possible types of presentation of data and justification for their use in given situations.
1. TABULAR FORMS
1. Minimum = 2
2. Maximum = 68
3. Number of observations = 25
4. Mode = 19
5. Median = 24
2. Line Graphs
A line graph is usually meant for showing the frequencies for various
values of a variable
• A pie chart or circular diagram is one which essentially displays the relative
figures (proportions or percentages) of classes or strata of a given sample or
population
Bar Charts
The bar chart is one of the most common methods of presenting data in a visual
form.
A bar chart consists of a set of bars whose heights are proportional to the
frequencies that they represent.
Exploring, Displaying and examining data
• Data exploration is the first step in data analysis and typically involves
summarizing the main characteristics of a dataset (collection of data
(usually in digital form).
• It is commonly conducted using visual analytics tools, but can also be done in
more advanced statistical software, such as R.
• The open-source programming language R has for a long time been popular
(particularly in academia) for data processing and statistical analysis
Cont…
Cont…
Hypothesis Testing
• WHAT IS A HYPOTHESIS?
• In the context of statistical analysis, we often talk about null hypothesis and
alternative hypothesis
(b) The level of significance: The significance level is the maximum value
of the probability of rejecting H0 when it is true and is usually
determined in advance before testing the hypothesis.
It is always some percentage (usually 5%) which should be chosen with great care,
thought and reason
The 5 percent level of significance means that researcher is willing to take as much
as a 5 percent risk of rejecting the null hypothesis when it (H0) happens to be true.
Cont….
• These may be decisions that are important to us such as, "Will I improve my
Programming grades if I spend more time studying C++?"
• For example, if we were to test the hypothesis that college freshmen study 20
hours per week, we would express our null hypothesis as:
H0: µ = 20
Cont…………
We test the null hypothesis against an alternative hypothesis, which is given
the symbol Ha.
The alternative hypothesis is often the hypothesis that you believe yourself!
In this example, our alternative hypothesis would express that freshmen do not
study 20 hours per week:
Ha: µ ≠ 20
Example A
• We have a medicine that is being manufactured and each pill (tablet) is supposed
to have 14 milligrams of the active ingredient. What are our null and alternative
hypotheses?
Solution
H0: µ = 14
Ha: µ ≠14
Our null hypothesis states that the population has a mean equal to 14 milligrams.
Our alternative hypothesis states that the population has a mean that is different
from 14 milligrams.
Example B
• The school principal wants to test if it is true what teachers say – that high
school juniors use the computer an average 3.2 hours a day. What are our null
and alternative hypotheses?
H0: µ = 3.2
Ha: µ ≠ 3.2
Our null hypothesis states that the population has a mean equal to 3.2 hours.
Our alternative hypothesis states that the population has a mean that differs
from 3.2 hours.
Deciding Whether to Reject the Null Hypothesis
For example, let’s say we want to find out why some people think they will
eventually graduate from a four-year college while others don’t.
The independent variable is some variable that you think might help you
answer this question.
• For variables that have an underlying order from low to high they can be
positive or negative. A positive value indicates that as one variable increases,
the other variable also increases. A negative value indicates that as one
The independent variable is some variable that you think might help explain
For example, if your two variables were education and voting you
might choose education as the independent variable and voting as your
dependent variable because you think that education will help you
explain why some people vote Democrat and others vote Republican
Example
• Today, one of the most basic means of communication in our professional life is
written presentation, such as scientific paper, technical report, assignment
report, abstract, thesis, conference report, etc.
Guidelines and tips that will improve your writing skills sufficiently to serve a purpose.
The main elements of good writing are:
• Once an idea has been identified or formulated, then the effort will be
to present this idea in the best possible way to the target audience
• Having properly identified the relevant reader of the scientific paper, we need
to understand this audience
Cont..
• Anyone who picks up to read your writing is either interested in acquiring
new information or achieving a better understanding.
• Therefore, in order to serve the reader, your paper should have pertinent
information
• The organization of the paper refers to the structure, i.e. the sequence in
which you present each type of information
• The scientific report should have distinctive and clearly evident component
parts.
• It is always desirable for you to create an outline of the paper based on the
component parts and filling in the major points you want to cover in each
part.
Cont…..
-Title
- Acknowledgement
- Abstract
- Introduction
- Results
- Discussion/Conclusion
- References
Chapter
Next