Professional Documents
Culture Documents
Math 212-Course Pack - Module 1
Math 212-Course Pack - Module 1
MATH 212
ENGINEERING DATA ANALYSIS
1|Page
ENGINEERING DATA ANALYSIS
Welcome Message
It is hoped that after completing all the tasks and activities in this
course pack, you will be able to demonstrate the expected outcomes and
gained insights that are useful in your field of specialization.
God bless!
Faculty Information:
Getting help
2|Page
ENGINEERING DATA ANALYSIS
TABLE OF CONTENTS
CONTENTS PAGE
Module 1 Outcomes……………..………………….. 12
Lesson 1 ………..……………………………….. 13
Application 1…………………………………………. 20
Application 2 …………………………………………. 24
Lesson 3 ……………………………………………… 26
Application 3 ………………………………………….. 31
Summary ………………………………………….. 32
Assessment …………………………………………… 33
References …………………………………... 35
3|Page
ENGINEERING DATA ANALYSIS
VISION
MISSION
GOALS
4|Page
ENGINEERING DATA ANALYSIS
LEADERSHIP SKILLS
SERVICE ORIENTED
LIFELONG LEARNING
PROFESSIONAL COMPETENCE
UNITY
STEWARDSHIP
EXCELLENCE
PROFESSIONALISM
5|Page
ENGINEERING DATA ANALYSIS
Good luck!
6|Page
ENGINEERING DATA ANALYSIS
COURSE INFORMATION
COURSE DESCRIPTION
COURSE OUTCOMES:
7|Page
ENGINEERING DATA ANALYSIS
COURSE ASSESSMENT
Learning Evidence
Learning
Description and other Details
Evidence
Multiple choice examination having problem-
LE1 Results of major solving components through UVE, Edmodo,
examinations Google Classroom, or other platform that will
be agreed upon. Midterm and/or Final
examinations maybe conducted physically in
the classroom if the situation permits.
Measurement Rubrics
Beyond Unsatisfactory
Area to Expected Satisfactory (25%)
Expectation
Assess (75%) (50%)
(100%)
Solution to Used an Used an Used an Attempted to
Problems appropriate appropriate appropriate solve the
concept to concept to concept but problem but
come up with come up come up with used an
a correct with a an entirely inappropriate
solution and solution. wrong concept that
use other But a part solution that led to a wrong
relevant of the led to an solution.
strategy to solution led incorrect
arrive at a to an answer.
correct incorrect
8|Page
ENGINEERING DATA ANALYSIS
answer answer.
Grading System
Submission of Requirements
o Digital submission
9|Page
ENGINEERING DATA ANALYSIS
Name of Student:
Course:
Subject:
Date of Submission:
Professor’s Name:
10 | P a g e
ENGINEERING DATA ANALYSIS
11 | P a g e
ENGINEERING DATA ANALYSIS
MODULE OVERVIEW
MODULE OUTCOME
12 | P a g e
ENGINEERING DATA ANALYSIS
Learning Outcome:
Introduction
A primary goal of statistical studies is to collect data that can then be used to
make informed decisions. It should come as no surprise that the ability to make good
decisions depends on the quality of the information available. This lesson introduces
the data analysis process, types of data , and the different method of data collection.
Abstraction
Statistics involves collecting, summarizing, and analyzing data. All three tasks are
critical. Without summarization and analysis, raw data are of little value, and even
sophisticated analyses can’t produce meaningful information from data that were not
collected in a sensible way.
Data collection and analysis allow researchers to answer such questions. The process
can be organized into the following six steps:
13 | P a g e
ENGINEERING DATA ANALYSIS
2. Deciding what to measure and how to measure it. The next step in the
process is deciding what information is needed to answer the questions of
interest.
3. Data collection. The data collection step is crucial. The researcher must first
decide whether an existing data source is adequate or whether new data must be
collected. Even if a decision is made to use existing data, it is important to
understand how the data were collected and for what purpose, so that any
resulting limitations are also fully understood and judged to be acceptable. If new
data are to be collected, a careful plan must be developed, because the type of
analysis that is appropriate and the subsequent conclusions that can be drawn
depend on how the data are collected.
4. Data summarization and preliminary analysis. After the data are collected,
the next step usually involves a preliminary analysis that includes summarizing
the data graphically and numerically. This initial analysis provides insight into
important characteristics of the data and can provide guidance in selecting
appropriate methods for further analysis.
5. Formal data analysis. The data analysis step requires the researcher to select
and apply statistical methods.
14 | P a g e
ENGINEERING DATA ANALYSIS
From Illustration 1, deciding how to select the 100 students and what data should
be collected from each student are steps 2 and 3 in the data analysis process.
These 100 students constitute a sample.
Definition 1.
Methods for organizing and summarizing data, such as the use of tables,
graphs, or numerical summaries, make up the branch of statistics called descriptive
statistics.
Definition 2.
A variable is any characteristic whose value may change from one individual or
object to another.
15 | P a g e
ENGINEERING DATA ANALYSIS
Definition 3.
The other two variables, number of textbooks purchased and distance to the
university, are both numerical in nature. Determining the value of such a numerical
variable (by counting or measuring) for each student results in a numerical data set.
Bivariate data result from obtaining a category or value of pairs of numbers on two
different characteristics.
Multivariate data result from obtaining a category or value for each of two or more
attributes (so bivariate data are a special case of multivariate data).
Illustration 4. Both height (in inches) and weight (in pounds) might be
recorded for each student in a class. The resulting is called a bivariate data
set. If the researcher is interested in determining height, weight, age, and
systolic blood pressure for each student in the class, the resulting data set is
called a multivariate data set.
There are two different types of numerical data: discrete and continuous.
16 | P a g e
ENGINEERING DATA ANALYSIS
Discrete data usually arise when observations are determined by counting. So that
data a and b are discrete data. The rest (c and d) are continuous data.
Definition 4.
A numerical variable results in discrete data if the possible values of the
variable correspond to isolated points on the number line.
Data collection is a vital step in the data analysis process. It is important to keep in
mind the questions hope to answer on the basis of the resulting data. Sometimes the
researcher is interested in answering questions about characteristics of a single
existing population or in comparing two or more well-defined populations. To
accomplish this, sample is selected from each population under consideration and use
the sample information to gain insight into characteristics of those populations.
Sometimes the questions you are trying to answer deal with the effect of
certain explanatory variables on some response and cannot be answered using data
from an observational study.
Illustration 6.
A professor may wonder what would happen to test scores if the required
laboratory time for a chemistry course were increased from 3 hours to 6 hours
per week. To answer such questions, the researcher conducts an experiment to
collect relevant data. The value of some response variable (test score in the
chemistry ) is recorded under different experimental conditions (3-hour lab and
6-hour lab).
17 | P a g e
ENGINEERING DATA ANALYSIS
Definition 5.
A study is an observational study if the investigator observes characteristics of
a sample selected from one or more existing populations.
A well-designed experiment can result in data that provide evidence for a cause-and-
effect relationship. This is an important difference between an observational study
and an experiment. In an observational study, it is impossible to draw clear cause-and-
effect conclusions because we cannot rule out the possibility that the observed effect
is due to some variable other than the explanatory variable being studied. Such
variables are called confounding variables.
Definition 6.
A confounding variable is one that is related to both group membership and
the response variable of interest in a research study.
1.4 Sampling
Bias in sampling is the tendency for samples to differ from the corresponding
population in some systematic way. The most common types of bias encountered in
sampling situations are selection bias, measurement or response bias, and
nonresponse bias.
18 | P a g e
ENGINEERING DATA ANALYSIS
Note: Bias is introduced by the way in which a sample is selected or by the way
in which the data are collected from the sample. Increasing the size of the sample,
although possibly desirable for other reasons, does nothing to reduce bias if the
method of selecting the sample is flawed or if the nonresponse rate remains high.
Note: Be careful not to confuse clustering and stratification. Even though both of
these sampling strategies involve dividing the population into subgroups, both the
way in which the subgroups are sampled and the optimal strategy for creating the
subgroups are different. In stratified sampling, we sample from every stratum,
whereas in cluster sampling, we include only selected whole clusters in the sample.
Because of this difference, to increase the chance of obtaining a sample that is
representative of the population, we want to create homogeneous groups for strata
and heterogeneous (reflecting the variability in the population) groups for clusters.
19 | P a g e
ENGINEERING DATA ANALYSIS
to form a sample. Results from such samples are rarely informative, and it is a
mistake to try to generalize from a convenience sample to any larger population.
Application: Exercise #1
3. For each of the situations described, state whether the sampling procedure is
simple random sampling, stratified random sampling, cluster sampling,
systematic sampling, or convenience sampling.
Closure
Well done! You have just finished Lesson 1 of this module. Should there
be some parts of the lesson which you need clarification, please ask your tutor during
your face-to-face or on-line interactions.
20 | P a g e
ENGINEERING DATA ANALYSIS
Now if you are ready, please proceed to Lesson 2 of this module which will
discuss the most widely used data collection procedure, the survey. Information from
surveys impact nearly every facet of our daily lives. Planning and conducting surveys
will be the focus of our next lesson.
21 | P a g e
ENGINEERING DATA ANALYSIS
Learning Outcome:
Introduction:
Abstraction:
The interviewer gets to decide what is relevant to the conversation and may
ask questions— possibly personal or even embarrassing questions. The
respondent, in turn, may refuse to participate in the conversation and may
refuse to answer any particular question. But having agreed to participate in
the survey, the respondent is responsible for answering the questions
truthfully.
22 | P a g e
ENGINEERING DATA ANALYSIS
3.Questions should not create opportunities for the respondent to feel threatened or
embarrassed.
In a perfect survey, the target population would be the same as the sampled
population. This type of survey rarely happens. There are always difficulties in
obtaining a sampling frame or being able to identify all elements within the target
population.
Having chosen a particular sample survey, how does one actually collect the
data?
The most commonly used methods of data collection in sample surveys are:
23 | P a g e
ENGINEERING DATA ANALYSIS
1. Interviews
a. Personal Interview. The procedure usually requires the interviewer
to ask prepared questions and to record the respondent’s answers. The
primary advantage of these interviews is that people will usually respond
when confronted in person. In addition, the interviewer can note specific
reactions and eliminate misunderstandings about the questions asked.
Application: Exercise #2
3. For the given situation, decide what sampling method you would use. Provide
an explanation of why you selected a particular method of sampling.
The major state university in Region A is attempting to lobby the state
legislator for a bill that would allow the university to charge a higher tuition
rate than the other universities in the country. To provide a justification, the
university plans to conduct a mail survey of its alumni to collect information
concerning their current employment status. The university grants a wide
variety of different degrees and wants to make sure that information is
obtained about graduates from each of the degree types. A 5% sample of
alumni is considered sufficient.
24 | P a g e
ENGINEERING DATA ANALYSIS
Closure
Congratulations! You have successfully completed the tasks and activities for
Lesson 2. It is expected that you have gained insights about planning and conducting
survey as data collection method.
Now if you are ready, please proceed to Lesson 3 of this module which will
discuss planning and conducting an experiment.
25 | P a g e
ENGINEERING DATA ANALYSIS
Learning Outcome:
Introduction:
Sometimes the questions you are trying to answer deal with the effect of
certain explanatory variables on some response. Such questions are often of the form,
―What happens when . . . ?‖ or ―What is the effect of . . . ?‖ Experiments provide a way
to collect data to answer these types of questions.
This lesson present the key concept of experiment design, and the methods of
experimental design.
Activity
Suppose in an experiment, the researchers decide to use two room
temperature settings, 18°C and 24°C. Further suppose that there are 10 sections of
first-semester Calculus1 that have agreed to participate in the study. The experiment
is designed in this way:
Set the room temperature to 18°C in five of the rooms and to 24°C in the other
five rooms on test day, and then compare the exam scores for the 18°C group and the
24°C group. Suppose that the average exam score for the students in the 18°C group
was noticeably higher than the average for the 24°C group.
Analysis
Based on the information given in the activity, could you conclude that the
increased temperature resulted in a lower average score? Yes or No.
If no, are their any factors that affects or are related to the exam scores? Can you
enumerate them?
Abstraction:
26 | P a g e
ENGINEERING DATA ANALYSIS
Before we describe the concepts of experimental design, the following terms are
defined:
Definition 1.
The explanatory variables are those variables that have values that are controlled
by the experimenter. Also called independent variable or factors.
The response variable is a variable that is not controlled by the experimenter and
that is measured as part of the experiment. Also called dependent variable.
In general, we can identify the explanatory variables and the response variable easily
if we can describe the purpose of the experiment in the following terms:
27 | P a g e
ENGINEERING DATA ANALYSIS
Definition 3.
An extraneous variable is one that is not one of the explanatory variables in the
study but is thought to affect the response variable.
If one instructor taught all the 20° sections and another taught all the 27°
sections, we would be unable to distinguish the effect of temperature from the effect
of the instructor. In this situation, the two variables (temperature and instructor) are
said to be confounded.
Definition 4.
Two variables are confounded if their effects on the response variable cannot be
distinguished from one another.
28 | P a g e
ENGINEERING DATA ANALYSIS
29 | P a g e
ENGINEERING DATA ANALYSIS
Even though this experiment consists of a sequence of trials all with the same
car, random assignment of trials to experimental conditions is still important because
there will always be uncontrolled variability. For example, temperature or other
environmental conditions might change over the sequence of trials, the physical
condition of the car might change slightly from one trial to another, and so on.
In experiments that use human subjects, use of a control group may not be
enough to determine whether a treatment really does have an effect. People
sometimes respond merely to the power of suggestion.
30 | P a g e
ENGINEERING DATA ANALYSIS
Definition
A placebo is something that is identical (in appearance, taste, feel, etc.) to the
treatment received by the treatment group, except that it contains no active
ingredients.
As long as the subjects did not know whether they were taking the placebo,
the placebo group would provide a better basis for comparison and would allow the
researchers to determine whether the treatment had any real effect over and above the
―placebo effect.
Application: Exercise #3
1. The head of the quality control department at a printing company would like to
carry out an experiment to determine which of three different glues results in the
greatest binding strength. Although they are not of interest in the current
investigation, other factors thought to affect binding strength are the number of
pages in the book and whether the book is being bound as a paperback or a
hardback.
a. What is the response variable in this experiment?
b. What explanatory variable will determine the experimental conditions?
c. What two extraneous variables are mentioned in the problem description? Are
there other extraneous variables that should be considered?
Closure
Congratulations! You have successfully completed the tasks and activities for
Lesson 3. It is expected that you are knowledgeable about obtaining data, through
survey and experiments.
You are almost done with this module. The module summary and assessment
will follow.
31 | P a g e
ENGINEERING DATA ANALYSIS
SUMMARY
32 | P a g e
ENGINEERING DATA ANALYSIS
ASSESSMENT
2. The ―A‖ City school district is planning a survey of 300 of its 15, 000 parents
or guardians who have students currently enrolled. They want to assess the
parents’ opinion about mandatory drug testing of all students participating in
any extracurricular activities, not just . An alphabetical listing of all parents or
guardians is available for selecting the sample. In each of the following
descriptions of the method of selecting the 300 participants in the survey,
identify the type of sampling method used (simple random sampling, stratified
sampling, or cluster sampling).
b. The schools are divided into five groups according to grade level
taught at the school:
Grade 6 –7, 8–9, 10 –12. Three separate sampling frames are constructed,
one for each group. A simple random sample of 100 parents or guardians is
selected from each group.
c. The school district is also concerned that the parent or guardian’s opinion
may differ depending on the age and sex of the student. Each name is
randomly assigned a number. The names with numbers 1 through 300 are
selected for the survey. The parent is asked to fill out a separate survey for
each of their currently enrolled children.
33 | P a g e
ENGINEERING DATA ANALYSIS
34 | P a g e
ENGINEERING DATA ANALYSIS
References
Peck, R., Olsen, C. and Devore, J.L. (2012): Introduction to Statistics and Data
Analysis(4th edition). Brooks/Cole/Cengage Learning, 20 Channel
Center Street Boston, MA 02210, USA
Walpole, RE, & Myers, RH.(1993). Probability and Statistics for Engineers and (5th
ed.). Macmillan Publishing Company, New York.
35 | P a g e