Professional Documents
Culture Documents
DATE: 23-10-2022
Learning Outcomes 学习成果
At the end of this class, students will be able to:
❑ For example, data that is hard or impossible to replace (e.g. the recording of an event at a specific time and place) requires extra backup procedures to
reduce the risk of data loss. Or, if you will need to combine data points from different sources, you will need to follow best practices to prevent data corruption.
Research data can be generated for different purposes and through different processes.
❖ Observational data is captured in real-time, and is usually irreplaceable, for example sensor data, survey data, sample data, and neuro-images.
❖ Experimental data is captured from lab equipment. It is often reproducible, but this can be expensive. Examples of experimental data are gene sequences,
❖ Simulation data is generated from test models where model and metadata are more important than output data. For example, climate models and economic
models.
❖ Derived or compiled data has been transformed from pre-existing data points. It is reproducible if lost, but this would be expensive. Examples are data
❖ Reference or canonical data is a static or organic conglomeration or collection of smaller (peer-reviewed) datasets, most probably published and curated.
For example, gene sequence databanks, chemical structures, or spatial data portals.
Sources of data
As I told you before
source of data….!
Records/previous
Sources of data
studies
(secondary data)
Comprehensive
(universal)
Surveys
(primary data)
Sample
Experiments
(primary data)
Classification of Research Data
❑ Qualitative data describes qualities or characteristics.
Qualitative
❑ It is collected using questionnaires, interviews, or observation,
(Categorical)
and frequently appears in narrative form.
Research Data ❑ For example, it could be notes taken during a focus group on
the quality of the food at KFC, or responses from an open-
Quantitative ended questionnaire.
(Numerical) ❑ Qualitative data may be difficult to precisely measure and
analyze.
❑ Quantitative data are used when a researcher is trying to quantify a problem, or address ❑ The data may be in the form of descriptive words that can be
the "what" or "how many" aspects of a research question. examined for patterns or meaning, sometimes through the use
❑ It is data that can either be counted or compared on a numeric scale. of coding.
❑ For example, it could be the number of third semester students at LUC, or the ratings on
❑ Coding allows the researcher to categorize qualitative data to
a scale of 1-4 of the quality of food served at McDonald’s or KFC.
identify themes that correspond with the research questions
❑ This category of data are usually gathered using instruments, such as a questionnaire
and to perform quantitative analysis.
which includes a ratings scale or a thermometer to collect weather data.
❑ Statistical analysis software, such as SPSS, is often used to analyze quantitative data.
Qualitative or Quantitative?
❑ Research topics may be approached using either quantitative or qualitative
methods. Should I use qualitative
or quantitative data in
❑ Choosing one method or the other depends on what you believe would provide the my research?
best evidence for your research objectives.
❑ Researchers sometimes choose to incorporate both in their research since these
methods provide different perspectives on the topic.
❑ For example, you want to know the locations of the most popular study spaces in
LUC Wisma campus, and why they are so popular.
❑ To identify the most popular spaces, you might count the number of students
studying in different locations at regular time intervals over a period of days or
weeks. This quantitative data would answer the question of how many people
study at different locations on campus.
❑ To understand why certain locations are more popular than others, you might use a
survey to ask students why they prefer these locations. This is qualitative data.
Classification of Research Data Cont’d
Categories
Ordinal
Qualitative
Ranks
(Categorical)
Binary
Nominal
Non-Binary
Research Data
Discrete
(Counting)
❑ For example, ordinal data is said to have been collected when a responder inputs his/her
❑ In ordinal data, there is no standard scale on which the difference in each score is measured.
❑ This is to show that the scale is usually influenced by personal factors and not due to a set rule.
❑ Examples include:
❖ Agreement (strongly disagree, disagree, neutral, agree, strongly agree)
❖ Degree/severity of illness (mild, moderate, severe)
❖ Rating (excellent, good, fair, poor)
❖ Frequency (always, often, sometimes, never)
❖ Classification
✓ (1st , 2nd, 3rd, …..)
✓ primary, secondary, tertiary….
✓ grades (A B C D E F)
Ordinal Scale
❑ The Ordinal scale includes statistical data type where variables are in order or rank but
without a degree of difference between categories.
❑ The ordinal scale contains qualitative data.
❑ It places variables in order/rank, only permitting to measure the value as higher or
lower in scale.
❑ You can use an ordinal scale for research and survey purposes to understand the
higher or lower value of a data set. The scale identifies the magnitude of the variables.
❑ It does not explain the distance between the variables.
❑ The ordinal scale cannot answer “how much” different the two categories are.
❑ Like a Likert scale, the ordinal scale can measure frequency, importance, satisfaction,
likelihood, quality, and experience, etc.
❑ The measures in ordinal scale do not have absolute value hence the real difference
between adjacent values may not have the same meaning.
❑ For example, the values in the age scale “less than 20” and “20-50” do not have the
same meaning as “50-80” and “over 80”.
Likert Scale
❑ Likert scale is a point scale used by researchers to take surveys and ❑ A 4 point Likert scale is basically a forced Likert scale.
❑ The reason it is named as such is that the user is
get people's opinion on a subject matter.
forced to form an opinion.
❑ It is usually a 5 or 7-point scale with options that range from one ❑ There is no safe 'neutral' option.
extreme to another. ❑ It is mostly used by market researchers to get specific
responses.
Take for example:
1 2 3 4 5
number). On the other hand, various types of qualitative data can be nominal data, while that of 2 is an ordinal data.
❑ A nominal scale does not depend on numbers because it deals with non-
numeric attributes.
❑ For example, in a marathon race, all the contestants are given a number.
❑ These numbers are for the purpose of identifying the contestant. The
numbers don’t have any association with the result of the race or with the
❑ Discrete data includes discrete variables that are finite, numeric, countable, and
non-negative integers.
❑ In many cases, discrete data can be prefixed with “the number of”.
For example:
❑ This type of data is mainly used for simple statistical analysis because it is easy
to summarize and compute.
❑ Some continuous data will change over time, such as the weight of a
baby in its first year or the temperature in a room throughout the day.
❑ The numbers of continuous data are not always clean and integers,
as they are usually collected from very precise measurements.
Interval Scale
❑ An interval scale can be defined as a quantitative measurement scale where
variables have an order, the difference between two variables is equal, and the
presence of zero is arbitrary.
❑ It can be used to measure variables that exist along a common scale in equal
intervals.
❑ Interval scales are best suited in surveys where respondents must enter values
regarding temperature, time, and dates.
❑ Interval scales can be easily integrated into multiple choice questions or rating
scale questions by asking respondents to use a numerical scale to make a rating.
For example:
❑ Net Promoter Score surveys measure the likelihood of customers recommending
a company’s products or services to others.
❑ It does so by asking them to rate their likelihood to do so on a numeric scale from
0 to 10, where 0 indicates they are not likely at all, and 10 indicates they are very
likely.
Ratio Scale
❑ Ratio scale is a type of variable measurement scale which is
quantitative in nature.
❑ It allows any researcher to compare the intervals or differences.
❑ Ratio scale is the 4th level of measurement and possesses a zero
point or character of origin. This is a unique feature of this scale.
❑ For example, the temperature outside is 0-degree Celsius. 0 degree
doesn’t mean it’s not hot or cold, it is a value.
❑ A ratio scale is the most informative scale as it tends to tell about
the order and number of the object between the values of the scale.
❑ The most common examples of this scale are height, money, age,
weight, blood pressure etc.
❑ With respect to market research, the common examples that are
observed are sales, price, number of customers, market share etc.
Discrete vs Continuous Data
❑ Both data types are important for statistical analysis. However, some major
differences need to be noted before drawing any conclusions or making
decisions.
❑ Discrete data is the type of data that has clear spaces between values.
Continuous data is data that falls in a constant sequence.
❑ To accurately represent discrete data, the bar graph is used. Histogram or line
graphs are used to represent continuous data graphically.
❑ Each row corresponds to a given member of the dataset, as per the given
question.
❑ Datasets describe values for each variable for unknown quantities such as height,
weight, temperature, volume, etc of an object or values of random numbers.
❑ The dataset consists of data of one or more members corresponding to each row.
Data Table
❑ A dataset organized into a table, with one column for each variable and one row
for each person.
Definitions for Variables & Typical Data Table
OBS AGE BMI FFNUM TEMP( 0F) GENDER EXERCISE LEVEL QUESTION
❑ AGE: Age in years 1 26 23.2 0 61.0 0 1 1
2 30 30.2 9 65.5 1 3 2
❑ BMI: Body mass index, weight/height2 in kg/m2
3 32 28.9 17 59.6 1 3 4
To understand the general characteristics or Distribute a list of questions to a sample online, in person or
Survey
opinions of a group of people. over-the-phone.
To gain an in-depth understanding of perceptions Verbally ask participants open-ended questions in individual
Interview/focus group
or opinions on a topic. interviews or focus group discussions.
Observation To understand something in its natural setting. Measure or survey a sample without trying to affect them.
To study the culture of a community or Join and participate in a community and record your
Ethnography
organization first-hand. observations and reflections.
To understand current or historical events, Access manuscripts, documents or records from libraries,
Archival research
conditions or practices. depositories or the internet.
To analyze data from populations that you can’t Find existing datasets that have already been collected, from
Secondary data collection access first-hand. sources such as government agencies or research
organizations.
❑ Carefully consider what method you will use to gather data that helps you directly answer your research questions.
Class Attendance 课堂出勤
Please click on the link below to submit your class attendance.
https://forms.gle/SPizKfEhKFNGrbNh6