Professional Documents
Culture Documents
Questions can be asked and data recorded in many ways. Different types of questions are
appropriate for different purposes and different types of data can be used and analyzed
differently. It is important for the questionnaire writer to understand the range of question types
available because the choice of question type will determine the information that is elicited. It is
also important to understand the different types of data that will be generated, because that will
determine the types of analysis that can be carried out. The questionnaire writer should thus be
thinking about how the data are to be analyzed at the time that the questions are being formulated
so that the information collected can be analyzed in the way that is required.
a) Nominal Data: - The scale of measurement for a variable is nominal if the data
describing it are simple labels or names which cannot be ordered. This is the lowest level
of measurement. Even if it is coded numerically, or takes the form of numbers, these
numbers are still only labels. However, that number is purely arbitrary and implies no
value that can be given to the response category. The numbers are given for identification
purposes only.
Thus if a sampling point is described as ‘Urban’ and is given a code of 1, and ‘Rural’ is
assigned a code 2, there is no relative value implied between the two categories. All
nominal data are placed in a limited number of exhaustive categories and any analysis is
carried out on the frequencies within these categories. No other arithmetic is meaningful.
b) Ordinal Data: - If the exhaustive categories into which the set of data is divided can be
placed in a meaningful order, without any measurement being taken on each case, then it
is classed as ordinal data. This is one level up from nominal. We know that the members
of one category are more, or less, than the members of another but we do not know by
how much. Degree classifications are only ordinal. Athletes’ results depend on their
order of finishing in a race, not by 'how much' separates their times. Questionnaires are
often used to collect opinions using the categories: 'strongly agree', 'Agree', 'No opinion',
'Disagree' or 'strongly disagree'. The responses may be coded as 1, 2, 3, 4 and 5 for the
computer but the differences between these numbers are not claimed to be equal. In
ordinal data there will be meaningful inequalities the inequalities are qualitative and it is
not possible to find out the exact difference.
c) Interval Data: - Interval scales provide for a rating of each item on a scale that has a
numerically equal distance between each point, and an arbitrary, and therefore
meaningless, zero point. Such scales are used in order to determine the relative strength
of relationships between items. There are very few examples of genuine interval scales.
Temperature in degrees Centigrade provides one example with the 'zero' on this scale
being arbitrary. The difference between 30°C and 50°C is the same as the difference
between 40°C and 60°C but we cannot claim that 60°C is twice as hot as 30°C. It is
therefore interval data but not ratio data. Dates are measured on this scale as again the
zero is arbitrary and not meaningful. Test scores also considered as an interval scale for
instance two students Lemlem, Meron and Eyerusalem scored 20, 40 and 60 the
difference is equal but we can’t say Meron has twice knowledge than Lemlem.
d) Ratio Data:- Ratio scales are a particular type of interval scale. The distance between
each point on a ratio scale is constant, but the zero point has a real meaning, such that the
ratio between any two scores also has a meaning. Age is a ratio scale, with a 50-year-old
person being twice as old as a 25-year-old. Income is another scale of ratio data for
example Mame earns 4,000 birr/month, Selam earns 6,000 birr per month and Ayalew
earns 8,000 birr per month. The difference is visible and we can say that Ayalew’s income
is double than Mame’s income per month.
It's important to recognize that there is a hierarchy implied in the level of measurement
idea. At lower levels of measurement, assumptions tend to be less restrictive and data
analyses tend to be less sensitive. At each level up the hierarchy, the current level
includes all of the qualities of the one below it and adds something new. In general, it is
desirable to have a higher level of measurement (such as interval or ratio) rather than a
lower one (such as nominal or ordinal).
ii) Based on sources of data. The data will be primary and secondary data.
a) Primary sources of data: - these data are collected by the investigator/s/ (researcher/s/)
by himself/herself/ or enumerator/s/ on behalf of the investigator/s/(researcher/s/) for the
purpose of some specific objectives. Such kinds of data are original or firsthand
information in its nature for the investigator/s/ or researcher/s/.
b) Secondary sources of data: - these data are found in the archives of other organization
because of this secondary data is called library methods. Based on the use the secondary
data will be primary for the collector and secondary for other users who are not involved
in collecting the data.
iii) Based on role of time data will be cross sectional and time series data
Time is an important element of any research design, and here I want to introduce one of the
most fundamental distinctions in research design nomenclature: cross-sectional versus
longitudinal studies.
a) Cross sectional data: - A cross-sectional study is one that takes place at a single point in
time. In effect, you are taking a slice or cross-section of whatever it is you're observing or
measuring. Are used to gather information on a population at a single point in time. An
example of a cross sectional survey would be a questionnaire that collects data on
b) Time series data: - sets of observations collected for a sequence times usually at
equal intervals. A longitudinal study is one that takes place over time—you have at
least two (and often more) waves (distinct times when observations are made) of
measurement in a longitudinal design. Time series analysis requires that you have at
least twenty or so observations over time. Repeated measures analyses aren't often
used with as many as twenty waves of measurement.
iv) Based on the information desired data will be attribute (qualitative) and variable
(quantitative) data.
The data of past sales also have been collected. The primary and secondary data have been
collected to cover every aspect of the study. The primary data are related to behavior and
response of employees, dealers and customers. The secondary data shows the sales of the
company product wise. These data used in combination as per need of the study.
i) Secondary Data
Secondary data are the data collected by a party not related to the research study but collected
these data for some other purpose and at different time in the past. If the researcher uses these
data then these become secondary data for the current users. Secondary analysis, like content
analysis, makes use of already existing data sources. However, secondary analysis typically
refers to the reanalysis of quantitative data, rather than text. In our modern world, an
unbelievable mass of data is routinely collected by governments, businesses, schools, and other
organizations. Much of this information is stored in electronic databases that can be accessed and
analyzed. In addition, many research projects store raw data in electronic form in computer
archives so that others can also analyze the data. Secondary data is also used to gain initial
insight into the research problem. Secondary data is classified in terms of its source – either
internal or external. Internal, or in-house data, is secondary information acquired within the
organization where research is being carried out. External secondary data is obtained from
outside sources. Examples of data available for secondary analysis include: Census Bureau data,
Crime records, Standardized testing data, Economic data and Consumer data.
There are several important uses of secondary data even though the research design
might require the use of primary data. The most common uses are:
In some cases the information and insights gained from secondary data are sufficient to
answer the research questions.
Secondary data can provide the background necessary to understand the problem
situation and provide an overview of the market dynamics.
Secondary data often can provide exploratory information that can aid in the planning
and design of the instruments used to gather primary data.
Secondary data can serve as a check and standard for evaluating primary data.
Secondary data can suggest research hypotheses or ideas that can be studied in the
primary data phase of the research process.
Before using secondary data, the researcher must look the following factors;
A) Reliability of data: The reliability can be tested by finding out such things about the said
data:
(a) Who collected the data? (b) What were the sources of data? (c) Were they collected by using
proper methods (d) At what time were they collected? (e) Was there any bias of the compiler? (f)
What level of accuracy was desired? (g) Was it achieved?
B) Suitability of data: The data that are suitable for one enquiry may not necessarily be found
suitable in another enquiry. Hence, if the available data are found to be unsuitable, they should
not be used by the researcher. In this context, the researcher must very carefully scrutinize the
definition of various terms and units of collection used at the time of collecting the data from the
primary source originally. Similarly, the object, scope and nature of the original enquiry must
also be studied. If the researcher finds differences in these, the data will remain unsuitable for the
present enquiry and should not be used.
C) Adequacy of data: If the level of accuracy achieved in data is found inadequate for the
purpose of the present enquiry, they will be considered as inadequate and should not be used by
the researcher. The data will also be considered inadequate, if they are related to an area which
may be either narrower or wider than the area of the present enquiry. From all this we can say
that it is very risky to use the already available data. The already available data should be used by
the researcher only when he finds them reliable, suitable and adequate. But he should not blindly
discard the use of such data if they are readily available from authentic sources and are also
suitable and adequate for in that case it will not be economical to spend time and energy in field
surveys for collecting information. At times, there may be wealth of usable information in the
already available data which must be used by an intelligent researcher but with due precaution.
ii) Primary Data: - Primary data are those data that are collected for the first time by the
researcher for the specific research project at hand. Primary data are information collected by a
researcher specifically for a research assignment. In other words, primary data are information
that a company must gather because no one has compiled and published the information in a
forum accessible to the public. The first tenet of data gathering among researchers is to exhaust
all sources of secondary data before engaging in a search for primary data. Many research
questions can be answered more quickly and with less expense through the proper use of
secondary information. However, caution must be used to ensure that primary sources of
secondary data are used since they are generally more accurate and complete than secondary
sources of secondary data. Primary data are original in nature and directly related to the issue or
problem and current data. Primary data are the data which the researcher collects through various
methods like observations, interviews, surveys, questionnaires etc.
b) Scope of the Enquiry: Scope refers to the coverage of the survey with regard to the type
of information, the subject matter, geographical area covered, etc. If the information to be
obtained happens to be quite comprehensive and basic, the primary data would prove to
be more suitable.
c) Financial Resources: Finance, in fact, is a big constraint in statistical survey and you
have to act within this limitation. The availability of funds determines, to a large extent,
the type of data to be used in an investigation. In most of the cases, collection of primary
data 'requires more funds than the secondary data. When funds are very limited, we
usually go for secondary data. If adequate finance is available, then we can plan for
collecting primary data.
d) Time Factor: The required time for the collection of primary data and secondary data
should also be considered in deciding whether to select primary data or to use secondary
data. Collection of' primary data requires relatively more time than the collection of
secondary data. If there is sufficient time for the accomplishment of the investigation, we
may use primary data. But if the time is a constraint, then we should consider secondary
data. The time factor thus, affects the type of data to be collected for the survey.
e) Status of the Investigating Agency: This is another important factor in making choice of
data to be collected. Much depends upon whether the investigating agency is the
Government or some public organization/institution or an individual. In the first case one
can think of collecting primary data once large scale. However, for individuals, it is very
difficult to collect primary data on a large scale. For an individual it is economical and
practically feasible to use secondary data. Public organizations or institutions may also
take up field surveys for obtaining relevant information but that may not be the case with
private organizations. The Government or the public institutions can afford to spend more
money and employ adequate number of trained and competent staff for the collection of
primary data. But individuals or private organizations have lots of constraints in this
regards.
f) Human Resources: Availability of human resources also affects our choice concerning
the data. As you know, for collecting primary data you require more persons. If you have
competent and well trained staff you can easily organize field surveys and may collect
primary data. If you do not have enough human resources, you can plan to use secondary
data for your investigation.
g) Availability of Secondary Data: Secondary data can be made use of only if they are
available. If the secondary data are not available or if they are not adequate or not
suitable, there is no alternative except to collect primary data.
h) Degree of Accuracy Desired (Precision): The choice of data also depends upon the
degree of accuracy desired. Before making the choice of data, we have to decide on the
degree of accuracy desired. If the secondary data attain the same degree of accuracy as
desired in the present investigation, you can use secondary data. Otherwise it is advisable
to plan for collecting primary data that fulfill the requirement of the desired level of
accuracy.
Among all the factors discussed above, no single factor can form the basis of our choice for data.
Keeping all the factors in view, a decision has to be taken whether to use primary or secondary
data.
Closed questions, on the other hand, tend, in conversation, to bring it to a stop. This is because
there is a predictable and usually small set of answers to a closed question that the respondent
can give. Any question that simply requires the answer ‘yes’ or ‘no’ is a closed question, and not
helpful to opening out a conversation.
Open-ended questions, ask the participant a question and either the interviewer pauses
for the answer (which is unaided) or the participant records his or her ideas in his or her
own words in the space provided on a questionnaire. Survey researchers usually try to
reduce the number of such questions because they pose significant problems in
interpretation and are costly in terms of data analysis. Questionnaires will be
administered through enumerators, postal and email.
b) Questions should proceed in logical sequence starting with easy questions and then
moving on to more difficult ones. Personal questions should generally be avoided or may
be left to the end.
d) In order to ensure the reliability of respondent there should be some control questions.
They introduce a cross-check to see whether the information collected is correct or not.
e) Adequate space for answers should be provided in the questionnaire itself. There should
always be provision for indications of uncertainty e.g., "do not know", "no preference",
“other, please specify___” and so on.
f) Layout and design of the questionnaire should also be attractive so that it may attract the
attention of the respondents.
investigator contact third person for witness to supply necessary information about the
respondent.
d) Schedules: - This method of data collection is similar to that of the questionnaire. The
schedule is also a proforma containing a set of questions. The difference between the
questionnaire and the schedule is that the schedule is being filled in by the enumerators
who are specially appointed for the purpose. These enumerators go to respondents with
the schedules and ask them the questions from the schedule in the order they are listed.
The enumerator records the replies in the space meant for the same in the schedule itself.
In certain situations, schedules are handed over to respondents and the enumerators help
the respondents in recording the answers. Enumerators explain .the objectives of the
investigation and also remove the difficulties which the respondent may feel in
understanding the implications of a particular question(s) or the definition or concept of
difficult terms. Thus, the essential difference between the questionnaire and schedule is
that the former (i.e., questionnaire) is sent to the informants by post and in the latter case
the enumerators carry the schedule personally to informants and fill them in their own
handwriting. This method is usually adopted in investigations conducted by governmental
agencies or by some big organizations. For instance, population census all over the world
is conducted through this method.
Data collection through schedules requires enumerators for filling up schedules and as
such they should be very carefully selected. They should be trained to perform their job
well. They should be intelligent and must possess the capacity of cross-examination in
order to find out the fact. Above all, they should be honest, sincere, and hardworking and
should have the patience and perseverance. In drafting the schedules, all points stated for
a good questionnaire, must as well be observed.
e) Focus Group Discussions: - A person who leads a focus group interview and ensures
that everyone gets a chance to speak and contribute to the discussion. A focus group is a
group interview of approximately six to twelve or it depends up on the people who share
similar characteristics or common interests. A facilitator guides the group based on a
predetermined set of topics. The facilitator creates an environment that encourages
participants to share their perceptions and points of view. Focus groups are a qualitative
data collection method, meaning that the data is descriptive and cannot be measured
numerically. All records of the group discussions will not be taken as a data except those
data relevant to the topic of investigation.
Topics like employment, savings, investments, imports/exports. These publish the data
periodically; i.e. monthly, quarterly or annually.
b) Official publication of foreign government or international bodies like the UNO, World
Bank, ADB, WTO, UNESCO, etc.
f) Newspaper and Periodicals. Like reporter, Ethiopian herald, Adiss zemen, the economist
magazine and financial times etc.
This types of data will not distributed widely it may be found in the archives of private business
and shelf of an individuals. The statistical data needn’t always be published. There are various
sources of unpublished statistical material such as the records maintained by private firms,
business enterprises, scholars, research workers, etc. They may not like to release their data to
any outside agency.
References
1. Kothari, C.R., Research methodology: - Methods and Techniques, 2nd ed New Age
International (P) Ltd., New Delhi, 2004.
2. S.c. Gupta and V.k. Kapoor (2002). Fundamentals of Mathematical Statistics 10th ed.
Sultan Chand and Sons Publishers New Delhi.