Professional Documents
Culture Documents
The term ‘data’ is a plural form of the Latin word In Information Science, “data as quantitative facts
‘datum,’ and literally, it means anything that is derived from experimentation, calculation, or
given. direct observation.”
Data are defined in McGraw-Hill Encyclopaedia of i) Data with reference to scale of measurement:
Science and Technology as “numerical or Based on the scale of measurement, data can be
qualitative values derived from scientific categorised as follows:
experiments.”
• Nominal data
In social sciences, “data are stated as values or • Ordinal data
facts, together with their accompanying study • Interval data
design, codebooks, research reports, etc. and are • Ratio data
used by researchers for the purpose of secondary ii) Data with reference to continuity: Data with
analysis.” reference to continuity can be categorised as
follows:
In humanities, the text, such as Biblical materials or
• Continuous data
Shakespeare’s drama deals with a fixed quantity of
• Discrete data
data represented by a finite amount of text to be
interpreted.
iii) Data with reference to the number of a) primary data: The data obtained first-hand
characteristics: Data can also be categorised on the from individuals by direct observation, counting,
basis of the number of variables considered. These and measurement or by interviews or mailing a
are: questionnaire are called primary data. It may be
complete enumeration or sampling, e.g., data
• Univariate data –Univariate data are collected from a market survey.
obtained when one characteristic is used for
observation, e.g., the performance of students in a b) Secondary data: The data collected initially
given class. for the purpose and already published in books or
• Bivariate data – Bivariate data result when reports but are used later on for some other purposes
instead of one, two characteristics are measured are referred to as secondary data. For example, data
simultaneously, e.g., height and weight of tenth class collected from census reports, books, data
students. monographs, etc.
• Multivariate data – Multivariate data
vi) Data with reference to characteristic: Data can
consist of observations on three or more
be categorised on the basis of the characteristics as
characteristics, e.g., family size, income, and
follows:
savings in a metropolitan city in India.
iv) Data with reference to time: There are two a) Quantitative data: When the characteristic
types of data under this category. These are: of observation is quantified, we get quantitative
data. Quantitative data result from the measurement
a) Time series data: Data recorded in a of the magnitude of the characteristic used. For
chronological order a cross-time are referred to as example, age of a person, price of a commodity,
timeseries data. It takes different values at different income of a family, etc.
times, e.g., the number of books added to a library
in different years, monthly production of steel in a b) Qualitative data: When the characteristic of
plant, yearly intake of students in a university. observation is a quality or attribute, we get
qualitative data. For example, sex or colour of a
b) Cross-sectional data: This refers to data for person, or intelligence of a student
the same unit or for different units at a point of time,
e.g., data across sections of people, regions or
Sources of Data
segments of the society.
The sources of data can be categorised as the
v) Data with reference to origin: Data under this followings:
category can be put as follows:
Static Data: Static data is those data that do not Here, some common examples of primary source
change during processing. This type of data cannot documents:
be changed when written or printed.
• Historical documents (letters, pamphlets, political
Examples: A newspaper story: hardcopy cannot be tracts, manifestos)
changed once printed. Data stored on a CD ROM • Data and Research Results (scientific article
(not rewritable): a CD ROM cannot be edited. presenting original findings, statistics)
• Original works of art
These are two examples of static data or information • Video footage & photographs
as they cannot be changed. • Works of literature
• Interview transcripts
Dynamic data: Dynamic data refers to data that • Eyewitness accounts, newspapers articles &
changes during processing – it is updated as and autobiographies
when necessary. The data is never expected to be the • Blogs articles, tweets, and other social media entries
same when re-input. • Lab notebooks and case studies
Examples: Data on a webpage that is updated from Secondary Sources: a secondary source is an
time to time. Data on a CD RW can be rewritten or interpretation, analysis, discussion, or evaluation
edited Data from a stock market. of an event or issue that is based on primary source
evidence. Secondary sources list, summarize,
These are examples of dynamic data, as they change compare, and evaluate information and studies, so as
with time. to draw conclusions or present on the current state
of knowledge on a topic. Secondary sources are
Based on the Origin often in the form of scholarly discourse or reviews.
Primary source: A primary source is an original Common examples of secondary sources are:
document that contains first-hand information about
a topic or an event. Primary sources exist on a • Biographies
spectrum, and different fields of study may use • Indexes, Abstracts, Bibliographies
Several types of mediums are used for expressing depict frequency distribution. • Size of the class
Mapping of Data
Ans: (b) From the graph given in the question: Q. 1: Which of the following units shows
continuous increase in production of sugar over
Tickets sold by A=16 months?
Tickets sold by B= 8
500
a) B b) A c) C 450
450
d) D 400
350
300
250
Ans. (b) 250
200
150
150
75
Q. 2: In the case of Unit E, in which of the following 100
50
50 25
pairs of months the production of sugar was equal? 0
15 20 25 30 35 40 45
a) June & July b) April & June Age ( in Years )
c) July & August d)April & May The total number of persons in the age group of
15 yr to 45 yr is
Ans. (d) (A) 450 (B) 800 (C) 1000 (D) 500
Sol. (C) Total number of persons = 450 + 250 + 150
Q. 3: In the month of June, how many units have a + 75 + 50 + 25 = 1000
share of more than 25% of the total production of
sugar? The number of persons in the age group (20-30)
a) one b) Three yr is
c) Two d) Four A) 475 (B) 400 (C) 300 (D) 700
Sol .(B) Number of persons = 250 + 150 = 400
Ans. (a)
The ratio of the number of persons between the
Q 4: What was approximate percentage decrease in age group of (20-25) and (30-35) yr is
sugar production of unit B in June as compared to A)1:3 (B) 2:1 (C) 10:3 (D) 6:1
April? Sol. (C) Required ratio = 250: 75 = 10:3
a) 8 % b) 1 0% c)
1 5% d) 1 8% The ratio of maximum population in an age
group to the total number of persons under study
Ans. (b) is
Histogram A)4:5 (B)9:10 (D)2:5
Directions (Example Nos. 1-5) Study the Mowing (C)9:20
histogram and answer the questions Sol. (C) Maximum population = 450 Total
population = 1000
Required ratio
The percentage of population under study which A) 0·8 (B) 1·25 C) 8 (D)
is in the age group of (40-45) yr is 0·25
(A) 2.5 (B) 3.5 (C) 1.5 (D) Sol. (A) Let the required value is x,
5 then 320 = x × 400
1989 to 1990 = =
4·76%
Required amount = Rs. (420 – 320) × 1000 = Rs. and are below 1986, 1987
1,00,000
∴Required ratio = 4 : 2 = 2 : 1
What are the approximate average sales (in
The sales in 1987 are how many times to that in
thousands) for the years 1988 to 1991?
1988?
A) 420 (B) 425 C) 430
(D) None of these
= 18 𝐿𝑎𝑘ℎ 𝑢𝑛𝑖𝑡𝑠
30
25
3. Which year had the highest percent
20
increase in production over the previous year?
15
60
in 1998 are in the ratio 3 : 2 and the population
50 of the state ‘M’ in 1999 was 126 lakh, then what
40 was the population of the state ‘R’ in 2000—
30
(A) 70 lakh (B) 93·60 lakh (C)
20
152 lakh (D) 65 lakh
10
Sol. (B) Let the population of the states ‘M’ and ‘R’
0
1996 1997 1998 1999 2000 2001
Years in 1998 is = 3x and 2x respectively
= 66 %
Sol. (D) The population of the state ‘P’ in 1998
2. The population of the state ‘O’ in the year
of 1998 was 8 lakh, then what was its = 62 × × = 40 lakh
approximate population in the year of 2000?
The population of state ‘N’ in 1998 = 40 lakh
(A) 24 lakh (B) 26 lakh (C) 14 lakh
and the population in 2000
(D) 23 lakh (E) None of these
Sol. (E) The population of the state ‘O’ in the year = 40 × 1·2 × 1·4 = 67·20 lakh
of 2000
5. The population of the state ‘M’ in 2000
was what fraction of its population in 1998?
=8× × = 23 lakh
= Rs. 780 per month
(A) (B) (C) (D)
∴ 100% → 360°
Pie Chart
∴
Directions (1-4)—Study the following diagram of
35% →
Pie chart carefully and answer the questions below
it.
= 126
5. What is the ratio of British tourists below Wikipedia explains as “data governance
20 years to the Russian tourists above 40 years? encompasses the people, processes, and information
(A) 1:2 (B) 12 : 1 (C) 3 : 4 technology required to create a consistent and proper
(D) 4 : 3 handling of an organisation’s data across the
Sol. (B) business enterprise. Goals may be defined at all
British tourists below 20 years levels of the enterprise and doing so may aid in
acceptance of processes by those who will use
Russian tourists above 40 years them.”
organisational data. The main goals of data success for data governance initiatives:
4. Classification is applicable in case of: 11. A series arranged according to each and every
(a) Normal characters item is known as: (a) Discrete series
(b) Quantitative characters (b) Continuous series
(c) Qualitative characters (c) Individual series
(d) Both (b) and (c) (d) Time series
5. In classification, the data are arranged according 12. A frequency distribution can be:
to: (a) Similarities (a) Qualitative
(b) Differences (b) Discrete
(c) Percentages (c) Continuous
(d) Ratios (d) Both (b) and (c)
13. Frequency distribution is often constructed
6. When data are arranged at regular interval of with the help of:
time, the classification is called: (a) Entry table (b) Tally sheet
(a) Qualitative (c) Both (a) and (b)
(b) Quantitative (d) Neither (a) and(b)
(c) Chronological 14. The data given as 3, 5, 15, 35, 70, 84, 96 will be
(d) Geographical called as: (a) Individual series
(b) Discrete series (a) Classification
(c) Continuous series (b) Tabulation
(d) Time series (c) Angle
15. Frequency of a variable is always in: (d)Percentage
(a) Fraction form 22. A pie diagram is represented by a:
(b) Percentage form (a) Rectangle
(c) Less than form (b) Circle
(d) Integer form (c) Triangle
16. Data arranged in ascending or descending (d) Square
order of magnitude is called: 23. A sector diagram is also called:
(a) Ungrouped data (a) Bar diagram
(b) Grouped data (b) Histogram
(c) Discrete frequency distribution (c) Historigram
(d) Arrayed data (d) Pie diagram
17. A series of data with exclusive classes along with 24. Which of the following is not a onedimensional
the corresponding frequencies is called: diagram? (a) Simple bar diagram
(a) Discrete frequency distribution (b) Multiple bar diagram
(b) Continuous frequency distribution (c) Component bar diagram
(c) Percentage frequency distribution (d) Pie diagram
(d) Cumulative frequency distribution 18. The number of 25. Which of the following is a twodimensional
classes in a frequency distribution is obtained by diagram:
dividing the range of variable by the: (a) Total (a) Sub-divided bar
frequency (b) Percentage component bar chart
(b) Class interval (c) Sub-divided rectangles
(c) Mid-point (d) Multiple bar diagram
(d) Relative frequency 26. Pie diagram represents the components of a
19. The largest and the smallest values of any given factor by: (a) Circles
class of a frequency distribution are called: (a) (b) Sectors
Class Intervals (c) Angles
(b) Class marks (d) Percentages
(c) Class boundaries 27. The suitable diagram to represent the data
(d) Class limits relating to the monthly expenditure on different
20. The arrangement of data in rows and columns items by a family is:
is called: (a) Historigram
(a) Classification (b) Histogram
(b) Tabulation (c) Multiple bar diagram
(c) Frequency distribution (d) Pie diagram
(d) Cumulative frequency distribution 28. A graph of time series or historical series is
called: (a) Histogram
21. Diagram are another form of: (b) Historigram
(c) Frequency curve (c) Class intervals
(d) Frequency polygon (d) Class marks
29. The historigram is the graphical presentation of 37. Frequency polygon can be drawn with the help
data which are classified: of:
(a) Geographically (a) Historigram
(b) Numerically (b) Histogram
(c) Qualitatively (c) Circle
(d) According to time (d) Percentage
30. Historigram and histogram are: 38. The graph of the cumulative frequency
(a) Always same distribution is called:
(b) Not same (a) Histogram
(c) Off and on same (b) Frequency polygon
(d) Randomly same (c) Pictogram
31. A distribution in which the observations are (d) Ogive
concentrated at one end of the distribution is called 39. In a cumulative frequency polygon, the
a: cumulative frequency of each class is plotted
(a) Symmetric distribution against:
(b) Normal distribution (a) Mid-point
(c) Skewed distribution (b) Lower class boundary
(d) Uniform distribution (c) Upper class boundary
32. For graphic presentation of a frequency (d) Upper class limit
distribution, the paper to be used is: 40. A frequency polygon is a closed figure which is:
(a) Carbon paper (a) One sided
(b) Ordinary paper (b) Two sided
(c) Graph paper (c) Three sided
(d) Butter paper (d) Many sided
33. Histogram can be drawn only for: 41. Ogive curve can be occurred for the
(a) Discrete frequency distribution distribution of:
(b) Continuous frequency distribution (a) Less than type
(c) Cumulative frequency distribution (d) (b) More than type
Relative frequency distribution (c) Both (a) and (b)
35. Histogram is a graph of: (d) Neither (a) and (b)
(a) Frequency distribution (e) 42. The word ogive is also used for:
(b) Time series (a) Frequency polygon
(c) Qualitative data (b) Cumulative frequency polygon
(d) Ogive (c) Frequency curve
36. Histogram and frequency polygon are two (d) Histogram
graphical representations of: 43. Cumulative frequency polygon can be used for
(a) Frequency distribution the calculation of:
(b) Class boundaries (a) Mean
(b) Median b) Component bar chart
(c) Mode c) Unequal width histogram
(d) Geometric mean
d) Lorenz
44. Which of the following would be MOST suitable
for displaying the proportions of a city’s budget
spent on different items?
(A) Pie chart
(B) Bar chart
(C) Line graph
(D) Histogram
The number of times a certain event has happened
is called:
a) Fate
b) Tally
c) Frequency
d) Probability
Data that can take only a limited number of
different values is called: a) Continuous Data
b) Finite data
c) Limited data
d) Discrete Data
Data that can take an infinite number of different
values is called:
a) Discrete Data
b) Unlimited data
c) Infinite data
d) Continuous Data
A histogram is represented by:
a) A component bar chart
b) A multiple bar charts
c) A simple bar charts
d) An adjoining bar charts
A straight-line diagram representing the same area
as a histogram is:
a) A frequency curves
b) An unequal width histogram
c) An equal width histogram
d) A frequency polygon
‘Frequency density’ is calculated in the case of:
a) Equal width histogram