ASSIGNMENT#1
Roll No CA651432
ASSIGNMENT No. 1
(Unit: 1-5)
Question No#1 What do you understand by statistics? What are the characteristics
Answer:
Statistics Definition
Statistics is a branch of knowledge that deals with every aspect of data. Statistical knowledge
helps one choose the proper method of collecting data and employ those samples in the
correct analysis process, in order to produce results effectively. In short, statistics is a
crucial process that helps in making decisions based on data. The
primary purpose of using statistics is to plan the collected data in terms of experimental
designs and statistical surveys. Statistics is considered a mathematical science
Statistics is a vast subject with applications in a wide variety of fields. The word “statistics” is
derived from the Latin word “status”, which means a political state. Statistics is a branch of
knowledge that deals with facts and figures. The term statistics refers to a set of
methods and rules for organizing, summarizing, and interpreting information. It is a
Functions of Statistics
Every day we encounter hundreds of thousands of pieces of information that are often
vague, indefinite and unclear. When such pieces of information undergo certain statistical
techniques and are represented in the form of tables or figures, they represent things in a
In our everyday life and in research as well, we often get large amounts of information. To get
a clear picture, statistics helps us either by simplifying such information through taking a few
figures to serve as a representative sample, or by taking an average to present a bird's-eye
view of the large masses. Complex data may be simplified by presenting them in the
form of tables, graphs or diagrams, or by representing them through an average, etc.
Often in research, things become clearer and more significant when they are compared with
others of the same kind. Comparison between two different groups is made possible by
certain statistical techniques, such as averages, coefficients, rates, ratios, and so on.
As human beings, our knowledge is limited to what we can observe and see, and that
is a very small part of the ocean of knowledge. Statistics extends our knowledge and
numerical investigations. For example, we hear daily and also have a general impression that
However, to know to what extent the increase has occurred, and how far the rise
in prices has affected different income groups, it would be necessary to have a
Statistics enables us to make correct decisions, whether they are taken by a
intelligence or statistical bureau, the work of which is to collect, analyze and coordinate figures
for formulating future policies of the firm regarding production and sales.
CHARACTERISTICS OF STATISTICS
Facts that can be studied in relation to time, place or frequency may be referred to
as statistics. A single isolated and unconnected fact or figure is not statistics, because
we cannot study it in relation to other facts and figures. Only an aggregate of facts, e.g.
academic achievement of the students, I.Q. of a group of students, weight of students
Phenomena may be affected by many factors. We cannot study the effect of one factor on
a phenomenon only by ignoring the others. To have a true picture, we have to
study the effects of all factors on the phenomenon separately as well as collectively,
because the effects of the factors can change with a change of place, time or situation.
measurement of facts can be made possible. It means that the information or the fact must be in
quantitative form, such as achievement scores of 60, 50, 85, 78, and 91 out of 100. If it is not in
accuracy
For a clear picture of the phenomenon under investigation, it should be studied using a
reasonable standard of accuracy, depending upon the nature and purpose of the data
collection. Data collection should be free from personal prejudices and biases. Biased and
completely systematic manner. Any rough and haphazard method of collection will not be desirable for
Before collecting data, the investigator/researcher must have a purpose and then
collect data accordingly. Data collected without any purpose is of no use. Without a
clear idea about the purpose, we will not be in a position to distinguish between necessary data
Statistics is a technique for the purpose of comparison, etc. The data should be capable of being
compared; otherwise, it will lose much of its significance. Comparison can be made only if
the data are homogeneous. It is with the use of comparison only that we are able
to illustrate changes which may relate to time, place, frequency or some other
Statistics is important in our daily life. We live in the information world, and much of this information
is determined mathematically with the help of statistics. It means that statistics keeps us informed
about daily happenings. The importance of statistics in our daily life is discussed
I. Every day we watch weather forecasts. They are possible because of computer models based on
statistical concepts. These models compare prior weather with the current weather and
II. Statistics is frequently used by researchers. They use statistical techniques to collect relevant
data. Otherwise there can be a loss of money, time and other resources.
III. In the business market, statistics plays a major role. Statistical techniques are the key to
how traders and businessmen invest and make money. Additionally, in industry, these
EDUCATIONL STATISTICS (8614)
tools are used in quality testing. Production managers are always interested in
finding out whether the product conforms to the specification or not. They use statistical
IV. Statistics also has a big role in the medical field. Before any drug
is behind all other medical studies. Doctors predict diseases on the basis of
statistical concepts.
V. Print and electronic media use statistical tools to make predictions of the winner of
VI. Statistics has been widely used in psychology and education to determine the reliability and
VII. Apart from the above, statistics has wide application in marketing, production, finance,
Question No#2 What do you understand by the term “data”? Write in detail the
types of data.
Answer:
Definition of Data
The term “data” refers to the kind of information a researcher obtains to achieve the objectives of his
research. All research processes begin with the collection of data, which plays a significant role
in statistical analysis. The term is used in different contexts. In general, it indicates
facts or figures from which conclusions can be drawn. Or it is the raw material from which
information is obtained. Data are the actual pieces of information that you collect through your
study. In other words, data can be defined as a collection of facts and details like text,
figures, observations, symbols, or simply descriptions of things, events or entities, gathered with a
view to drawing inferences. It is raw fact which must be processed to get
information.
TYPES OF DATA
In research, different methods are used to collect data, all of which fall into two categories, i.e.
primary data and secondary data. It is a common classification based upon who
Primary Data
Primary data, as the name suggests, is data that is collected for the first time by the researcher
himself. Primary data is originated by the researcher for the first time
for addressing his research problem. It is also called first-hand raw data. The
data can be collected using various methods like surveys, observations, physical testing,
mailed questionnaires, questionnaires filled and sent by enumerators, personal interviews,
Secondary Data
Secondary data points toward second-hand data already collected and recorded by some
other person with a purpose not relating to the current research problem. It is a readily
available form of data and saves the time and cost of the researcher. But because the
data is collected for a purpose other than the problem under investigation, the usefulness
of the data may be limited in a number of ways, like relevance and accuracy. Also, the objectives and
methods adopted to collect the data may not be suitable to the current situation.
Therefore, the researcher should be careful when using secondary data. Examples of secondary
data are census data, publications, internal records of organizations, reports, books, journal articles,
Primary data refers to the data originated by the researcher for the first time. Secondary
data is already existing data, collected by other researchers, agencies, and organizations.
Primary data is real-time data, whereas secondary data is data that relates to the past.
Primary data is collected to address the problem at hand, while the purpose behind the collection
Collection of primary data is a laborious process. On the other hand, collection of secondary data is easy and
fast.
Sources of primary data are surveys, observations, physical testing, mailed questionnaires,
focus group discussions, case studies, etc. On the other hand, sources of secondary data are
census data, publications, internal records of organizations, reports, books, magazine articles,
Collection of primary data requires a large amount of resources like time, cost, and manpower.
Primary data is specific to the researcher’s needs. He can control the quality of research.
However, secondary data is neither specific to the researcher's needs nor does he have control over
Primary data is available in raw form, while secondary data has undergone some
Data gathered from primary sources are more reliable and accurate than those from secondary
sources.
Data can be defined as a systematic record of a particular quantity. It is the different values of
that quantity represented together in a set. It is a collection of facts and figures to be used for
a specific purpose such as a survey or analysis. When arranged in an organized form, it may be
referred to as information.
Data may be qualitative or quantitative. Once you understand the difference between them, you
Qualitative data: they represent some characteristics or attributes. They depict descriptions
using the students of your class as a sample would be classified as qualitative.
Quantitative data: these can be measured and not simply observed. They can be
numerically represented and calculations can be performed on them. For example, data on the
number of students playing different sports in your class gives an estimate of
how many of the total students play which sport. This data is numerical and can be
classified as quantitative.
Cross-sectional data
Cross-sectional data is a type of data collected by observing many subjects (such as
individuals, firms, countries, or regions) at the same point in time, or without regard to
differences in time.
It is the data for a single time point or a single point in space.
This type of data is limited in that it cannot describe changes over time or cause
Categorical data
Categorical variables represent types of data which may be divided into groups. Examples of
categorical variables are race, sex, age group, and educational level.
Data which cannot be measured numerically is referred to as categorical data.
A data set consisting of observations on a single characteristic is a univariate data set. A
univariate data set is categorical if the individual observations are categorical responses.
Time-series data
Time-series data occurs wherever the same measurements are recorded on a
regular basis.
Quantities that represent or trace the values taken by a variable over a period
The values of various phenomena such as temperature, weight, population, and so
Data recorded according to time intervals is known as time-series data, e.g. population in a
Spatial data
Also referred to as geospatial data or geographic information, it is the data or information that identifies the
geographic location of features and boundaries on Earth, such as natural or built
Spatial data is usually stored as coordinates and topology and is data that can be mapped.
Spatial data is used in geographical information systems and other geolocation or positioning
services.
Spatial data consists of points, lines, polygons and other geographic and geometric data
primitives, which can be mapped by location, stored with an object as metadata or used by
Spatial data may be classified as scalar or vector data. Each provides distinct information
convey the meaning? Write down the advantages and drawbacks of using pictograms.
Answer:
Definition of Pictograms
A pictogram is a graphical symbol that conveys its meaning through its pictorial resemblance to a
physical object. A pictogram may include a symbol plus graphic elements such as a
border, back pattern, or color that is intended to convey specific information. We can also say that a
pictogram is a kind of graph that uses pictures instead of bars to represent the data
under analysis.
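The idea of a graph that uses pictures instead of bars can be sketched in a few lines of Python. This is only an illustration: the function name text_pictogram, the "*" symbol, and the sport counts are assumptions made for the example, not data from the assignment.

```python
def text_pictogram(counts, symbol="*", unit=1):
    """Render a plain-text pictogram: one symbol stands for `unit` items."""
    if unit <= 0:
        raise ValueError("unit must be a positive integer")
    width = max(len(label) for label in counts)
    rows = []
    for label, count in counts.items():
        # Whole symbols only; any remainder smaller than one unit is dropped.
        rows.append(f"{label.ljust(width)} | {symbol * (count // unit)}")
    rows.append(f"Key: each {symbol} = {unit} student(s)")
    return "\n".join(rows)


# Hypothetical counts of students per sport (not taken from the assignment).
students_per_sport = {"Cricket": 8, "Football": 6, "Hockey": 4}
print(text_pictogram(students_per_sport, symbol="*", unit=2))
```

The key line matters as much as the rows: without it a reader cannot tell how many units each symbol represents.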
represents the frequency of data as pictures or symbols. Each picture or symbol may
represent one or more units of data. Pictograms form a part of our daily
lives. They are used in transport, medicine, education, computers, etc.
They indicate, in iconic form, places, directions, actions or constraints on actions in both the
figurative nature directly to indicate an object or to express an idea. Pictograms can fulfil many
functions. They are used to replace written indications and instructions expressing regulatory,
mandatory, warning and prohibitory information, when that information must be processed quickly (e.g.
road traffic signs), when users speak different languages (i.e. non-natives),
have limited linguistic ability (e.g. people with low levels of literacy or little
education), or have visual problems (e.g. older people), and especially when there is a legal
obligation to inform, and for the user to comply with the information, especially for safety purposes (e.g.
A pictogram needs to capture users' attention (users need to see the pictogram), to improve
users' comprehension of warnings (users need to understand it), and it also needs to
danger.
CHARACTERISTICS OF PICTOGRAM
1. Should be self-explanatory.
5. Should be recognizable as a set, through uniform treatment of scale, style and subject.
6. Should be highly visible, easy to reproduce in any scale and in positive or negative
form.
7. Should not be dependent upon a border and should work equally well in
8. Should avoid stylistic fads or a commercial appearance and should be meaningful to a wide target
9. Should be attractive when used with their layout elements and typestyles
Pictograms come up quite often when visualizing data in infographics, reports,
presentations, or even resumes. You can use a pictogram whenever you want to make simple
Whether you want to show the magnitude of an important statistic or visualize a fraction or
percentage, you can use pictograms to add visual impact to simple data.
ADVANTAGES OF PICTOGRAMS
They can improve warning comprehension for people with visual or literacy difficulties.
They have the potential to be interpreted more accurately and more quickly than
words.
They can be recognized and recalled far better than words.
DISADVANTAGES OF PICTOGRAM
Even well-understood pictograms will not be interpreted in the same way by all
groups of people and across all cultures, and it takes years for any pictogram to reach
maximum effectiveness.
They have the potential to be interpreted with the opposite or an otherwise undesired meaning
Question No#4 Define normal curve. Write down the properties of normal curve.
Answer:
NORMAL CURVE
One way of figuring out how data are distributed is to plot them in a graph. If the data is evenly
distributed, our graph will come out as a curve. In statistics this curve is known as a normal curve, and
in social sciences it is called the bell curve. A normal or bell-curved distribution of data may
naturally occur in several possible ways, with a number of possibilities for the
standard deviation (which can be any positive value). A standard normal curve has a mean
of 0 and a standard deviation of 1. The larger the standard deviation, the flatter the curve will be, and vice versa.
The empirical rule tells you what percentage of your data falls within a certain number of
standard deviations of the mean:
• About 68% of the data falls within one standard deviation of the mean.
• About 95% of the data falls within two standard deviations of the mean.
• About 99.7% of the data falls within three standard deviations of the mean.
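The three percentages of the empirical rule can be checked numerically. The sketch below assumes an exactly normal distribution and uses the error function from Python's standard math module; the helper name normal_coverage is chosen here for illustration only.

```python
import math


def normal_coverage(k):
    """Fraction of a normal distribution lying within k standard deviations of the mean."""
    return math.erf(k / math.sqrt(2))


for k in (1, 2, 3):
    print(f"within {k} standard deviation(s): {normal_coverage(k):.4f}")
# within 1 standard deviation(s): 0.6827
# within 2 standard deviation(s): 0.9545
# within 3 standard deviations(s) is about 0.9973
```

The rounded figures 68%, 95% and 99.7% quoted in the rule are approximations of these values.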
A normal distribution is symmetric about the peak of the curve, where the mean is. Because of
this, most of the observed data is clustered near the mean, while the data become less frequent
farther away from the mean. The resulting graph appears bell-shaped, with the mean,
median, and mode having the same value and appearing at the peak of the curve.
The graph is perfectly symmetric, such that, if you fold it at the middle, you will get two equal
halves, since one-half of the observable data points fall on each side of the
graph.
The two most important parameters of a normal distribution are the mean and standard deviation.
The parameters determine the shape and probabilities of the distribution. The shape of the
1. Mean
The mean is used by researchers as a measure of central tendency. It can be used to describe
the distribution of variables measured as ratios or intervals. In a normal distribution graph, the
mean defines the location of the peak, and most of the data points are clustered around the
mean. Any changes made to the value of the mean move the curve either to the left or right
2. Standard deviation
The standard deviation measures the dispersion of the data points relative to the mean. It
determines how far away from the mean the data points are positioned and represents the distance
On the graph, the standard deviation determines the width of the curve, and it tightens or
expands the width of the distribution along the x-axis. Typically, a small standard deviation
relative to the mean produces a steep curve, while a large standard deviation relative to the
Properties
1. It is symmetric
A normal distribution comes with a perfectly symmetrical shape. This means that the
distribution curve can be divided in the middle to produce two equal halves. The symmetric
shape occurs when one-half of the observations fall on each side of the curve.
The middle point of a normal distribution is the point with the maximum frequency, which means that
it possesses the most observations of the variable. The midpoint is also the point where
these three measures fall. The measures are usually equal in a perfectly (normal) distribution.
3. Empirical rule
In normally distributed data, there is a constant proportion of the area lying under the
curve between the mean and a specific number of standard deviations from the mean. For example,
68.27% of all cases fall within +/- one standard deviation from the mean. About 95% of all
cases fall within +/- two standard deviations from the mean, while about 99.7% of all cases fall within
Skewness and kurtosis are coefficients that measure how different a distribution is from a
normal distribution. Skewness measures the symmetry of a distribution, while
kurtosis measures the thickness of the tail ends relative to the tails of a normal distribution.
exactly half of the values are to the left of the center and half to the right
Known characteristics of the normal curve make it possible to estimate the probability of
occurrence of any value of a normally distributed variable. Suppose that the total area under the
curve is defined to be 1. You can multiply that number by 100 and say there
is a 100 percent chance that any value you can name will be somewhere in the
distribution. Similarly, because half the area of the curve is below the mean and half is
above it, you can say that there is a 50 percent chance that a randomly chosen value will be
above the mean and the same chance that it will be below it.
It makes sense that the area under the normal curve is equivalent to the probability of randomly
drawing a value in that range. The area is greatest in the middle, where the “hump” is, and
thins out toward the tails. That is consistent with the fact that there are more values close to the
When the area of the standard normal curve is divided into sections by standard
deviations above and below the mean, the area in each section is a known quantity. As
explained earlier, the area sections of the curve above and below the mean can be
added together to find the probability of obtaining a value within (plus or minus) a given
number of standard deviations of the mean; the area in each section is the same as the probability of
In order to use the area of the normal curve to determine the probability of occurrence of a given value, the
value must first be standardized, or converted to a z-score. To convert a value to a z-score is
to express it in terms of how many standard deviations it is above or below the mean.
After the z-score is obtained, you can look up its corresponding probability in a table. The
formula to compute a z-score is
z = (x − μ) / σ
where x is the value to be converted, μ is the population mean, and σ is the population standard
deviation.
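The conversion to a z-score and the table lookup described above can be sketched in Python. The function names z_score and probability_below, and the example numbers (a score of 70 with population mean 60 and standard deviation 10), are assumptions made for illustration; the lookup table is replaced here by the standard normal cumulative distribution computed with math.erf.

```python
import math


def z_score(x, mu, sigma):
    """Number of standard deviations that x lies above (+) or below (-) the mean."""
    if sigma <= 0:
        raise ValueError("sigma must be positive")
    return (x - mu) / sigma


def probability_below(z):
    """Area under the standard normal curve to the left of z."""
    return 0.5 * (1 + math.erf(z / math.sqrt(2)))


# Hypothetical example: a score of 70 when the mean is 60 and the SD is 10.
z = z_score(70, mu=60, sigma=10)
print(z)                               # 1.0
print(round(probability_below(z), 4))  # 0.8413
```

Under these assumed values, about 84% of scores would fall below 70.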
Question No#5 Explain procedures for determining median, with one example each at
least, if:
Median
The median is the middle value of rank-ordered data. It divides the distribution into two halves (i.e. 50%
of scores or observations lie on either side of the median value). It means that this value separates
the higher half of the data from the lower half. The purpose of the median is to determine the exact
The median is the middle number in a sorted, ascending or descending, list of numbers and
The median is sometimes used as opposed to the mean when there are outliers in the
If there is an odd number of numbers, the median value is the number that is in the middle,
If there is an even number of numbers in the list, the middle pair must be determined, added
together, and divided by two to find the median value.
When the number of scores is odd, simply arrange the scores in order (from lower to
higher or from higher to lower). The median will be the middle score in the list. Consider
the set of scores 2, 5, 7, 10, 12. The score “7” lies in the middle of the scores, so it is
the median. When there is an even number of scores in the distribution, arrange the scores in order
(from lower to higher or from higher to lower). The median will be the average of the
middle two scores in the list. Consider the set of scores 4, 6, 9, 14, 16, 20. The average of
the middle two scores, 11.5 (i.e. (9 + 14)/2 = 23/2 = 11.5), is the median of the distribution. The median
is less affected by outliers and skewed data and is usually the preferred measure of central tendency
when the distribution is not symmetrical. The median cannot be determined for categorical or
nominal data.
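The claim that the median is less affected by outliers than the mean can be seen with a small Python check using the standard statistics module. The first list is the odd-length score set from the text; the second list, in which 12 is replaced by 120, is a hypothetical variation added only to show the effect of an extreme value.

```python
from statistics import mean, median

scores = [2, 5, 7, 10, 12]                # score set used in the text
scores_with_outlier = [2, 5, 7, 10, 120]  # hypothetical: 12 replaced by an extreme value

print(mean(scores), median(scores))
print(mean(scores_with_outlier), median(scores_with_outlier))
```

The mean moves from 7.2 to 28.8 when the outlier is introduced, while the median stays at 7 in both cases.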
Another way of describing the data is by looking at the middle number. When there are an odd
number of values, we can just find the value so that there is the same number of values above as
there are below this middle value. When there is an even number of values, there is an issue in
that there is not one number that acts as a middle value. Instead, there are two middle numbers such that
there is the same number of values above as below these two middle numbers. As a
compromise, we take the average of these two middle numbers. We call this result the median of
the data set.
Step 1: Put the numbers in numerical order from smallest to largest.
Step 2: If there is an odd number of numbers, locate the middle number so that there is an equal
number of values to the left and to the right. If there is an even number of numbers locate the
two middle numbers so that there is an equal number of values to the left and to the right of these
two numbers.
Step 3: If there is an odd number of numbers, this middle number is the median. If there is an
even number of numbers add the two middles and divide by 2. The result will be the median.
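The steps above can be sketched as a short Python function. The name median_by_steps is chosen here for illustration; the ages used in the usage line are those of the Lupe's family example that appears later in this answer.

```python
def median_by_steps(values):
    """Find the median by sorting and then picking or averaging the middle value(s)."""
    if not values:
        raise ValueError("median of an empty list is undefined")
    # Put the numbers in numerical order from smallest to largest.
    ordered = sorted(values)
    n = len(ordered)
    middle = n // 2
    if n % 2 == 1:
        # Odd count: the single middle number is the median.
        return ordered[middle]
    # Even count: add the two middle numbers and divide by 2.
    return (ordered[middle - 1] + ordered[middle]) / 2


print(median_by_steps([22, 18, 11, 14, 20, 11]))  # 16.0
```

Sorting first is essential: picking the middle position of an unsorted list gives a meaningless result.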
1. Rosa measured the weight in pounds of seven bags of oranges that were purchased
Solution
Step 1: First, put the numbers in numerical order from smallest to largest.
Step 2: Notice that there are 7 numbers. This is an odd number of values, so we locate the
middle number. The middle number is 11. Notice that there are an equal number (3) of numbers
Step 3: We are in the case that there are an odd number of values, so the median is this middle
Example 2: The Jameson family drove through 7 states on their summer vacation. Gasoline
prices varied from state to state. What is the median gasoline price?
$1.79, $1.61, $1.96, $2.09, $1.84, $1.75, $2.11
Example 3: During the first marking period, Nicole's math quiz scores were 90, 92, 93, 88, 95,
88, 97, 87, and 98. What was the median quiz score?
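Examples 2 and 3 both have an odd number of values, so the median is the single middle value after sorting. No worked solution is given for them here; the sketch below is one way to check them, using Python's standard statistics.median, with variable names chosen for illustration.

```python
from statistics import median

gasoline_prices = [1.79, 1.61, 1.96, 2.09, 1.84, 1.75, 2.11]  # Example 2, in dollars
quiz_scores = [90, 92, 93, 88, 95, 88, 97, 87, 98]             # Example 3

print(sorted(gasoline_prices))  # 1.84 is the 4th of 7 sorted values
print(median(gasoline_prices))  # 1.84
print(sorted(quiz_scores))      # 92 is the 5th of 9 sorted values
print(median(quiz_scores))      # 92
```

So the median gasoline price is $1.84 and the median quiz score is 92.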
There are six brothers and sisters in Lupe's family. Their ages are shown below.
Find the median age of the brothers and sisters in Lupe's family.
22, 18, 11, 14, 20, 11
Solution
Step 1: First, put the numbers in numerical order from smallest to largest.
Step 2: Notice that there are 6 numbers. This is an even number of values, so we locate the two
middle numbers. The two middle numbers are 14 and 18. Notice that there are an equal number
(2) of values to the left and to the right of these two middles.
Step 3: We are in the case that there is an even number of values, so the median is the average
of the two middle numbers. Add these two middle numbers and divide by 2: (14 + 18)/2 = 32/2 = 16.
So the median age of the brothers and sisters in Lupe's family is 16.
Example 2: A marathon race was completed by 4 participants. What was the median race time?
2.7, 3.5, 5.1, 8.3
Since there is an even number of items in the data set, we compute the median by taking
the average of the two middle numbers: (3.5 + 5.1)/2 = 8.6/2 = 4.3
MERITS OF MEDIAN
4. Even if the extreme values are not known, the median can be calculated.
DEMERITS OF MEDIAN
For large data sets, arranging the data in increasing order is a really difficult process.