Professional Documents
Culture Documents
Week 2
Agricultural
Statistics
(Stat 1e)
1
Table of Contents
Learning Module 1
INTRODUCTION
..... ii
Definition
............ 11
Misuse of Statistics
............................................ 13
Summation Formulas
............ 17
Mid Term
Submit your outputs on time.
Submission is the on schedule of
module retrieval. (see due date in
google classroom)
0
Learning Outcomes:
At the end of the unit, the students are expected to:
1. Define and discuss the different fields of Statistics
2. Discuss and familiarize the uses of statistics
3. Trace back the history of statistics and the people behind its
science.
4. Explain the use, abuse and misuse of statistics
5. Calculate using Summation Forrmulas
1
HISTORY OF STATISTICS ON TIME LINE
Note: BCE ~ Before Common Era (or BC ~ Before Christ); CE ~ Common Era (or
AD ~ Anno Domini)
2
methods of collecting, organizing,
and summarizing data.
24 April 1620 April 1674 John Established the first English school
– 18 April Graunt of political arithmetic, a scientific
1674 school much closer to the modern
understanding of Statistics. John
Graunt was one of the first
demographers and expert in
epidemiology.
27 May 1623 William Petty Sir William Petty, an English
– 16 economist, scientist and professor.
December He suggested efficient methods to
1687 survey the land that was to be
confiscated and given to
Cromwell's soldiers. He is best
remembered for his contribution in
economics and political arithmetic.
He is known for having started the
philosophy of 'laissez-faire' in
relation to government activity.
1701-7 April Thomas Bayes Thomas Bayes is well known for
1761 the Theorem called Bayes'
Theorem. Bayes never published
what would eventually become his
most famous accomplishment; his
notes were edited and published
after his death by Richard Price.
20 October Gottfried Gottfried Achenwall was a German
1719 – 1 Achenwall philosopher and statistician. He is
May 1772 considered among the inventors of
the term ―statistics‖. He first
began to read a new course
"statistics" in the University of
Göttingen, which explained how the
state was arranged. He is
considered as father of Statistics.
19 June Blaise Pascal Pascal was a famous
1623 – 19 mathematician who helped create
August 1662 two major new areas of research:
projective geometry and probability
theory.
17 August Pierre de Fermat Fermat's favourite subject was the
1601 – 12 theory of numbers. He along with
3
January Pascal, founded the theories of
1965 probability. The mathematical
theory of probability has its roots
through games of chance and
gambling.
8 February Daniel Bernoulli Daniel Bernoulli FRS was a Swiss
1700 – 17 mathematician and physicist and
March 1782 was one of the famous
mathematicians in the Bernoulli
family. The Bernoulli principle is
credited to him. The principle
describes the mathematics of the
mechanism underlying the
operation of two important
technologies the carburetor and the
airplane wing.
23 March Pierre-Simon Laplace established fundamentals
1749 – 5 Laplace of statistics in the book théorie
March 1827 analytique des probabilités. The
treatise discussed probability
methods and problems and
statistical methods and
applications, normal curve,
regression through study of
astronomy.
30 April 1777 Carl Freidrich Carl Friedrich Gauss made
– 23 Gauss tremendous contribution in many
February fields of mathematics and science
1855 and is considered as one of most
influential mathematicians of all
time. In the area of probability and
statistics, Gauss introduced which
is now known as Gaussian
distribution, the Gaussian function
and the Gaussian error curve.
22 February Adolphe Quetelet Adolphe Quetelet was a Belgian
1796 – 17 astronomer, mathematician,
February statistician and sociologist. At that
1874 time, the science of probability and
statistics was mainly applied in
astronomy.
4
16 February Francis Galton Galton studied genetic variation in
1822 – 17 humans through regression and
January correlation.
1911;
27 March Karl Pearson Karl Pearson is considered to be
1857 – 27 the father of modern statistics
April 1936 which emerged from his seminal
work in mathematical biology and
biometry. He has laid foundation to
the discipline of mathematical
statistics.
10 Charles Edward He pioneered the idea of factor
September Spearman analysis and Spearman's rank
1863 – 17 correlation coefficient. In statistics,
September Spearman developed rank
1945 correlation (1904) is a non-
parametric version of the Pearson
correlation.
5 August Wesley Clair Wesley Clair Mitchell is well known
1874 – 29 Mitchell for his empirical work on business
October cycles and for guiding the National
1948 Bureau of Economic Research in
its first decades.
13 June William Sealy William Sealy Gosset, an English
1876 – 16 Gosset (known statistician published under the pen
October under the name Student, and developed the
1937 pseudonym Student's t-distribution. Studentized
Student) residuals are named in Student's
honour because, like the problem
that led to Student's t-distribution,
the idea of adjusting for estimated
standard deviations is central to
that concept.
12 April 1878 Kirstine Smith Smith developed minimum chi-
– 11 squared estimation of the
November correlation coefficient. She initiated
1939 research on optimal design theory
where she computed G-optimal
designs for polynomial regression
of order up to 6, and explicitly
obtained some of these designs.
5
17 February Ronald A. Fisher Sir Ronald Aylmer Fisher was an
1890 – 29 English mathematician, geneticist,
July 1962; statistician, evolutionary biologist
and eugenicist. His breeding
experiments conducted at
Rothamsted Experimental Station
led to the theories of gene
dominance and fitness and
selection.
22 April 1891 Harold Jeffreys Sir Harold Jeffreys, FRS was an
– 18 March English mathematician, statistician,
1989 geophysicist, and astronomer. He
published his book Theory of
Probability in 1939. The book
revived the Bayesian view of
probability.
02 Frank Wilcoxon Wilcoxon developed Non-
September parametric Wilcoxon signed-rank
1892 – 18 test and the Wilcoxon ranksum test.
November
1965
29 June June 1972 P C Prasanta Chandra Mahalanobis is
1893 – 28 Mahalanobis best known for the Mahalanobis
June 1972 distance. He made pioneering
studies in anthropometry in India
and contributed to the design of
large-scale sample surveys. He is
founder of Indian Statistical
Institute.
16 April 1894 Jerzy Neyman Jerzy Neyman was a Polish
– 5 August mathematician and statistician. He
1981 first introduced the concept of a
confidence interval into statistical
hypothesis testing and in
collaboration with Egon Pearson,
he co-devised null hypothesis
testing and presented
NeymanPearson lemma, the basis
of hypothesis testing.
19 June R C Bose Raj Chandra Bose was an Indian
1901 – 31 American mathematician and
October statistician best remembered for his
1987 work in design theory and the
6
theory of error-correcting codes in
which the class of BCH codes is
partly named after him.
12 May 1902 Frank Yates Yates mainly worked on the design
– 17 June of experiments and made many
1994 contributions to the theory of
analysis of variance, the Yates's
algorithm and the balanced
incomplete block design.
31 October Abraham Wald Abraham Wald was an Austrian
1902 – 13 mathematician who contributed to
December geometry, economics,
1950 econometrics and seasonal
movements in time series. He
pioneered the concept of statistical
sequential analysis.
25 April 1903 Andrey Andrey Nikolaevich Kolmogorov
– 20 October Kolmogorov was a 20th-century Soviet
1987 mathematician who made
significant contributions to the
mathematics of probability theory,
algorithmic information theory and
computational complexity.
Kolmogorov and the British
mathematician Sydney Chapman
independently developed
Chapman– Kolmogorov equations
in the field of stochastic processes.
06 Maurice George Sir Maurice George Kendall was a
September Kendall British statistician. His main
1907 – 29 contribution is the Kendall tau rank
March 1983 correlation which is named after
him. Around 1939 he along with
Bernard Babington-Smith,
developed one of the first early
mechanical devices to produce
random digits and formulated a
series of tests for statistical
randomness in a given set of digits.
19 March J. Wolfowitz Wolfowitz's main contributions were
1910 – 16 in the fields of statistical decision
July 1981 theory, nonparametric statistics,
7
sequential analysis, and
information theory.
27 July 1911 P V Sukhatme Pandurang Vasudeo Sukhatme
– 28 January was an Indian statistician who
1997 during his early days in late 1930’s
came under the influence of
eminent authorities of that era Sir
R. A. Fisher, Jerzy Neyman and E.
S. Pearson. His two major
contributions were to bipartitional
functions, for which he worked
under the guidance of Sir R. A.
Fisher and the contributions to the
theory of the representative
method, for which he worked under
the guidance of J. Neyman and
E.S. Pearson. Prof. Sukhatrme also
made important contribution to the
problem of plot-size in large scale
yield surveys, in general and use of
small size plots in yiled surveys, in
particular. Prof. Sukhatme
developed statistical models for
assessing the dimensions of
hunger and future food supplies for
the world.
05 W. Allen Wallis Wilson Allen Wallis was an
November American economist and
1912 – 12 statistician. He was president of the
October University of Rochester. He along
1998 with William Kruskal presented the
Kruskal– Wallis one-way analysis
of variance.
16 June John Tukey John Wilder Tukey has made
1915 – 26 numerous contributions in the field
July 2000 of statistics. He was an American
mathematician best remembered
for development of the FFT
algorithm and box plot. His major
contributions are the Tukey range
test, the Tukey lambda distribution,
the Tukey's test of additivity, and
the Teichmüller–Tukey lemma.
8
03 January Sir David John David Finney’s main contribution is
1917 - Finney on probit analysis and biological
assays in pharmacology and
pencilin assays in forestry. He was
a pioneer in the development of
systematic monitoring of drugs for
detection of adverse reactions, an
undesired harmful effect resulting
from a medication or other
intervention like surgery. He
worked on a table of logarithms to
the base of 2. He first introduced
the concept of fractional replication.
10 October William Kruskal William Henry Kruskal was an
1919 – 21 American mathematician and
April 2005 statistician. He is best remembered
for developing the widely used non-
parametric test Kruskal–Wallis
oneway analysis of variance in
collaboration with W. Allen Wallis.
10 - C R Rao Calyampudi Radhakrishna Rao,
September popularly known as C R Rao has
1920 made outstanding contributions in
statistics. His path-breaking
contributions are the Cramér-Rao
bound and the Rao-Blackwell
theorem. Rao also introduced
second-order efficiency, which
initiated studies on higher order
asymptotics. Rao introduced a new
asymptotic test, termed as Rao’s
Score Test, as an alternative to the
likelihood ratio and Wald tests, the
three known as holy trinity.
25 January Jack Carl Kiefer Kiefer is one of the pioneer in
1924 – 10 optimal experimental design theory.
August 1981 The American Statistician obituary
calls him "undoubtedly the foremost
worker in optimal experimental
design".
15 July 1924 Sir David Roxbee D R Cox, may be considered as
- Cox one of the world’s leading living
statisticians. He has made several
9
important contributions in
numerous areas of statistics and
applied probability. He presented
ground breaking proportional
hazards model which is widely
used in the analysis of survival
data.
June 1933 – J N Srivastava JN Srivastava has immensely
13 contributed in the development of
November Statistics. His major contributions
2010 towards statistics are in design of
experiments. Some notable
contributions in this field are the
mixed factorial, search linear
models and search designs, which
is a path breaking research, self
relocating designs, etc.
May 23, Jayanta K Ghosh Jayanta Ghosh has made
1937 - monumental contributions towards
Bayesian inference and Bayesian
non-parametrics, asymptotics,
modeling and model selection,
invariance in testing and
estimation, high dimensional data
analysis, non-parametric regression
and density estimation, survival
analysis, statistical genetics,
multiple testing, mixture models,
etc. His outstanding contributions
include Bahadur-Ghosh-Kiefer
representation and the Ghosh-
Pratty identity.
May 24, Bradley Efron Bradley Efron is known for
1938- introducing the bootstrap
resampling technique. The
bootstrap technique has made a
significant impact in the field of
applied statistics. It is one of the
first computer-intensive statistical
techniques which has the capability
to replace traditional algebraic
derivations.
10
Definition
Etymology
The term statistics is ultimately derived from the New Latin statisticum
collegium ("council of state") and the Italian word statista ("statesman"
or "politician"). The German Statistik, first introduced by Gottfried
Achenwall (1749), originally designated the analysis of data about
the state, signifying the "science of state" (then called political
arithmetic in English). It acquired the meaning of the collection and
classification of data generally in the early 19th century. It was introduced
into English in 1791 by Sir John Sinclair when he published the first of
21 volumes titled Statistical Account
of Scotland.[1]
Thus, the original principal purpose
of Statistik was data to be used by
governmental and (often centralized)
administrative bodies. The collection
of data about states and localities
continues, largely through national
and international statistical services.
In particular, censuses provide
frequently updated information about
the population.
The first book to have 'statistics' in its
title was "Contributions to Vital
Statistics" (1845) by Francis GP
Neison, actuary to the Medical Invalid
and General Life Office.
Statistics
11
Field of Statistics
1. Descriptive Statistics
2. Inferential statistics
Uses of Statistics
it generally helps people answer questions and make decisions
about many things
Assess student’s performance
Determine attitudinal patterns, causes and effects of
misbehavior
Analyze a wide range of data
Monitor status of customers, employees, orders
Validate or test a claim or inferences
It is the excellent basis for forming conclusions
12
Uses of Statistics in Different Fields
In psychology:
Statistical tools are used to organize data on intelligence
scores, attitudes, personality traits, ratings, aptitudes,
values etc.
In the government:
Various records are collected, organized and analyzed
statistically for intelligent policy –making.
Statistics is a very important tool in researches and studies
The study of statistics requires primarily the understanding
of basic concepts, symbols and mathematical notions.
Misuse of Statistics
Evil intent on the part of dishonest people
Unintentional errors on the part of people who don’t know any better.
The study of statistics requires primarily the understanding of basic
concepts, symbols and mathematical notions.
13
GRAPHS
14
PICTOGRAPHS
1. One company has a growth rate (in sales) of 60%% while the
other company has a growth rate of only 20%.
Does this mean that the first company has bigger sales than
the other one? *** not necessarily***
For example: last year, the sales of the first company was
$1M and this year is $1.6M (60%)
The sales of the second company last year was $10M and
this year is $12M (20%)
15
In terms of the amount gained, he 1st company’s ain is
$0.6M while that of the second is $2M
. 2. A car manufacturer might claim that 90% of all the cars sold in the last
10 years are still on the roads.
This may convey the impression that heir cars are well-built.
He might have intentionally left out the fact that 90% of the cars they
had sold were only during the past 4 years.
3. A claim based on the result of a sample: “70% of the Filipinos did not
favour the impeachment of Corona”.
Points to Consider:
16
This implies that statistics can be misused or even abused.
The user must always exercise care:
To see to it that he/she uses statistics in away as to be able
to recognize distorted data.
To learn to interpret the results appropriately.
Summation Formulas
Summation, ∑x
The summation of a variable , say X, ∑x is a value as a result of adding
all the elements of X as in a one column table or a single dimension array.
𝑖=𝑛
The same as
∑ 𝑋𝑖 𝑜𝑟 ∑𝑋
1
where: ∑𝑖=𝑛
𝑖=1 𝑋𝑖 = 𝑥1 + 𝑥2 + ⋯ . +𝑋𝑛 are the elements of X
X This means
that
So, n= 5 4 X1 = 4
2 X2 = 2
1 X3 =1
1 X4 = 1
2 X5 = 2
𝑖=5
∑ 𝑋𝑖 = 𝑋1 + 𝑋2 + 𝑋3 + 𝑋4 + 𝑋5
𝑖=1
17
∑ 𝑋𝑖 = 4 + 2 + 1 + 1 + 2
∑ 𝑋 = 𝟏𝟎
The sum of squares of a variable, say X is the sum of all the squares of the
elements of X. That is squaring each value of X then get the sum
The sum of all the element of X; squaring each value of X then getting the
sum.
𝑖=𝑛
read as “ The summation of xi squared from I
∑ 𝑥𝑖²
equals to 1 to I equals n”
𝑖=1
same as
∑ 𝑥𝑖² 𝑜𝑟 𝑠𝑖𝑚𝑝𝑙𝑦 ∑ 𝑥²
1
∑𝑖=𝑛
𝑖=1 𝑥𝑖² = x 1 + x 2 +… + x n
2 2 2
= 42+22+12+12+22
= 16 + 4+ 1+ 1 + 4
∑ 𝑥 2 = 𝟐𝟔
18
The Square of a Sum, (∑x)²
(∑x)² = (4 +2+1+1+2)2
(∑x)² = (10)2
(∑x)² = 100
19
Example:
X Y
4 3
2 2
1 2
1 1
2 1
∑ 𝑋 = 𝟏𝟎 ∑ 𝑌=𝟗
∑x∑y = (10)(9)
∑x∑y = 90
Examples:
X Y XY
4 3 12
2 2 4
1 2 2
1 1 1
2 1 2
20
∑XY = (4)(3)+ (2)(2) + (1)(2) + (1)(1) + (2)(1)
= 12+ 4+ 2+1+2
= 21
𝑖=𝑝 𝑗=𝑟
The summation of Xij from I equals 1 to p and j
∑ ∑ 𝑋𝑗𝑖
𝑖=1 𝑗=1
equals 1 to r”
∑ ∑ 𝑋𝑗𝑖 𝑜𝑟 𝑠𝑖𝑚𝑝𝑙𝑦 ∑ ∑𝑋
1 1
𝑝 𝑟
21
= 𝑋21 + 𝑋22 + ⋯ 𝑋2𝑟 +
In a two-way table
p = no. of rows
r = no. of columns
∑∑X = sum of all values in a two-
way table
A two-way table
… … … …
2 1 3
4 5 4
1 1 2
2 2 1
So, p= 4, r = 3
X11 = 2 x 12 = 1 x13 = 3
22
x21 = 4 x22 =5 x23 = 4
x31 = 1 x32 = 1 x33 = 2
x41 = 2 x42 = 2 x43 = 1
4 3
∑ ∑ 𝑋𝑖 = (2 + 1 + 3) +
(4 +5 + 4) +
( 1+ 1 + 2) +
( 2+ 2 + 1)
∑∑X = 28
Summary:
Assessment:
Task 1. Discussion
In two to three sentences answer the following questions:
23
1. Why it is important to know to historical development of statistics?
2. In this present scenario, cite one event that depicts the use of
statistics.
3. In what instances, statistics could be abused or misused?
8 9 3 5
3 3 8 9
3 5 4 2
8 9 10 8
13 13 15 12
3 2 1 3
8 9 10
13 13 15
3 2 1
24
4. Given the table below, compute the following expressions:
a. Correction Factor – CF = (∑Yij )2/pr
b. Total SS = ∑Y2ij – (∑Yij )2/pr
Show your solution considering that i=1 to p, j=1 to r and that
p = 4, r= 3
2 2 3
5 5 6
10 10 1
1 2 1
References:
Prepared by:
JESSA D. PABILLORE
jessapabillore916@gmail.com
09179869017
25