Principal Component Analysis (PCA) Based Indexing: March 2017

See discussions, stats, and author profiles for this publication at: https://www.researchgate.
net/publication/321824664
Principal Component Analysis (PCA) based Indexing
Chapter · March 2017
CITATIONS READS
6 4,372
2 authors:
Darshnaben Mahida Sendhil Ramadas

National Dairy Research Institute ICAR-Indian Institute of Wheat and Barley Research, Karnal, Haryana (India)
12 PUBLICATIONS 37 CITATIONS 207 PUBLICATIONS 956 CITATIONS
SEE PROFILE SEE PROFILE
Some of the authors of this publication are also working on these related projects:
CLIMATE CHANGE-INDUCED PRODUCTION RISK AND ADAPTATION STRATEGIES IN DAIRY FARM HOUSEHOLDS OF ODISHA View project
Multilocational and Multidisciplinary Research Programme on Wheat and Barley Improvement View project
All content following this page was uploaded by Sendhil Ramadas on 27 June 2020.
The user has requested enhancement of the downloaded file.

Citation
Sendhil R, Anuj Kumar, Satyavir Singh, Ajay Verma, Karnam Venkatesh and Vikas Gupta
(2017). Data Analysis Tools and Approaches (DATA) in Agricultural Sciences. ICAR-Indian
Institute of Wheat and Barley Research. pp 1-126.
ISBN
978-93-5300-510-8
Publication
March, 2017
Patron
Director, ICAR-Indian Institute of Wheat and Barley Research, Karnal
Disclaimer
The views expressed in the chapters are the sole responsibility of the respective authors. The
editors bear no responsibility with regard to source and authenticity of the contents.
Data Analysis Tools and Approaches (DATA) in Agricultural Sciences
10. Principal Component Analysis (PCA) based Indexing

Darshnaben Mahida1 and R Sendhil2
1
ICAR- National Dairy Research Institute,Karnal-132001,Haryana
2
ICAR-Indian Institute of Wheat and Barley Research, Karnal-132001, Haryana
Principal Component Analysis (PCA)

PCA is a tool to identify the similarities and dissimilarities pattern in the data. It is an effective
tool in identifying the prototype of high dimension data where we cannot have luxury of
graphical presentation. It facilitate in reducing the dimensions of the data after identifying the
pattern without much loss of information. PCA have many alternative uses among which
assigning the weights while computing a index is one.
Steps for Computing Index using PCA

 Collection of data on relevant variables
 Normalization of variables based on the association with the objective of grouping
 Assignment of weights using PCA
Wi =∑│Lij│Ej
where,
W i is the weight of ith indicator
Ej is the eigen value of the jth factor
Lij is the loading value of the ith unit of grouping on jth factor
i = 1,2,3,…..n indicators
j = 1,2,…. Factors or Principal Components (PCs)
 Index formation
I = ∑ XiWi
∑W i
where,
I is the index of each unit
Xi is the normalized value of ith indicator
W i is the weight of ith indicator
PCA based Indexing: An Illustration

PCA requires a large sample size. It is based on the correlation matrix of the variables
involved, and correlations usually need a large sample size before they stabilize. As a rule of
thumb, a bare minimum of 10 observations per variable is necessary to avoid computational
difficulties.
1. Collect cross section data on different variables across the population for which index
has to be made.
2. Normalization: Since different explanatory variables are measured in different units, in
order to bring them in a comparable range they need to be normalized. There are many
methods for normalization appropriate to different situations.
54 | ICAR-Indian Institute of Wheat and Barley Research

Formula for normalization

a. When a variable have positive influence then
Actual value - Min value
Max value - Min value
b. When a variable have negative influence then

Max value - Actual value
Max value - Min value
3. Run PCA in the software (XL Stat/SPSS/SAS/EVIEWS) using the normalized values of
different variables which will produce Principal Components i.e. variables have
significant variations, eigen values and factor loading values.
For assignment of weights, the PCs having eigen value greater than one have to be
selected. Eigen value represent the variation in the PCs. Standardisation of variable
indicates that each variable have variance of 1. So PCs having variance less one accounts
for less variance then did the original variables (which has variance of 1) and so are of little
use. The sum product of absolute value of factor loading values with their respective eigen
values give the weights for each variable (Formula in step c).
Through PCA we can also have a

scree plot which graphs eigen values
against the PCs. We can see that
after third component, there is not
much difference in the height of
successive bar of the graph,
meaning the each successive PC is
accounting for smaller and smaller
variation to the total variance.

4. Finally a composite index for each unit under consideration can be computed by dividing
the sum product of normalized observed value and their respective assigned weights,
and then divided by sum of the all weights (Formula in step d).
Suggested Readings
Ayyoob, K. C., Krishnadas, M. and Kaeel, C. M. H. (2013). Intra-regional disparities in

agricultural development in Kerala. Agricultural Update, 8: 103-106.
Chotia,V. and Chaudhry,O.(2014). Investing the inter-relationship between regional
infrastructure, poverty and economic growth: A district-wise study of Rajasthan.
International Journal Of Social-Science And Interdisciplinary Research, 3:211-233.
Chotia,V. and Rao. N.V.M.(2015). Examining the interlinkage between regional infrastructure
disparities and poverty: A case of Indian states. Economic Annals, 205:53-71.
Feroz, S. M. and Chauhan, A. K. (2010). Performance of dairy self help groups (SHGs)
India: Principal component analysis (PCA) approach. Indian Journal of Agricultural
Economics, 65:308-320.
GOI.(1999).Education Development Index. Department of Secondary and Higher Education,
GOI, MHRD, 167-185.
Kale, R. B., Ponnusamy, K., Chakravaty, A. K., Sendhil, R. And Mohmmad, A. (2016).
Assessing resource and infrastructure disparities to strengthen Indian dairy sector.
Indian Journal of Animal Science, 86:720-725.
View publication stats

Principal Component Analysis (PCA) Based Indexing: March 2017

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Principal Component Analysis (PCA) Based Indexing: March 2017

Uploaded by

Copyright:

Available Formats

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

Principal Component Analysis (PCA) based Indexing

Chapter · March 2017

Darshnaben Mahida Sendhil Ramadas

SEE PROFILE SEE PROFILE

The user has requested enhancement of the downloaded file.

10. Principal Component Analysis (PCA) based Indexing

Principal Component Analysis (PCA)

Steps for Computing Index using PCA

PCA based Indexing: An Illustration

54 | ICAR-Indian Institute of Wheat and Barley Research

Formula for normalization

b. When a variable have negative influence then

Through PCA we can also have a

55 | ICAR-Indian Institute of Wheat and Barley Research

Ayyoob, K. C., Krishnadas, M. and Kaeel, C. M. H. (2013). Intra-regional disparities in

56 | ICAR-Indian Institute of Wheat and Barley Research

View publication stats

You might also like