You are on page 1of 10
We SPECIAL SECTION: COMPUTATIONAL SCIENCE ‘TUTORIALS An introduction to the proper orthogonal decomposition Anindya Chatterjee* Depatmest of Engineering ‘A tutorial Is presented on the Proper Orthogonal Decomposition (POD), which finds applications in computa tionally processing Inrge amounts of high-dimensional data with the alm of obtaining low-dimensional descriptions that capture much of the phenomena of interest. The dis- crete version of the POD, which is the singular value de- composition (SVD) of matrices, is described in some deta, ‘The continuous version of the POD is outlined, Low-rank approximations to data using the SVD are discussed, The SVD and the eigenvalue decomposition are compared. Two geometric Interpretations of the SVD/POD are given, Computational strategles (using standard software) are mentioned. Two numerical examples are provided: one shows low-rank approximations of a surface, and the other demonstrates simple a posteriori analysis of data from 2 simulated vibroimpaet system, Some relevant computer code Is supplied, 1, Introduction ‘ComPUTERS have increased our eapecity to not only sinn- late complicated systems, but also-to collect and analyse large amounts of data. Using personal computers, it is now unremarkable to record data at, say, 20 Kitz for afew hours then process hundreds of millions of data points few quantities of final interest. For example, ex pensive machinery might be instrumented and monitored lover days, with the sole objective of efficiently scheduling ‘maintenance. ‘This article provides an introduetion to the Proper Ontho- gonal Decomposition (POD) which is a powerful and elegant method of data analysis aimed at obtaining tow: dimensional approximate descriptions of high-dimensional processes. The POD was developed by several poople (among the first was Kosambi'), and is also known as Prin cipal Component Analysis, the Karhuen-Loéve Decomposition, and the single value decomposition. The POD has been wed to obtain approximate, low-dimensional descriptions of turbulent fluid flows%, structural vibra tions™, and insect gait’, and has been used for damage de- tection’ to name @ few applications in dynamic systems. ft mals anindva@erah em paueds 08 8 and Mechanis, Penn State Unversity, Univesity Pak, Peansylvans, PA, 16802, USA has also been extensively used in image processing, signal analysis and data compression, For references to the many sources of the POD, for applications of the POD in a variety of fclds, as well as for a nice treatment that complements this tutorial, the reader is encouraged to read Holmes, Lum Jey and Berkooz! (chapter 3). Data analysis using the POD is often conducted to ex: tract ‘mode shapes’ or basis functions, from experimental data of detailed simulations of high-i for subsequent use in Galerkin projections that yield low- dimensional dynamieal models (se ref. 2). This attcle con- centrates on the data analysis aspect, and subsequent ‘duced-order modelling is not discussed. 2. Motivation Suppose we wish to approximate a function 2(x,4) over some domain of interest as a finite sum in the variables: separated form tos Beaco, o withthe reasonable expectation thatthe approximation be- comes exact in the limit as M approaches infinity, except possibly on a set of measure zero (readers unfamiliar wi rmeasuce theory may ignore it if they deat with iite-dimen- sional calculations; snd consult, eg. Rudin” otherwise) ‘While in eq, (1) there is no’ fundamental difference be- tween f and x, we usually think of x as a spatial coordinate (possibly vector-valued) and think of fas a temporal coor dinate, ‘The repeesentation of eg (1) isnot unique. For example, if the domain ofx isa bounded interval X on the real line, then the functions g(x) can be chosen as a Fourier series, or Legendre polynomials, or Chebyshev polynomials, and so fon. For each such choice of a sequence d(x) that forms a basis for some suitable class of functions 2, # (See note 1), the sequence of time-functions a(t) is different. That is, for sines and cosines. «= we = get—one. sequence of funetions ay(0 is different. That is, for sines and cosines we get one sequence of functions ay(), for Legendre polynomials Wve get another, and so on, The POD. Js concetaed with one possible choice of the functions (a). ‘CURRENT SCIENCE, VOL. 78, NO. 7, 10 APRIL 2000, SPECIAL SECTION: COMPUTATIONAL SCIENCE If we have chosen orthonormal basis functions [acne ee one then 2400= [2000 (945, . {., for orthonormal basis funtion, the determination ofthe coefficient function a(?) depends only on gs) and not on the ather gs. ‘What erteria should we use for seleting the functions 42 Onhonormality would be useful. Moreover, while an approximation to any desired accwacy in eq. (1) can always be obtained if 2/ ean be chosen large enough, we may like to choose the 4G) in such a way that the pproximation foreach is a5 good as possible in. least squares sense. That is, we would try to find, onee and for all,asequence of orthonormal functions (2) such thatthe fiat two of these functions give the best possible two-term approximation, the first seven give the best possible sever- term approximation, and soon, These spacial ‘ordered, orthonormal functions are celled the proper arthogonal nodes forthe function 2(%,). With these fune- tions the expression ine (1) i ealled the POD of (0) 3. Theory: Finite-dimensional ease Consider a system where we take measurements of m state variables (these eould be from m strain gauges on a siruc- ture, or m velocity probes in a fig, or mixture of two kinds of probes in a system with flow-induced vibrations, te.) Assume that at W instants of time, we take N sets of simultaneous measurements at these o locations. We a- range out data in an N * m matrix A, suc that element Ay is the measurement from the jth probe taken at the ith tn instant. ‘The m state variables are nor assumed to be measured by transducers that are arranged in some straight line in physi- cal space. We merely assume that the transducers have ‘been numbered for identification; and that their outputs have beon placed side by side in the matrix 4. In the actual physical system these measurements might represent one spatial dimension (e.g. accelerometers on a beam), oF more than one spatial dimension (e.g. pressure probes ina three- dimensional fluid flow experiment). Each physical u ‘ducer may itself measure more than one scalar quantity (e.g, triaxial accelerometers); in such cases, the different sealar time series from the same physical transducer are arranged in different columns of 4. The final result of the data collection is assumed here tobe the > mati 4 ft is common to subtract, from each column of 4, the ‘mean value of that column. Whether or not this is done does not affect the basie calculation, though it affects the ‘CURRENT SCIENCE, VOL. 78, NO. 7, 10 APRIL. 2000, interpretation ofthe result (see section 3.5 below). 3.1 The singular value decomposition We now compute the singular value decomposition (SVD) ‘of the matrix A, which is of the form (fora discussion ofthe SVD, see ref. 8): AsuBy’, % where U is an N20 orthogonal mais, Vis an ann oF thogonal matrix, the superscript 7 indicates matrix tans- pose, and E is an Nm matrix with al elements zero except slong the diagonal. The diagonal elements 2, consist of ‘r= min(¥,m) nonnegative numbers cy, which are arranged in decreasing order, Le, = @z.,.2 Fz 0. The o's are called the singular values of 4 (and also of 4”) and ‘unique. The rank of 4 equals the number of nonzero singu- lar values it has. In the presence of noise, the number of singular values larger than some suitably small fraction of the largest singular value might be taken as the ‘aumerical rk’ Since the singular values are arranged in a specific ‘order, th index & ofthe kth singular value will be called the singular value number (see Figures | e and 2c), 3.2. Correspondence with egs (1) and (2) In eq, (3, let US=Q. Then the matrix Q is Nxm, and V7". Letting qu be the kth column of and vy be the kth column of F, we write out the matrix product as anor" = Soh ® Equation (4) is the dseote form of on. (1). The furtion 6,2) is ropresented ere by the mateix 4. The fune- tion ay) is represented by the colunsn matrix gu. The fune- tion ¢Gs) is represented by the row mati vf, The approx tation of eg. (1) is now exact because the dimension is it. Duet the oxthonormaity ofthe columns oF ¥, eg (2) oesponds to multiplying ea. (4) by one ofthe vs onthe viet 33. Lower-rank approximations 10 A For any 1)))¢0inG.91); subplot (3,2,1) urd (X,T,2) x19 (0,1,0,2,0,4,2.4]) xlabel('x!), ylabel(°t"), zlebel(/2!), titled Actual sucfa ‘CURRENT SCIENCE, VOL. 78, NO. 7,10 APRIL 2000 4% gonore the dave matrix & % plot the surface ") sis il ‘SPECIAL SECTION: COMPUTATIONAL SCIENCE tu,s.vlaavacad; % note: the meana ara not aubtracted % trom the colums 4X for rank 1,2 and 3 approxinations 4% compare vith Bq. 6, and folloving text ad evcs 3 ubplot(3,2,ke1) surf (L,1,22) ,arts({0,1,0,2,0.4,2.11) mlabel x'), ylabel('t!), alebel Cs?) title ({/Rank muster (h),’ approxization’]) % plot the singular values, somilog acale xlabel Cmusber"), ylabel *aingular value’), titleC’Singular values of 2°) subplot(3,2,6) eit J drop ali bot first 3 colums of V oven; 4% nove, thin is the transpose of zev, which % —" corrooponds to Aev in the text phot tye(,1),!—? tye(2s 2,12? stacy xlabe1('t?}, "label nodal coordinates’ title('The soda contributions’) egend('t* 924,139) °) % plot nodal coordinates A2 From section 5.2 % data file containing time t, and ye[x,xdot) subplot(2,2,1) plot(t,x(: .4)) % plot dioplecenent of anos 4 nlabel (tine?) ,ylabel('dioplacenent?) title(?(a) divplacenent of uass 4?) axia([0,600,-3,17]) ssubplot(2,2,2 X start another displacenent plot plot (e (400026500) ,(4000:6600,2) ,'~. (40006600) (400016600, 5),’=") zlabeL('tine') jylabel(' displacement”) title? (>) dashedot: muss 2; solid Line: mass 6°) axia((340,456,-3,20]) for wat:10 2G ADex(:)k)-mean(xC,29);—% wenn subtraction ond fu,e,v}ava(x (20016000, % the second argunent (1.0., 0) in used for 4% far-tron-nquare watrices to save tino and menory mote: first 2000 pointa are dropped (transieats) stiag(s) 416 ‘CURRENT SCIENCE, VOL. 18, NO. 7,10 APRIL 2000 ‘SPECIAL SECTION: COMPUTATIONAL SCIENCE, subplot (2,2,8) sentlogy(s, 0!) 4 plot singular values xlabel (‘singular value musber'),ylabel (aingular value’) titte("e) singular values of soan-subtracted date’) axia((0,11,10-2,1640) supplee(2,2,4) plot (1:20, (5,4), 70~" 10evC 2,24" 121004 9,1 1% plot proper orthogens} modes wlabel "location (nase number) ') ,ylabel /tirat 3 proper orthogonal wodes") ‘title? (@) modes 1 (circles), 2 (otare) and 3 (triangles) xi0((0,11,-0.7,0.71) Notes 1. We assume these functions ae bounded and itereble, In exe periment, meusremants are Bounded and srt integration Is guivlent to summation; andthe wbtstis of iteration an ately be ignored. Heo, 1 stay away ffom Integration theory ‘The fnteresod render may conse. Rudin. 2. Strict peaking, one should sty ‘ank wi most 5 m> Nihon r= Me ther tem egoavsues but only 1 sloglar ‘ler the large 2 eigenetues equal the squats of he N sie- tule values; end the smallest 1 elgeavaoes are al exactly Zero Im 31, then rm, end the m eigenvalues eq the gutes ofthe m singular vues. (Readers my wish to Wore out the details of thee cleslations using = umercal example of a eee 1. Koshi D. D. Indlan Math, Soe, 1943, 7, 76-88, 2, Holmes P., Lmley, J. L,and Barkow, G. Turbulence. Coher- ‘ent Seactres, Dynamical Syeteme and Symmetry, Cambidge ‘Monogt. Mech, Cambridge Univesity Press, 1996, = Cusumano, J.P, Sharky, M. T. and Kimble, BW, Phils Trans. R See. London, See. A, 1994, 347, 421-38. Feary, BF and Kappagaoty, RJ Sotnd Vir, 1998, 218, 60 16, Koulschek, D. Schwing, W., Garcia, Mand Full, Ry 1999 (ranuroipt under propration) Ruta, R. and Siree, C, J. Sound Pir, 1999, 226, 425- a9. Rudin, W., Prniles of Mathematical Analyt, Internationa! Sere in Pare and Applied Mathers, MeGrn Hil, 1976, Sr en 5. Gol, GA, sed Van Loa, C.F Maris Compton, Johns Hopkins University Press, Baltimoce, 1990, 2nd ed. Powe, D. and Sting, D8. G., Integral Equations: 4 Practical Treatment, From Spectral They 10 Applications, Cambie ‘Texts in Applied. Mathematics, Cambridge University Pres, 1990, Ou, B, Chaos in Dynamical Sytem, Cambeldge Univesity res, 1993, Raviedra, B, J. Sound ib, 1999, 219, 189-192 CURRENT SCIENCE, VOL. 78,NO. 7, 10 APRIL. 2000 ar oy

You might also like