In statistics, linear regression is an approach for modeling the relationship between a scalar dependent variable y and one or more explanatory variables (or independent variables) denoted X. The case of one explanatory variable is called simple linear regression. For more than one explanatory variable, the process is called multiple linear regression.[1] (This term should be distinguished from multivariate linear regression, where multiple correlated dependent variables are predicted, rather than a single scalar variable.)[2]
In linear regression, the relationships are modeled using linear predictor functions whose unknown model parameters are estimated from the data. Such models are called linear models.[3] Most commonly, the conditional mean of y given the value of X is assumed to be an affine function of X; less commonly, the median or some other quantile of the conditional distribution of y given X is expressed as a linear function of X. Like all forms of regression analysis, linear regression focuses on the conditional probability distribution of y given X, rather than on the joint probability distribution of y and X, which is the domain of multivariate analysis.
Linear regression was the first type of regression analysis to be studied rigorously, and to be used extensively in practical applications.[4] This is because models which depend linearly on their unknown parameters are easier to fit than models which are nonlinearly related to their parameters, and because the statistical properties of the resulting estimators are easier to determine.
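Linear regression has many practical uses. Most applications fall into one of the following two broad categories:
If the goal is prediction, or forecasting, or error reduction, linear regression can be used to fit a predictive model to an observed data set of y and X values. After developing such a model, if an additional value of X is then given without its accompanying value of y, the fitted model can be used to make a prediction of the value of y.
Given a variable y and a number of variables X1, ..., Xp that may be related to y, linear regression analysis can be applied to quantify the strength of the relationship between y and the Xj, to assess which Xj may have no relationship with y at all, and to identify which subsets of the Xj contain redundant information about y.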
Linear regression models are often fitted using the least squares approach, but they may also be fitted in other ways, such as by minimizing the "lack of fit" in some other norm (as with least absolute deviations regression), or by minimizing a penalized version of the least squares loss function as in ridge regression (L2-norm penalty) and lasso (L1-norm penalty). Conversely, the least squares approach can be used to fit models that are not linear models. Thus, although the terms "least squares" and "linear model" are closely linked, they are not synonymous.

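Introduction to linear regression
Given a data set $\{y_i, x_{i1}, \ldots, x_{ip}\}_{i=1}^{n}$ of n statistical units, a linear regression model assumes that the relationship between the dependent variable $y_i$ and the p-vector of regressors $x_i$ is linear. This relationship is modeled through a disturbance term or error variable $\varepsilon_i$, an unobserved random variable that adds noise to the linear relationship between the dependent variable and regressors. Thus the model takes the form

$$y_i = \beta_1 x_{i1} + \cdots + \beta_p x_{ip} + \varepsilon_i = x_i^{T}\beta + \varepsilon_i, \qquad i = 1, \ldots, n,$$

where T denotes the transpose, so that $x_i^{T}\beta$ is the inner product between vectors $x_i$ and $\beta$.
Often these n equations are stacked together and written in vector form as

$$y = X\beta + \varepsilon,$$

where

$$y = \begin{pmatrix} y_1 \\ y_2 \\ \vdots \\ y_n \end{pmatrix}, \quad
X = \begin{pmatrix} x_1^{T} \\ x_2^{T} \\ \vdots \\ x_n^{T} \end{pmatrix}
  = \begin{pmatrix} x_{11} & \cdots & x_{1p} \\ x_{21} & \cdots & x_{2p} \\ \vdots & \ddots & \vdots \\ x_{n1} & \cdots & x_{np} \end{pmatrix}, \quad
\beta = \begin{pmatrix} \beta_1 \\ \beta_2 \\ \vdots \\ \beta_p \end{pmatrix}, \quad
\varepsilon = \begin{pmatrix} \varepsilon_1 \\ \varepsilon_2 \\ \vdots \\ \varepsilon_n \end{pmatrix}.$$

[Figure: Example of simple linear regression, which has one independent variable.]
[Figure: Example of a cubic polynomial regression, which is a type of linear regression.]
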
Some remarks on terminology and general use:
$y_i$ is called the regressand, endogenous variable, response variable, measured variable, criterion variable, or dependent variable (see dependent and independent variables). The decision as to which variable in a data set is modeled as the dependent variable and which are modeled as the independent variables may be based on a presumption that the value of one of the variables is caused by, or directly influenced by, the other variables. Alternatively, there may be an operational reason to model one of the variables in terms of the others, in which case there need be no presumption of causality.
$x_{i1}, x_{i2}, \ldots, x_{ip}$ are called regressors, exogenous variables, explanatory variables, covariates, input variables, predictor variables, or independent variables (see dependent and independent variables, but not to be confused with independent random variables). The matrix $X$ is sometimes called the design matrix.
Usually a constant is included as one of the regressors. For example, we can take $x_{i1} = 1$ for i = 1, ..., n. The corresponding element of $\beta$ is called the intercept. Many statistical inference procedures for linear models require an intercept to be present, so it is often included even if theoretical considerations suggest that its value should be zero.
Sometimes one of the regressors can be a nonlinear function of another regressor or of the data, as in polynomial regression and segmented regression. The model remains linear as long as it is linear in the parameter vector $\beta$.
The regressors $x_{ij}$ may be viewed either as random variables, which we simply observe, or they can be considered as predetermined fixed values which we can choose. Both interpretations may be appropriate in different cases, and they generally lead to the same estimation procedures; however, different approaches to asymptotic analysis are used in these two situations.
$\beta$ is a p-dimensional parameter vector. Its elements are also called effects, or regression coefficients. Statistical estimation and inference in linear regression focuses on $\beta$. The elements of this parameter vector are interpreted as the partial derivatives of the dependent variable with respect to the various independent variables.
$\varepsilon_i$ is called the error term, disturbance term, or noise. This variable captures all other factors which influence the dependent variable $y_i$ other than the regressors $x_i$. The relationship between the error term and the regressors, for example whether they are correlated, is a crucial consideration in formulating a linear regression model, as it will determine the method to use for estimation.
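Example. Consider a situation where a small ball is being tossed up in the air and then we measure its heights of ascent $h_i$ at various moments in time $t_i$. Physics tells us that, ignoring the drag, the relationship can be modeled as

$$h_i = \beta_1 t_i + \beta_2 t_i^2 + \varepsilon_i,$$

where $\beta_1$ determines the initial velocity of the ball, $\beta_2$ is proportional to the standard gravity, and $\varepsilon_i$ is due to measurement errors. Linear regression can be used to estimate the values of $\beta_1$ and $\beta_2$ from the measured data. This model is nonlinear in the time variable, but it is linear in the parameters $\beta_1$ and $\beta_2$; if we take regressors $x_i = (x_{i1}, x_{i2}) = (t_i, t_i^2)$, the model takes on the standard form

$$h_i = x_i^{T}\beta + \varepsilon_i.$$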
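The ball-toss example can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `fit_ball_toss`, the sample times, and the noise-free synthetic heights (generated from assumed coefficients 5.0 and -4.9) are hypothetical and not part of the original example. The code solves the 2×2 normal equations for the regressors $(t_i, t_i^2)$ directly.

```python
def fit_ball_toss(times, heights):
    """Least-squares fit of h = b1*t + b2*t**2 (no intercept).

    The regressors are x_i = (t_i, t_i**2), so the model is linear
    in (b1, b2) even though it is quadratic in t.
    """
    # Entries of the 2x2 Gram matrix X^T X.
    s11 = sum(t ** 2 for t in times)
    s12 = sum(t ** 3 for t in times)
    s22 = sum(t ** 4 for t in times)
    # Entries of X^T h.
    r1 = sum(t * h for t, h in zip(times, heights))
    r2 = sum(t ** 2 * h for t, h in zip(times, heights))
    # Solve the normal equations by Cramer's rule.
    det = s11 * s22 - s12 ** 2
    b1 = (r1 * s22 - s12 * r2) / det
    b2 = (s11 * r2 - s12 * r1) / det
    return b1, b2


# Hypothetical, noise-free measurements generated from assumed coefficients.
times = [0.1 * k for k in range(1, 11)]
heights = [5.0 * t - 4.9 * t ** 2 for t in times]
b1, b2 = fit_ball_toss(times, heights)
```

With real measurements the recovered coefficients would differ from the generating values by an amount that depends on the measurement noise.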

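Assumptions
Standard linear regression models with standard estimation techniques make a number of assumptions about the predictor variables, the response variables and their relationship. Numerous extensions have been developed that allow each of these assumptions to be relaxed (i.e. reduced to a weaker form), and in some cases eliminated entirely. Some methods are general enough that they can relax multiple assumptions at once, and in other cases this can be achieved by combining different extensions. Generally these extensions make the estimation procedure more complex and time-consuming, and may also require more data in order to produce an equally precise model.
The following are the major assumptions made by standard linear regression models with standard estimation techniques (e.g. ordinary least squares):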
Weak exogeneity. This essentially means that the predictor variables x can be treated as fixed values, rather than random variables. This means, for example, that the predictor variables are assumed to be error-free, that is, not contaminated with measurement errors. Although this assumption is not realistic in many settings, dropping it leads to significantly more difficult errors-in-variables models.
Linearity. This means that the mean of the response variable is a linear combination of the parameters (regression coefficients) and the predictor variables. Note that this assumption is much less restrictive than it may at first seem. Because the predictor variables are treated as fixed values (see above), linearity is really only a restriction on the parameters. The predictor variables themselves can be arbitrarily transformed, and in fact multiple copies of the same underlying predictor variable can be added, each one transformed differently. This trick is used, for example, in polynomial regression, which uses linear regression to fit the response variable as an arbitrary polynomial function (up to a given degree) of a predictor variable. This makes linear regression an extremely powerful inference method. In fact, models such as polynomial regression are often "too powerful", in that they tend to overfit the data. As a result, some kind of regularization must typically be used to prevent unreasonable solutions coming out of the estimation process. Common examples are ridge regression and lasso regression. Bayesian linear regression can also be used, which by its nature is more or less immune to the problem of overfitting. (In fact, ridge regression and lasso regression can both be viewed as special cases of Bayesian linear regression, with particular types of prior distributions placed on the regression coefficients.)
Constant variance (a.k.a. homoscedasticity). This means that different response variables have the same variance in their errors, regardless of the values of the predictor variables. In practice this assumption is invalid (i.e. the errors are heteroscedastic) if the response variables can vary over a wide scale. In order to check for heterogeneous error variance, or when a pattern of residuals violates model assumptions of homoscedasticity (error is equally variable around the 'best-fitting line' for all points of x), it is prudent to look for a "fanning effect" between residual error and predicted values. This is to say there will be a systematic change in the absolute or squared residuals when plotted against the predicted outcome. Error will not be evenly distributed across the regression line. Heteroscedasticity will result in the averaging over of distinguishable variances around the points to get a single variance that inaccurately represents all the variances of the line. In effect, residuals appear clustered and spread apart on their predicted plots for larger and smaller values for points along the linear regression line, and the mean squared error for the model will be wrong. Typically, for example, a response variable whose mean is large will have a greater variance than one whose mean is small. For example, a given person whose income is predicted to be \$100,000 may easily have an actual income of \$80,000 or \$120,000 (a standard deviation of around \$20,000), while another person with a predicted income of \$10,000 is unlikely to have the same \$20,000 standard deviation, which would imply their actual income would vary anywhere between -\$10,000 and \$30,000. (In fact, as this shows, in many cases, often the same cases where the assumption of normally distributed errors fails, the variance or standard deviation should be predicted to be proportional to the mean, rather than constant.) Simple linear regression estimation methods give less precise parameter estimates and misleading inferential quantities such as standard errors when substantial heteroscedasticity is present. However, various estimation techniques (e.g. weighted least squares and heteroscedasticity-consistent standard errors) can handle heteroscedasticity in a quite general way. Bayesian linear regression techniques can also be used when the variance is assumed to be a function of the mean. It is also possible in some cases to fix the problem by applying a transformation to the response variable (e.g. fit the logarithm of the response variable using a linear regression model, which implies that the response variable has a log-normal distribution rather than a normal distribution).
Independence of errors. This assumes that the errors of the response variables are uncorrelated with each other. (Actual statistical independence is a stronger condition than mere lack of correlation and is often not needed, although it can be exploited if it is known to hold.) Some methods (e.g. generalized least squares) are capable of handling correlated errors, although they typically require significantly more data unless some sort of regularization is used to bias the model towards assuming uncorrelated errors. Bayesian linear regression is a general way of handling this issue.
Lack of multicollinearity in the predictors. For standard least squares estimation methods, the design matrix X must have full column rank p; otherwise, we have a condition known as multicollinearity in the predictor variables. This can be triggered by having two or more perfectly correlated predictor variables (e.g. if the same predictor variable is mistakenly given twice, either without transforming one of the copies or by transforming one of the copies linearly). It can also happen if there is too little data available compared to the number of parameters to be estimated (e.g. fewer data points than regression coefficients). In the case of multicollinearity, the parameter vector $\beta$ will be non-identifiable: it has no unique solution. At most we will be able to identify some of the parameters, i.e. narrow down its value to some linear subspace of $\mathbb{R}^p$. See partial least squares regression. Methods for fitting linear models with multicollinearity have been developed; some require additional assumptions such as "effect sparsity", that is, that a large fraction of the effects are exactly zero. Note that the more computationally expensive iterated algorithms for parameter estimation, such as those used in generalized linear models, do not suffer from this problem, and in fact it is quite normal when handling categorically valued predictors to introduce a separate indicator variable predictor for each possible category, which inevitably introduces multicollinearity.
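The rank condition can be checked numerically in the two-predictor case. The sketch below is illustrative only: the helper name `gram_determinant` and the data are assumptions made for the demonstration. It computes the determinant of the 2×2 matrix $X^{T}X$; when one predictor is a linear copy of the other, the determinant is zero, so the normal equations have no unique solution.

```python
def gram_determinant(x1, x2):
    """Determinant of X^T X for a design matrix with columns x1 and x2."""
    s11 = sum(a * a for a in x1)
    s12 = sum(a * b for a, b in zip(x1, x2))
    s22 = sum(b * b for b in x2)
    return s11 * s22 - s12 * s12


x1 = [1.0, 2.0, 3.0, 4.0]
x2_copy = [2.0 * a for a in x1]      # linearly transformed copy of x1
x2_other = [1.0, 0.0, 2.0, 1.0]      # not a multiple of x1

det_collinear = gram_determinant(x1, x2_copy)   # singular: no unique estimate
det_ok = gram_determinant(x1, x2_other)         # nonsingular: unique estimate
```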
Beyond these assumptions, several other statistical properties of the data strongly influence the performance of different estimation methods:
The statistical relationship between the error terms and the regressors plays an important role in determining whether an estimation procedure has desirable sampling properties such as being unbiased and consistent.
The arrangement, or probability distribution, of the predictor variables x has a major influence on the precision of estimates of $\beta$. Sampling and design of experiments are highly developed subfields of statistics that provide guidance for collecting data in such a way to achieve a precise estimate of $\beta$.

Interpretation
A fitted linear regression model can be used to identify the relationship between a single predictor variable $x_j$ and the response variable y when all the other predictor variables in the model are "held fixed". Specifically, the interpretation of $\beta_j$ is the expected change in y for a one-unit change in $x_j$ when the other covariates are held fixed, that is, the expected value of the partial derivative of y with respect to $x_j$. This is sometimes called the unique effect of $x_j$ on y. In contrast, the marginal effect of $x_j$ on y can be assessed using a correlation coefficient or simple linear regression model relating $x_j$ to y; this effect is the total derivative of y with respect to $x_j$.

[Figure: The data sets in Anscombe's quartet have the same linear regression line but are themselves very different.]

Care must be taken when interpreting regression results, as some of the regressors may not allow for marginal changes (such as dummy variables, or the intercept term), while others cannot be held fixed (recall the example from the introduction: it would be impossible to "hold $t_i$ fixed" and at the same time change the value of $t_i^2$).

It is possible that the unique effect can be nearly zero even when the marginal effect is large. This may imply that some other covariate captures all the information in $x_j$, so that once that variable is in the model, there is no contribution of $x_j$ to the variation in y. Conversely, the unique effect of $x_j$ can be large while its marginal effect is nearly zero. This would happen if the other covariates explained a great deal of the variation of y, but they mainly explain variation in a way that is complementary to what is captured by $x_j$. In this case, including the other variables in the model reduces the part of the variability of y that is unrelated to $x_j$, thereby strengthening the apparent relationship with $x_j$.
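The first situation, a large marginal effect with a zero unique effect, can be reproduced with a small constructed data set. In this sketch the response is exactly twice the second predictor, and the first predictor is merely correlated with the second. The function names and the data are hypothetical, chosen only to make the arithmetic transparent.

```python
def center(values):
    """Subtract the mean, which absorbs the intercept of the regression."""
    m = sum(values) / len(values)
    return [v - m for v in values]


def marginal_slope(x, y):
    """Slope of the simple linear regression of y on x alone."""
    xc, yc = center(x), center(y)
    return sum(a * b for a, b in zip(xc, yc)) / sum(a * a for a in xc)


def two_predictor_ols(x1, x2, y):
    """Coefficients of x1 and x2 in the regression of y on both (with intercept)."""
    a, b, c = center(x1), center(x2), center(y)
    s11 = sum(p * p for p in a)
    s12 = sum(p * q for p, q in zip(a, b))
    s22 = sum(q * q for q in b)
    r1 = sum(p * t for p, t in zip(a, c))
    r2 = sum(q * t for q, t in zip(b, c))
    det = s11 * s22 - s12 * s12
    return (r1 * s22 - s12 * r2) / det, (s11 * r2 - s12 * r1) / det


x1 = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
x2 = [1.0, 3.0, 2.0, 5.0, 4.0, 6.0]   # correlated with x1
y = [2.0 * v for v in x2]             # y depends on x2 only

marginal_b1 = marginal_slope(x1, y)        # clearly nonzero
unique_b1, unique_b2 = two_predictor_ols(x1, x2, y)  # x1 adds nothing once x2 is present
```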
The meaning of the expression "held fixed" may depend on how the values of the predictor variables arise. If the experimenter directly sets the values of the predictor variables according to a study design, the comparisons of interest may literally correspond to comparisons among units whose predictor variables have been "held fixed" by the experimenter. Alternatively, the expression "held fixed" can refer to a selection that takes place in the context of data analysis. In this case, we "hold a variable fixed" by restricting our attention to the subsets of the data that happen to have a common value for the given predictor variable. This is the only interpretation of "held fixed" that can be used in an observational study.
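The notion of a "unique effect" is appealing when studying a complex system where multiple interrelated components influence the response variable. In some cases, it can literally be interpreted as the causal effect of an intervention that is linked to the value of a predictor variable. However, it has been argued that in many cases multiple regression analysis fails to clarify the relationships between the predictor variables and the response variable when the predictors are correlated with each other and are not assigned following a study design. A commonality analysis may be helpful in disentangling the shared and unique impacts of correlated independent variables.[10]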

Extensions
Numerous extensions of linear regression have been developed, which allow some or all of the assumptions underlying the basic model to be relaxed.

Simple and multiple regression
The very simplest case of a single scalar predictor variable x and a single scalar response variable y is known as simple linear regression. The extension to multiple and/or vector-valued predictor variables (denoted with a capital X) is known as multiple linear regression, also known as multivariable linear regression. Nearly all real-world regression models involve multiple predictors, and basic descriptions of linear regression are often phrased in terms of the multiple regression model. Note, however, that in these cases the response variable y is still a scalar. Another term, multivariate linear regression, refers to cases where y is a vector, i.e., the same as general linear regression. The difference between multivariate linear regression and multivariable linear regression should be emphasized, as it causes much confusion and misunderstanding in the literature.

General linear models
The general linear model considers the situation when the response variable Y is not a scalar but a vector. Conditional linearity of $E(y \mid x) = Bx$ is still assumed, with a matrix B replacing the vector $\beta$ of the classical linear regression model. Multivariate analogues of Ordinary Least Squares (OLS) and Generalized Least Squares (GLS) have been developed. The term "general linear models" is equivalent to "multivariate linear models". Note the difference between "multivariate linear models" and "multivariable linear models": the former is the same as "general linear models" and the latter is the same as "multiple linear models."

Heteroscedastic models
Various models have been created that allow for heteroscedasticity, i.e. the errors for different response variables may have different variances. For example, weighted least squares is a method for estimating linear regression models when the response variables may have different error variances, possibly with correlated errors. (See also Weighted linear least squares, and generalized least squares.) Heteroscedasticity-consistent standard errors are an improved method for use with uncorrelated but potentially heteroscedastic errors.

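Generalized linear models
Generalized linear models (GLMs) are a framework for modeling a response variable y that is bounded or discrete. This is used, for example:
when modeling positive quantities (e.g. prices or populations) that vary over a large scale, which are better described using a skewed distribution such as the log-normal distribution or Poisson distribution (although GLMs are not used for log-normal data; instead the response variable is simply transformed using the logarithm function);
when modeling categorical data, such as the choice of a given candidate in an election (which is better described using a Bernoulli distribution/binomial distribution for binary choices, or a categorical distribution/multinomial distribution for multi-way choices), where there are a fixed number of choices that cannot be meaningfully ordered;
when modeling ordinal data, e.g. ratings on a scale from 0 to 5, where the different outcomes can be ordered but where the quantity itself may not have any absolute meaning (e.g. a rating of 4 may not be "twice as good" in any objective sense as a rating of 2, but simply indicates that it is better than 2 or 3 but not as good as 5).
Generalized linear models allow for an arbitrary link function g that relates the mean of the response variable to the predictors, i.e. $g(E(y)) = x^{T}\beta$. The link function is often related to the distribution of the response, and in particular it typically has the effect of transforming between the $(-\infty, \infty)$ range of the linear predictor and the range of the response variable.
Some common examples of GLMs are:
Poisson regression for count data.
Logistic regression and probit regression for binary data.
Multinomial logistic regression and multinomial probit regression for categorical data.
Ordered probit regression for ordinal data.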
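To make the role of the link function concrete, the following sketch shows the inverse logit used in logistic regression mapping an unbounded linear predictor onto the interval (0, 1). The coefficient values and function names are hypothetical, and no model fitting is performed here.

```python
import math


def linear_predictor(x, beta):
    """Inner product of a regressor vector and a coefficient vector."""
    return sum(xj * bj for xj, bj in zip(x, beta))


def inverse_logit(eta):
    """Map a linear predictor in (-inf, inf) to a mean response in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-eta))


beta = [-1.0, 0.5]  # hypothetical intercept and slope
inputs = [-10.0, 0.0, 2.0, 10.0]
# The leading 1.0 in each regressor vector carries the intercept.
probabilities = [inverse_logit(linear_predictor([1.0, x], beta)) for x in inputs]
```

However large or small the linear predictor becomes, the modeled mean stays strictly between 0 and 1, which is what makes the model suitable for binary data.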
Single index models allow some degree of nonlinearity in the relationship between x and y, while preserving the central role of the linear predictor $x^{T}\beta$ as in the classical linear regression model. Under certain conditions, simply applying OLS to data from a single-index model will consistently estimate $\beta$ up to a proportionality constant.[11]

Hierarchical linear models

Hierarchical linear models (or multilevel regression) organize the data into a hierarchy of regressions, for example where A is regressed on B, and B is regressed on C. They are often used where the variables of interest have a natural hierarchical structure such as in educational statistics, where students are nested in classrooms, classrooms are nested in schools, and schools are nested in some administrative grouping, such as a school district. The response variable might be a measure of student achievement such as a test score, and different covariates would be collected at the classroom, school, and school district levels.

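Errors-in-variables
Errors-in-variables models (or "measurement error models") extend the traditional linear regression model to allow the predictor variables X to be observed with error. This error causes standard estimators of $\beta$ to become biased. Generally, the form of bias is an attenuation, meaning that the effects are biased toward zero.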
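The attenuation can be seen in a small simulation. The sketch below is illustrative and rests on assumptions that are not in the text: a true slope of 2, standard normal predictors, and measurement error with the same variance as the predictor, for which the classical attenuation factor is 1/2. The seed and sample size are arbitrary.

```python
import random


def ols_slope(x, y):
    """Slope of the simple linear regression of y on x (with intercept)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    return sxy / sxx


random.seed(0)
n = 20000
x_true = [random.gauss(0.0, 1.0) for _ in range(n)]
y = [2.0 * x + random.gauss(0.0, 0.5) for x in x_true]
# The predictor is observed with additive error of variance 1.
x_observed = [x + random.gauss(0.0, 1.0) for x in x_true]

slope_clean = ols_slope(x_true, y)      # close to the true slope 2
slope_noisy = ols_slope(x_observed, y)  # shrunk toward zero, close to 1
```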

Others
In Dempster–Shafer theory, or a linear belief function in particular, a linear regression model may be represented as a partially swept matrix, which can be combined with similar matrices representing observations and other assumed normal distributions and state equations. The combination of swept or unswept matrices provides an alternative method for estimating linear regression models.

Estimation methods
A large number of procedures have been developed for parameter estimation and inference in linear regression. These methods differ in computational simplicity of algorithms, presence of a closed-form solution, robustness with respect to heavy-tailed distributions, and theoretical assumptions needed to validate desirable statistical properties such as consistency and asymptotic efficiency.
Some of the more common estimation techniques for linear regression are summarized below.

Least-squares estimation and related techniques
Ordinary least squares (OLS) is the simplest and thus most common estimator. It is conceptually simple and computationally straightforward. OLS estimates are commonly used to analyze both experimental and observational data.

[Figure: Comparison of the Theil–Sen estimator (black) and simple linear regression (blue) for a set of points with outliers.]

The OLS method minimizes the sum of squared residuals, and leads to a closed-form expression for the estimated value of the unknown parameter $\beta$:

$$\hat{\beta} = (X^{T}X)^{-1}X^{T}y = \Big(\sum_{i} x_i x_i^{T}\Big)^{-1}\Big(\sum_{i} x_i y_i\Big).$$

The estimator is unbiased and consistent if the errors have finite variance and are uncorrelated with the regressors,[12] $E[x_i\varepsilon_i] = 0$.
It is also efficient (among linear unbiased estimators) under the assumption that the errors have finite variance and are homoscedastic, meaning that $E[\varepsilon_i^2 \mid x_i]$ does not depend on i. The condition that the errors are uncorrelated with the regressors will generally be satisfied in an experiment, but in the case of observational data, it is difficult to exclude the possibility of an omitted covariate z that is related to both the observed covariates and the response variable. The existence of such a covariate will generally lead to a correlation between the regressors and the error term, and hence to an inconsistent estimator of $\beta$. The condition of homoscedasticity can fail with either experimental or observational data. If the goal is either inference or predictive modeling, the performance of OLS estimates can be poor if multicollinearity is present, unless the sample size is large.
In simple linear regression, where there is only one regressor (with a constant), the OLS coefficient estimates have a simple form that is closely related to the correlation coefficient between the covariate and the response.
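For the one-regressor case, the closed form and its link to the correlation coefficient can be written out directly. The following is a minimal sketch with made-up data; the function name `simple_ols` is an assumption. It returns the intercept, the slope, and the sample correlation r, and the slope equals r multiplied by the ratio of the standard deviations of y and x.

```python
from math import sqrt


def simple_ols(x, y):
    """Intercept, slope and correlation for the regression of y on x."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    slope = sxy / sxx
    intercept = my - slope * mx
    r = sxy / sqrt(sxx * syy)
    return intercept, slope, r, sqrt(syy / sxx)


# Hypothetical data lying close to the line y = 2x.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 4.1, 5.9, 8.2, 9.8]
intercept, slope, r, sd_ratio = simple_ols(x, y)
slope_from_r = r * sd_ratio  # same value as slope, up to rounding
```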
Generalized least squares (GLS) is an extension of the OLS method that allows efficient estimation of $\beta$ when either heteroscedasticity, or correlations, or both are present among the error terms of the model, as long as the form of heteroscedasticity and correlation is known independently of the data. To handle heteroscedasticity when the error terms are uncorrelated with each other, GLS minimizes a weighted analogue to the sum of squared residuals from OLS regression, where the weight for the i-th case is inversely proportional to $\mathrm{var}(\varepsilon_i)$. This special case of GLS is called "weighted least squares". The GLS solution to the estimation problem is

$$\hat{\beta} = (X^{T}\Omega^{-1}X)^{-1}X^{T}\Omega^{-1}y,$$

where $\Omega$ is the covariance matrix of the errors. GLS can be viewed as applying a linear transformation to the data so that the assumptions of OLS are met for the transformed data. For GLS to be applied, the covariance structure of the errors must be known up to a multiplicative constant.
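The weighted least squares special case can be sketched for a single regressor with an intercept. In this illustration the error variances are assumed known, and the data, including one deliberately unreliable observation, are invented for the example. Each case is weighted by the reciprocal of its error variance.

```python
def weighted_least_squares(x, y, variances):
    """Intercept and slope minimizing sum_i (y_i - a - b*x_i)**2 / var_i."""
    w = [1.0 / v for v in variances]
    sw = sum(w)
    # Weighted means of x and y.
    mx = sum(wi * xi for wi, xi in zip(w, x)) / sw
    my = sum(wi * yi for wi, yi in zip(w, y)) / sw
    sxy = sum(wi * (xi - mx) * (yi - my) for wi, xi, yi in zip(w, x, y))
    sxx = sum(wi * (xi - mx) ** 2 for wi, xi in zip(w, x))
    slope = sxy / sxx
    return my - slope * mx, slope


x = [0.0, 1.0, 2.0, 3.0]
y = [1.0, 3.0, 5.0, 20.0]   # last point is far off the line y = 1 + 2x

# Equal variances reproduce ordinary least squares.
ols_intercept, ols_slope = weighted_least_squares(x, y, [1.0, 1.0, 1.0, 1.0])
# Declaring the last observation 100 times noisier down-weights it.
wls_intercept, wls_slope = weighted_least_squares(x, y, [1.0, 1.0, 1.0, 100.0])
```

Down-weighting the noisy observation pulls the fitted slope back toward the slope suggested by the reliable points.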
Percentage least squares focuses on reducing percentage errors, which is useful in the field of forecasting or time series analysis. It is also useful in situations where the dependent variable has a wide range without constant variance, as here the larger residuals at the upper end of the range would dominate if OLS were used. When the percentage or relative error is normally distributed, least squares percentage regression provides maximum likelihood estimates. Percentage regression is linked to a multiplicative error model, whereas OLS is linked to models containing an additive error term.[13]
Iteratively reweighted least squares (IRLS) is used when heteroscedasticity, or correlations, or both are present among the error terms of the model, but where little is known about the covariance structure of the errors independently of the data.[14] In the first iteration, OLS, or GLS with a provisional covariance structure, is carried out, and the residuals are obtained from the fit. Based on the residuals, an improved estimate of the covariance structure of the errors can usually be obtained. A subsequent GLS iteration is then performed using this estimate of the error structure to define the weights. The process can be iterated to convergence, but in many cases, only one iteration is sufficient to achieve an efficient estimate of $\beta$.[15][16]
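Instrumental variables regression (IV) can be performed when the regressors are correlated with the errors. In this case, we need the existence of some auxiliary instrumental variables $z_i$ such that $E[z_i\varepsilon_i] = 0$. If Z is the matrix of instruments, then the estimator can be given in closed form as

$$\hat{\beta} = \big(X^{T}Z(Z^{T}Z)^{-1}Z^{T}X\big)^{-1}X^{T}Z(Z^{T}Z)^{-1}Z^{T}y.$$

Optimal instruments regression is an extension of classical IV regression to the situation where $E[\varepsilon_i \mid z_i] = 0$.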
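With a single regressor and a single instrument, the closed form reduces to the ratio of the covariance of z and y to the covariance of z and x. The simulation below is a sketch under assumed conditions: an unobserved confounder u enters both x and the error term, while z affects y only through x. The true slope of 2, the seed, and the sample size are arbitrary choices.

```python
import random


def cov(a, b):
    """Sample covariance (population normalization) of two sequences."""
    n = len(a)
    ma, mb = sum(a) / n, sum(b) / n
    return sum((p - ma) * (q - mb) for p, q in zip(a, b)) / n


random.seed(1)
n = 20000
z = [random.gauss(0.0, 1.0) for _ in range(n)]   # instrument
u = [random.gauss(0.0, 1.0) for _ in range(n)]   # unobserved confounder
x = [zi + ui for zi, ui in zip(z, u)]            # regressor, correlated with u
# The error term 3*u + noise is correlated with x through u.
y = [2.0 * xi + 3.0 * ui + random.gauss(0.0, 1.0) for xi, ui in zip(x, u)]

ols_slope = cov(x, y) / cov(x, x)   # inconsistent: converges to 3.5 here
iv_slope = cov(z, y) / cov(z, x)    # consistent: converges to 2
```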
Total least squares (TLS)[17] is an approach to least squares estimation of the linear regression model that treats the covariates and response variable in a more geometrically symmetric manner than OLS. It is one approach to handling the "errors in variables" problem, and is also sometimes used even when the covariates are assumed to be error-free.

Maximum-likelihood estimation and related techniques
Maximum likelihood estimation can be performed when the distribution of the error terms is known to belong to a certain parametric family $f_\theta$ of probability distributions.[18] When $f_\theta$ is a normal distribution with zero mean and variance $\theta$, the resulting estimate is identical to the OLS estimate. GLS estimates are maximum likelihood estimates when $\varepsilon$ follows a multivariate normal distribution with a known covariance matrix.
Ridge regression,[19][20][21] and other forms of penalized estimation such as Lasso regression,[5] deliberately introduce bias into the estimation of $\beta$ in order to reduce the variability of the estimate. The resulting estimators generally have lower mean squared error than the OLS estimates, particularly when multicollinearity is present or when overfitting is a problem. They are generally used when the goal is to predict the value of the response variable y for values of the predictors x that have not yet been observed. These methods are not as commonly used when the goal is inference, since it is difficult to account for the bias.
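The deliberate bias is easiest to see with one predictor and no intercept, where the ridge estimate has the closed form $\sum x_i y_i / (\sum x_i^2 + \lambda)$. The sketch below uses hypothetical data and an assumed function name; the penalty strength `lam` is a free tuning parameter, not a value taken from the text.

```python
def ridge_slope(x, y, lam):
    """Minimizer of sum_i (y_i - b*x_i)**2 + lam * b**2 for a single predictor."""
    sxy = sum(a * b for a, b in zip(x, y))
    sxx = sum(a * a for a in x)
    return sxy / (sxx + lam)


x = [1.0, 2.0, 3.0]
y = [2.0, 4.0, 6.0]   # exactly y = 2x

slope_ols = ridge_slope(x, y, 0.0)       # no penalty: the OLS estimate
slope_ridge = ridge_slope(x, y, 14.0)    # penalty equal to sum of x**2 halves it
```

Larger penalties shrink the estimate further toward zero, trading bias for reduced variance.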
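Least absolute deviation (LAD) regression is a robust estimation technique in that it is less sensitive to the presence of outliers than OLS (but is less efficient than OLS when no outliers are present). It is equivalent to maximum likelihood estimation under a Laplace distribution model for $\varepsilon$.[22]
Adaptive estimation. If we assume that the error terms are independent of the regressors, $\varepsilon_i \perp x_i$, the optimal estimator is the 2-step MLE, where the first step is used to non-parametrically estimate the distribution of the error term.[23]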

Other estimation techniques
Bayesian linear regression applies the framework of Bayesian statistics to linear regression. (See also Bayesian multivariate linear regression.) In particular, the regression coefficients $\beta$ are assumed to be random variables with a specified prior distribution. The prior distribution can bias the solutions for the regression coefficients, in a way similar to (but more general than) ridge regression or lasso regression. In addition, the Bayesian estimation process produces not a single point estimate for the "best" values of the regression coefficients but an entire posterior distribution, completely describing the uncertainty surrounding the quantity. This can be used to estimate the "best" coefficients using the mean, mode, median, any quantile (see quantile regression), or any other function of the posterior distribution.
Quantile regression focuses on the conditional quantiles of y given X rather than the conditional mean of y given X. Linear quantile regression models a particular conditional quantile, for example the conditional median, as a linear function $\beta^{T}x$ of the predictors.
Mixed models are widely used to analyze linear regression relationships involving dependent data when the dependencies have a known structure. Common applications of mixed models include analysis of data involving repeated measurements, such as longitudinal data, or data obtained from cluster sampling. They are generally fit as parametric models, using maximum likelihood or Bayesian estimation. In the case where the errors are modeled as normal random variables, there is a close connection between mixed models and generalized least squares.[24] Fixed effects estimation is an alternative approach to analyzing this type of data.
Principal component regression (PCR)[7][8] is used when the number of predictor variables is large, or when strong correlations exist among the predictor variables. This two-stage procedure first reduces the predictor variables using principal component analysis, then uses the reduced variables in an OLS regression fit. While it often works well in practice, there is no general theoretical reason that the most informative linear function of the predictor variables should lie among the dominant principal components of the multivariate distribution of the predictor variables. Partial least squares regression is an extension of the PCR method which does not suffer from the mentioned deficiency.
Least-angle regression[6] is an estimation procedure for linear regression models that was developed to handle high-dimensional covariate vectors, potentially with more covariates than observations.
The Theil–Sen estimator is a simple robust estimation technique that chooses the slope of the fit line to be the median of the slopes of the lines through pairs of sample points. It has similar statistical efficiency properties to simple linear regression but is much less sensitive to outliers.[25]
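The Theil–Sen slope described above is short enough to write out in full. The sketch below computes the median of all pairwise slopes; taking the intercept as the median of $y_i - \text{slope} \cdot x_i$ is one common convention and is an assumption here, as are the function names and the data, which contain one gross outlier.

```python
from itertools import combinations
from statistics import median


def theil_sen(x, y):
    """Median-of-pairwise-slopes line fit; returns (intercept, slope)."""
    slopes = [
        (y2 - y1) / (x2 - x1)
        for (x1, y1), (x2, y2) in combinations(zip(x, y), 2)
        if x2 != x1  # skip vertical pairs
    ]
    slope = median(slopes)
    intercept = median(yi - slope * xi for xi, yi in zip(x, y))
    return intercept, slope


def ols_slope(x, y):
    """Ordinary least squares slope, for comparison."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    return sum((a - mx) * (b - my) for a, b in zip(x, y)) / sum((a - mx) ** 2 for a in x)


x = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0]
y = [1.0, 3.0, 5.0, 7.0, 9.0, 100.0]   # last point is an outlier from y = 1 + 2x

ts_intercept, ts_slope = theil_sen(x, y)   # unaffected by the single outlier
ls_slope = ols_slope(x, y)                 # pulled strongly upward by the outlier
```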
Other robust estimation techniques, including the α-trimmed mean approach, and L-, M-, S-, and R-estimators have been introduced.

Further discussion
In statistics and numerical analysis, the problem of numerical methods for linear least squares is an important one because linear regression models are one of the most important types of model, both as formal statistical models and for exploration of data sets. The majority of statistical computer packages contain facilities for regression analysis that make use of linear least squares computations. Hence it is appropriate that considerable effort has been devoted to the task of ensuring that these computations are undertaken efficiently and with due regard to numerical precision.
Individual statistical analyses are seldom undertaken in isolation, but rather are part of a sequence of investigatory steps. Some of the topics involved in considering numerical methods for linear least squares relate to this point. Thus important topics can be:
Computations where a number of similar, and often nested, models are considered for the same data set. That is, where models with the same dependent variable but different sets of independent variables are to be considered, for essentially the same set of data points.
Computations for analyses that occur in a sequence, as the number of data points increases.
Special considerations for very extensive data sets.
Fitting of linear models by least squares often, but not always, arises in the context of statistical analysis. It can therefore be important that considerations of computational efficiency for such problems extend to all of the auxiliary quantities required for such analyses, and are not restricted to the formal solution of the linear least squares problem.
Matrix calculations, like any others, are affected by rounding errors. An early summary of these effects, regarding the choice of computational methods for matrix inversion, was provided by Wilkinson.[26]

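Using linear algebra
It follows that one can find a "best" approximation of another function by minimizing the area between two functions, a continuous function $f$ on $[a,b]$ and a function $g \in W$, where $W$ is a subspace of $C[a,b]$:

$$\text{Area} = \int_a^b |f(x) - g(x)|\,dx,$$

all within the subspace $W$. Due to the frequent difficulty of evaluating integrands involving absolute value, one can instead define

$$\int_a^b [f(x) - g(x)]^2\,dx$$

as an adequate criterion for obtaining the least squares approximation, function $g$, of $f$ with respect to the inner product space $W$, where the inner product is $\langle u, v\rangle = \int_a^b u(x)v(x)\,dx$.
As such, $\|f - g\|^2$ or, equivalently, $\|f - g\|$, can thus be written in vector form:

$$\int_a^b [f(x) - g(x)]^2\,dx = \langle f - g, f - g\rangle = \|f - g\|^2.$$

In other words, the least squares approximation of $f$ is the function $g \in W$ closest to $f$ in terms of the inner product $\langle f, g\rangle$. Furthermore, this can be applied with a theorem:
Let $f$ be continuous on $[a,b]$, and let $W$ be a finite-dimensional subspace of $C[a,b]$. The least squares approximating function of $f$ with respect to $W$ is given by

$$g = \langle f, w_1\rangle w_1 + \langle f, w_2\rangle w_2 + \cdots + \langle f, w_n\rangle w_n,$$

where $B = \{w_1, w_2, \ldots, w_n\}$ is an orthonormal basis for $W$.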

Applications of linear regression
Linear regression is widely used in biological, behavioral and social sciences to describe possible relationships between variables. It ranks as one of the most important tools used in these disciplines.

Trend line
A trend line represents a trend, the long-term movement in time series data after other components have been accounted for. It tells whether a particular data set (say GDP, oil prices or stock prices) has increased or decreased over the period of time. A trend line could simply be drawn by eye through a set of data points, but more properly its position and slope are calculated using statistical techniques like linear regression. Trend lines typically are straight lines, although some variations use higher-degree polynomials depending on the degree of curvature desired in the line.
Using a trend line to argue that a particular action or event caused observed changes at a point in time is a simple technique, and does not require a control group, experimental design, or a sophisticated analysis technique. However, it suffers from a lack of scientific validity in cases where other potential changes can affect the data.

Epidemiology
Early evidence relating tobacco smoking to mortality and morbidity came from observational studies employing regression analysis. In order to reduce spurious correlations when analyzing observational data, researchers usually include several variables in their regression models in addition to the variable of primary interest. For example, suppose we have a regression model in which cigarette smoking is the independent variable of interest, and the dependent variable is lifespan measured in years. Researchers might include socio-economic status as an additional independent variable, to ensure that any observed effect of smoking on lifespan is not due to some effect of education or income. However, it is never possible to include all possible confounding variables in an empirical analysis. For example, a hypothetical gene might increase mortality and also cause people to smoke more. For this reason, randomized controlled trials are often able to generate more compelling evidence of causal relationships than can be obtained using regression analyses of observational data. When controlled experiments are not feasible, variants of regression analysis such as instrumental variables regression may be used to attempt to estimate causal relationships from observational data.

Finance
The capital asset pricing model uses linear regression as well as the concept of beta for analyzing and quantifying the systematic risk of an investment. This comes directly from the beta coefficient of the linear regression model that relates the return on the investment to the return on all risky assets.
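As a small illustration, the beta coefficient is the slope of the simple linear regression of the investment's returns on the market's returns, that is, their covariance divided by the variance of the market returns. The return series below are invented for the example (the asset is constructed with a beta of exactly 1.5), and the function name is an assumption.

```python
def beta_coefficient(asset_returns, market_returns):
    """Regression slope of asset returns on market returns."""
    n = len(market_returns)
    mean_asset = sum(asset_returns) / n
    mean_market = sum(market_returns) / n
    covariance = sum(
        (a - mean_asset) * (m - mean_market)
        for a, m in zip(asset_returns, market_returns)
    )
    variance = sum((m - mean_market) ** 2 for m in market_returns)
    return covariance / variance


# Hypothetical per-period returns.
market = [0.01, -0.02, 0.03, 0.00, 0.02]
asset = [0.001 + 1.5 * m for m in market]
beta = beta_coefficient(asset, market)
```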

Economics
Linear regression is the predominant empirical tool in economics. For example, it is used to predict consumption spending,[27] fixed investment spending, inventory investment, purchases of a country's exports,[28] spending on imports,[28] the demand to hold liquid assets,[29] labor demand,[30] and labor supply.[30]

Environmental science
The Environmental Effects Monitoring Program uses statistical analyses on fish and benthic surveys to measure the effects of pulp mill or metal mine effluent on the aquatic ecosystem.[31]

