27 views

Original Title: laine reed linreg project report

Uploaded by api-251525028

laine reed linreg project report

© All Rights Reserved

- Logistic Regression EBay
- Linear Regression and Correlation Analysis PPT @ BEC DOMS
- Sport Economics Paper
- Linear Regression and Corelation
- Piecewise Linear Regression Examples (Lesson 1) Truncated
- An Overview of Regression Analysis
- Unit 9 Regression SLM
- 2018 - User Violence and Psychological Well-being in Primary Health-Care Professionals
- Statistices
- LundholmPaper-forecasting sales with regression.pdf
- 15 Types of Regression You Should Know
- Optimization of Growth
- ch02_1
- HW 5, 448
- STAT 3008 Outline
- Effect of Running Speed and Leg Prostheses on Mediolateral Foot Placement and Its Variability.pdf
- Syllabus Business Analytics
- 3
- Unit-9.pdf
- Class 21 Regression

You are on page 1of 6

One day I was thinking about the future and where I would like to go to college, and a sudden question popped into my head why are there so many schools in California? Probably because there are so many people that live there, and since the weather in California is always perfect, no one ever leaves. So what about other states in the US? The state populations must affect in some way the number of colleges and universities present in that particular state. Based off of this sudden inquiry I decided to find the linear relation between state populations and number of colleges/universities in that state.

Population vs Colleges

1400 1200 Colleges in State 1000 800 600 400 200 0 0 10,000,000 20,000,000 30,000,000 40,000,000 Population in State Colleges

As shown in the scatterplot, the explanatory variable (x axis) of the relationship is the population, and the response variable (y axis) is the number of colleges. This relationship can safely be assumed this way because the number of colleges a specific state has can be

explained by the population. When put into a least squares regression line, this model can be written by the equation: = -7.966 + (2.89e-5)x This equation will give predicted values. In this situation, is equal to the predicted number of colleges, and x represents the population value. We know a linear model is appropriate for this data because the r-value, or correlation coefficient, is .95, meaning there is a very strong, positive linear association. In the context of this set of data, the slope shows that for every additional person added to a states population, there will be a 2.89e -5 increase in the number of colleges. The y-intercept, in this case -7.966, represents how many colleges there would be in a state of the population were zero. The r2 value, or coefficient of determination (.9025), shows that 90.25% of the variation of number of colleges can be explained by the regression line on population. As you can see on the scatterplot, there is one point (38041430, 1246) that is very far away from the others. Using the 1.5IQR outlier method for both the x and y variables, I found that this point is in fact an outlier in the data set. However, if it were to be removed from the data set, it would change the regression equation drastically and weaken the strong positive correlation. If this point were to be removed, the new equation would be = 27.273 + (2.385e5)x, the r-value would be 0.91, and the r2-value would be 0.83. Since the equation changed so much, this outlier can be considered an influential point. Below is a residual plot of the data. This shows how the actual data compares to the predicted data. The points along the graph are fairly scattered and many are relatively close to

the regression line. This means that the data can be represented linearly and that the actual data is very close to the predicted data.

Residual Plot

200 150 100 Residuals 50 0 -50 0 -100 -150 -200 Population 10,000,000 20,000,000 30,000,000 40,000,000

To find out if the least squares regression equation is really accurate and can be used reasonably to predict the number of colleges in a state, we will use the equation to predict how many colleges Colorado has (assuming we dont already know). = -7.966 + (2.89e-5)(5,187,582) = 141.96

This prediction shows fairly accurate, the actual number of colleges in Colorado being 171. This makes the residual approximately 29, meaning the least squares regression line underestimates the number of colleges per state. This is pretty good considering the large range for the variable and shows that the least squares regression line is a good method of predicting colleges. This information could be helpful to many people. It would especially be helpful for people who work as professional grant writers. Grant Writers write to the government and request to have money loaned to certain colleges and institutions for research. There are more

than one grant writer in each state, and they could all use this information to find which states are in need of more college funding, and how many colleges they could loan money to. This is a very important career because without these people writing grants, many colleges and universities would not have the necessary funding for certain types of research. Population vs. college count results in a very strong linear relationship, and almost fits its predicted values. Its linear because as the population count rises, so does the college count. This increase is a continuous one. If a single states population skyrockets, then the state will probably build new learning institutions to cater to the sudden influx of people. A linear regression, not any other type of regression, is the most appropriate for this model of data.

Works Cited

"Colleges and Universities in US by State/ Possession." US Colleges and Universities Directory. N.p., n.d. Web. 2 Oct. 2013. "Top 50 Cities in the U.S. by Population and Rank | Infoplease.com." Infoplease: Encyclopedia, Almanac, Atlas, Biographies, Dictionary, Thesaurus. Free online reference, research & homework help. | Infoplease.com. Pearson Education, n.d. Web. 2 Oct. 2013.

- Logistic Regression EBayUploaded byPrateek Shukla
- Linear Regression and Correlation Analysis PPT @ BEC DOMSUploaded byBabasab Patil (Karrisatte)
- Sport Economics PaperUploaded bycsdh09
- Linear Regression and CorelationUploaded bySally Goodwill
- Piecewise Linear Regression Examples (Lesson 1) TruncatedUploaded byStanley Yong
- An Overview of Regression AnalysisUploaded byMichael Glynn
- Unit 9 Regression SLMUploaded bymunmun8327
- 2018 - User Violence and Psychological Well-being in Primary Health-Care ProfessionalsUploaded byjarh358
- StatisticesUploaded byMehnaz Tabassum Shanta
- LundholmPaper-forecasting sales with regression.pdfUploaded byHåvard Nilsson
- 15 Types of Regression You Should KnowUploaded byRashidAli
- Optimization of GrowthUploaded bysiamak77
- ch02_1Uploaded byMubashir Ali Khan
- HW 5, 448Uploaded bypdrogos02
- STAT 3008 OutlineUploaded byEmerald Lam
- Effect of Running Speed and Leg Prostheses on Mediolateral Foot Placement and Its Variability.pdfUploaded byMitu Leonard-Gabriel
- Syllabus Business AnalyticsUploaded byAbhishek Kumar
- 3Uploaded byElana agrhy
- Unit-9.pdfUploaded byswingbike
- Class 21 RegressionUploaded byUno de Madrid
- session3demandestUploaded bySumeet Sharma
- The Steps to Follow in a Multiple Regression AnalysisUploaded byAvinash Acharya
- 1Uploaded byCeci_Sunshine
- Estimation of Transmission Line Parameters from Historical Data.pdfUploaded bylutfi
- Arima TestUploaded byAbhishek Prasad
- Mukhyi, Nurul & Dwi AsihUploaded bypatamxitin
- statistikUploaded byMuhammad Nuzuluddin
- Lect 7Uploaded bySoumitra Chakraborty
- Simulating the Concentration of Some Heavy Metals in Mista Ali River Mining Pond in Jos NigeriaUploaded byIJSTR Research Publication
- Topic 3 EconometricUploaded byLim Son Eng

- Module in Math iUploaded byBhel Lyn
- Probability TermsUploaded byMatt Gallion
- AbstractUploaded bydwi wahyuningtyas
- Introduction to Econometrics- Stock & Watson -Ch 4 Slides.docUploaded byAntonio Alvino
- SL Math Trig Practice Sheet 1Uploaded bynora
- A Common Fixed Point Theorem for Two Random Operators Using Random Mann Iteration SchemeUploaded byAlexander Decker
- Polaris Process_IT SectorUploaded byalmighty777
- ParabolaUploaded byalinahmohdnorazmie
- National Curriculum FrameworkUploaded bySwami Gurunand
- O'Connor -- The Birthday Paradox and Random Number GenerationUploaded byDerek O'Connor
- Turing VariationsUploaded bynomore891
- UF Sparse Matrix Collection - HB GroupUploaded bythecodefactory
- 2-59Uploaded byJoanna Linette
- Exercises 2Uploaded byabe97
- Optimum Design of a Condenser.pdfUploaded byaminiaan
- MA2265 DM(1)Uploaded bysr71919
- 6 1 Maxima and MinimaUploaded bySebastian Garcia
- Outwp Addwt Amt VwtclUploaded byrfs_2008
- Exercises, LE2Uploaded byLenard Santiago Punzal
- Parametric Study of Charging Inlet Part2Uploaded bymayurghule19
- Sequences Achen UniversityUploaded bypgolan
- Elementary School Math sample testUploaded byHonolulu Star-Advertiser
- Un paseo por el modelo GARCH y sus variantes.pdfUploaded byCarlos Peralta
- A Simple Gregorian Calendar Algorithm Based Upon Single-digit NumbersUploaded bySalisu Borodo
- math_anxiety_material.pdfUploaded byChristian Calma
- bpj lesson 6Uploaded byapi-307094748
- Special Topics In One-Dimensional Quantum Mechanics: Selected Exercises In Spatial and Momentum TranslationsUploaded bySpiros Konstantogiannis
- ex5Uploaded byleloiboi
- Reissner-Nordstrom metric - Gulmammad Mammadov.pdfUploaded byguignan
- 2nd Directional DerivativeUploaded byPrashant Kochar