@vtucode - in 21CS71 Module 5 PDF

The value of F calculated using the above formula is compared with the critical value of F for the given degrees of freedom. If the calculated F value equals or exceeds the critical value, then significant differences exist between the means of the samples. This reveals that the samples are not drawn from the same population, and thus the null hypothesis is rejected.
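A minimal sketch of this comparison, with toy samples assumed for illustration, using scipy's one-way ANOVA to obtain the F statistic and the F distribution to obtain the critical value:

```python
# Hypothetical samples; f_oneway computes the one-way ANOVA F statistic,
# and f.ppf gives the critical F value for the chosen significance level.
from scipy.stats import f_oneway, f

sample_a = [23, 25, 28, 30, 27]
sample_b = [31, 33, 35, 32, 34]
sample_c = [22, 24, 26, 23, 25]

f_stat, p_value = f_oneway(sample_a, sample_b, sample_c)

# Degrees of freedom: k - 1 between groups, N - k within groups
df_between, df_within = 3 - 1, 15 - 3
f_critical = f.ppf(0.95, df_between, df_within)  # 5% significance level

if f_stat >= f_critical:
    print("Reject the null hypothesis: the sample means differ significantly.")
else:
    print("Fail to reject the null hypothesis.")
```

Here the three group means clearly differ, so the calculated F exceeds the critical value and the null hypothesis is rejected.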

6.2.5.5 No Relationship Case


A statistical relationship is a dependence or association between two random variables, or bivariate data. Bivariate means 'two variables'; in other words, there are two sets of data. Relationships between variables need to be studied and analyzed before drawing conclusions based on them. One cannot determine the right conclusion or association when no relationship between the variables exists.
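The no-relationship case can be illustrated with a quick sketch: two independently generated random variables (toy data, seed assumed) have a sample correlation close to zero.

```python
# Two variables generated independently of each other: their sample
# correlation coefficient comes out near 0, indicating no association.
import numpy as np

rng = np.random.default_rng(42)
x = rng.normal(size=10_000)
y = rng.normal(size=10_000)   # generated independently of x

r = np.corrcoef(x, y)[0, 1]
print(round(r, 3))  # near 0: no linear association
```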

6.2.6 Correlation
Correlation analysis lets us find the association, or the absence of a relationship, between two variables, x and y. Correlation gives the strength of the relationship between the model and the dependent variable on a convenient 0-100% scale.
R-squared (R²) is a measure of correlation between the predicted values and the observed values of y. R-squared is a goodness-of-fit measure in a linear regression model. It is also known as the coefficient of determination. R² is the square of R, the coefficient of multiple correlation, and includes the additional independent (explanatory) variables in the regression equation.
Interpretation of R-squared The larger the R², the better the regression model fits the observations, i.e., the better the correlation. Theoretically, if a model explains 100% of the variance, then the fitted values are always equal to the observed values, and therefore all the data points would fall on the fitted regression line.
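A minimal sketch of this interpretation, with toy data assumed: R² is computed as the fraction of the variance in y explained by the fitted line.

```python
# R-squared = 1 - (residual sum of squares / total sum of squares).
# The toy data below are nearly linear, so R-squared comes out close to 1.
import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.1, 3.9, 6.2, 8.1, 9.8])

# Fit a straight line y_hat = a1*x + a0 by least squares
a1, a0 = np.polyfit(x, y, 1)
y_hat = a1 * x + a0

ss_res = np.sum((y - y_hat) ** 2)        # residual sum of squares
ss_tot = np.sum((y - np.mean(y)) ** 2)   # total sum of squares
r_squared = 1 - ss_res / ss_tot

print(round(r_squared, 4))  # close to 1.0: the line fits well
```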
Correlation differs from regression analysis. Regression analysis predicts the value of the dependent (response) variable based on the known value of the independent (predictor) variable, assuming a more or less mathematical relationship between two or more variables within the specified variances.

6.2.6.1 Correlation Indicators of Linear Relationships


Correlation is a statistical technique that measures and describes the 'strength' and 'direction' of the relationship between two variables. Let us explore the relationship between only two variables. The significant questions are:
Does y increase or decrease with x? For example, does expenditure increase with income, or does the number of patients decrease with proper medication? (Direction)
(i) Suppose y does increase with x; then, how fast?

(ii) Is this relationship strong?


(iii) Can reliable predictions be made? That is, if one tells the income, can the expenditure be
predicted?

Relationships and correlations enable training a model on sample data using statistical or ML algorithms. Statistical correlation is measured by the coefficient of correlation. The most common correlation coefficient is the Pearson product-moment correlation coefficient. It measures the strength of the linear association between variables. The correlation r between the two variables x and y is:

r = Σ (x_i − x̄)(y_i − ȳ) / [(n − 1) · s_x · s_y]

where n is the number of observations in the sample, x_i is the x value for observation i, x̄ is the sample mean of x, y_i is the y value for observation i, ȳ is the sample mean of y, s_x is the sample standard deviation of x, and s_y is the sample standard deviation of y. Summation is over all n values of i, i = 1, 2, ..., n. [r² is the square of the sample correlation coefficient between the observed outcomes and the observed predictor values, and includes the intercept on the y-axis in the case of linear regression.]
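The formula above can be sketched directly, with toy data assumed, and checked against NumPy's built-in correlation routine:

```python
# Computing Pearson r from the definition: sum of products of deviations,
# divided by (n - 1) times the two sample standard deviations.
import numpy as np

x = np.array([10.0, 20.0, 30.0, 40.0, 50.0])
y = np.array([12.0, 24.0, 33.0, 46.0, 55.0])

n = len(x)
sx, sy = x.std(ddof=1), y.std(ddof=1)    # sample standard deviations
r = np.sum((x - x.mean()) * (y - y.mean())) / ((n - 1) * sx * sy)

# The hand-computed value agrees with NumPy's correlation matrix entry
assert np.isclose(r, np.corrcoef(x, y)[0, 1])
print(round(r, 4))  # strong positive linear association (r close to +1)
```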
Use of Statistical Correlation Assume one sample dataset is (u₁, ..., u_n), containing n values of a parameter u, where u_i is the i-th data point in dataset u (i = 1, 2, ..., n). Another sample dataset is (v₁, ..., v_n), containing n values of a parameter v, where v_i is the i-th data point in dataset v. Let the correlation between the two samples be measured. The sample Pearson correlation metric c measures how well the two sample datasets fit on a straight line:

c = Σ (u_i − ū)(v_i − v̄) / √[Σ (u_i − ū)² · Σ (v_i − v̄)²]
where the summations are over the values of the parameters in the datasets. Three other similarity measures based on correlation are:
(i) Constrained Pearson correlation - a variation of Pearson correlation that uses the midpoint instead of the mean rate.
(ii) Spearman rank correlation - similar to Pearson correlation, except that the ratings are ranks.
(iii) Kendall's τ (tau) correlation - similar to the Spearman rank correlation, but instead of using the ranks themselves, only the relative ranks are used to calculate the correlation.
The numerical value of the correlation coefficient ranges from +1.0 to -1.0. It gives an indication of both the strength and direction of the relationship between variables.
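The rank-based measures can be sketched with scipy, using a toy monotonic but non-linear relationship (data assumed for illustration):

```python
# Spearman and Kendall work on ranks, so a perfectly monotonic
# relationship scores +1.0 even when it is not linear; Pearson r,
# which measures linear association only, stays below 1.0 here.
from scipy.stats import spearmanr, kendalltau, pearsonr

x = [1, 2, 3, 4, 5, 6]
y = [1, 8, 27, 64, 125, 216]   # y = x**3: monotonic, not linear

rho, _ = spearmanr(x, y)       # rank correlation: +1.0
tau, _ = kendalltau(x, y)      # relative-rank correlation: +1.0
r, _ = pearsonr(x, y)          # linear correlation: high, but below 1.0

print(rho, tau, round(r, 3))
```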
In general, a correlation coefficient r > 0 indicates a positive relationship; r < 0 indicates a negative relationship; and r = 0 indicates no relationship (the variables are independent of each other and not related). Here r = +1.0 describes a perfect positive correlation and r = -1.0 describes a perfect negative correlation.
The closer the coefficient is to +1.0 or -1.0, the greater is the strength of the relationship between the variables.
Table 6.1 The strength of the relationship as a function of r

Correlation is only appropriate for examining the relationship between meaningful quantifiable data (such as temperature, marks or scores) rather than categorical data, such as gender, colour, etc. Figure 6.4 shows perfect and imperfect linear positive and negative relationships, and the strength and direction of the relationship between variables.
Self-Assessment Exercise linked to LO 6.1

1. Define non-linear relation. Plot on the same graph a company's car sales for its two models every year between 2012 and 2017, using the formula y = a₀ + a₁·xp + a₂·xq². How will you predict the sales in 2010? Assume for the first model a₀ = 490, a₁ = 10 and a₂ = 5. Assume for the second model a₀ = 4900, a₁ = 100 and a₂ = 50. Assume xp = 0 for year 2011, xp = 1 for 2012, and xq = 6 for 2017.
2. How does P(x) vary in a normal distribution when the expected mean is at x = 6.0 and the standard deviation σ is 1.0? Show a plot of P(x) against x, with points at deviations of 1.0, 2.0 and 3.0 (i.e., at σ, 2σ and 3σ).
3. Define mean, variance and standard deviation. How are the 0ᵗʰ moment, 1ˢᵗ moment, 2ⁿᵈ moment and 3ʳᵈ moment computed from the values and their probabilities?
4. When will you perform t-test and F-test?
5. What does the variable R-squared mean? How is the correlation parameter between predicted values and observed values evaluated? When do you use R, r, R² and when r²?
6. Consider the correlation r between two variables. How do you interpret r > 0, r < 0 and r = 0?
7. How is the inference made that two variables do not correlate?

6.3 Regression Analysis


Correlation and regression are two analyses based on multivariate distribution. A multivariate distribution means a distribution in multiple variables. Suppose a company wishes to plan the manufacturing of Jaguar cars for the coming years. The company looks regressively at previous years' sales data. Regressive analysis means estimating relationships between variables. Regression analysis is a set of statistical steps which estimate the relationships among variables. Regression analysis may require many techniques for modeling and performing the analysis using multiple variables. The aim of the analysis is to find the relationship between a dependent (outcome, response) variable and one or more independent (predictor, explanatory) variables. Regression analysis facilitates prediction of future values of the dependent variable. It helps to find how the dependent variable changes when one independent variable in the set varies, while the remaining independent variables in the set are kept fixed.
The non-linear regression equation is as follows:

y = a₀ + a₁x + a₂x² + a₃x³

where the number of terms on the right-hand side is 3 or 4. Linear regression means only
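A minimal sketch of fitting such a non-linear (polynomial) equation, with coefficient values borrowed from Self-Assessment Exercise 1 (a₀ = 490, a₁ = 10, a₂ = 5) as toy data:

```python
# Fit y = a0 + a1*x + a2*x**2 by least squares with NumPy, then use the
# recovered coefficients to predict one step beyond the observed range.
import numpy as np

x = np.array([0, 1, 2, 3, 4, 5], dtype=float)
y = 490 + 10 * x + 5 * x ** 2          # generated from known coefficients

a2, a1, a0 = np.polyfit(x, y, 2)       # polyfit returns highest power first
assert np.allclose([a0, a1, a2], [490, 10, 5])

# Predict the value at x = 6 (one step beyond the data)
x_new = 6
print(a0 + a1 * x_new + a2 * x_new ** 2)  # 490 + 60 + 180 = 730.0
```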
