Professional Documents
Culture Documents
Learning to Predict
Student
Performance
adnaik.rohit@gmail.com
Linear regression-
First, the probability of “Pass” being 1 with the
Regression method takes a finite set relations
specified conditions must be calculated. This is
between dependent variable and independent
denoted by
variables, and creates a continuous function
generalizing these relations (Watt et al., 2016).
P(Pass=1 |GPA=3, Age=12)
Table 2 shows another data set containing
Using the Bayes' formula, this is equal to
information about students.
age passed
P(Pass=1|GPA=3, Age=12)= P(GPA=3,
14 1 Age=12| Pass=1) P(Pass=1) P(GPA=3,
13 0 Age=12) .
15 1 Using the chain rule of conditional probability,
Student data the first part of numerator can be expanded to
produce this equation:
For the sake of simplicity, the data has only one
independent variable. Figure 1 depicts a two
P(Pass=1|GPA=3, Age=12)= P(GPA=3|
dimensional graph that shows the relation
Pass=1) P(Age=12| Pass=1) P(Pass=1)
between the student age and the dependent P(GPA=3, Age=12)
variable indicating whether they have passed the
exam or not.
Now, the numerator can be calculated:
Actual True 7 59
Table . Confusion matrix for linear regression
used on the second raw data set.