You are on page 1of 18

VELAGAPUDI RAMAKRISHNA SIDDHARTHA

ENGINEERING COLLEGE
DEPARTMENT OF INFORMATION TECHNOLOGY

University Admission Prediction

Presented by:
Ch. Sai Sudha(188W1A1270)
Abstract:

In the present time there are plenty of scholars seeking after their instruction away from
their nations of origin.Also student admission problem is very important in educational
institutions.This paper addresses machine learning models to predict the chance of a
student to be admitted to a master’s program. This will assist students to know in advance
if they have a chance to get accepted.The machine learning models that are applied in this
project multiple linear regression, random forest. Accuracy of this models are compared
and considered Multiple Linear regression as it gave high accuracy.
Problem Statement

Graduate Admission Prediction helps the students to know about what is the chance of
getting admission in reputed Foreign Universities. In order to do this, the proposed work
uses multiple machine learning algorithms like Multilinear Regression, Random Forest
and develop a model which predicts the chance of getting admission in reputed university.
Literature Survey

Publication details Dataset/sample details Remarks

S.Sujay(IJTRD) Dataset is collected from kaggle Linear regression is the only


August 2020 algorithm used which
provides an accuracy of 82%
only

M.S.Acharya A. Armaan, and Admission dataset Gradient booster regression


A. S. Antony(ICCIDS) and linear regression are
December 2020 used.R^2 score is used as
metric

J.Jamison(SIGCSE) Davidson College dataset An accuracy of 82% is


March 2017 achieved in this paper.
Dataset:
The dataset we have used for this project is a csv file in which the data is relevant, high
quality, and is in adequate volume.

Attributes in dataset:

1. GRE Scores ( out of 340 )


2. TOEFL Scores ( out of 120 )
3. University Rating ( out of 5 )
4. Statement of Purpose and Letter of Recommendation Strength ( out of 5 )
5. Undergraduate GPA ( out of 10 )
6. Research Experience ( either 0 or 1 )
7. Chance of Admit ( ranging from 0 to 1 )
Software Libraries
Numpy:

Numpy is a Python package that stands for Numerical Python.It is a Python library
used for working with arrays.It also has functions for working in domain of linear
algebra, fourier transform, and matrices.

Pandas:

Pandas is a Python library used for working with data sets.It has functions for
analyzing, cleaning, exploring, and manipulating data.

MatplotLib:

Matplotlib is a low level graph plotting library in python that serves as a visualization
utility.
Sklearn:

Scikit-learn is probably the most useful library for machine learning in Python. The
sklearn library contains a lot of efficient tools for machine learning and statistical
modeling including classification, regression, clustering and dimensionality

Seaborn:

It is a data visualization library built on top of matplotlib and closely integrated with
pandas data structures in Python.
Architecture Diagram
Algorithms

1. Multi Linear Regression:


Step-1: Data Pre-Processing

1. Importing The Libraries.

2. Importing the Data Set.

3. Encoding the Categorical Data.

4. Avoiding the Dummy Variable Trap.

5. Splitting the Data set into Training Set and Test Set.

Step-2: Fitting Multiple Linear Regression to the Training set

Step-3: Predicting the Test set results.


s 2. Random Forest Regression:
Step-1: Select random K data points from the training set.

Step-2: Build the decision trees associated with the selected data points (Subsets).

Step-3: Choose the number N for decision trees that you want to build.

Step-4: Repeat Step 1 & 2.

Step-5: For new data points, find the predictions of each decision tree, and assign the new data points to
the category that wins the majority votes.
Implementation

1.Collecting dataset

2.Data Pre-processing

3.Dividing data into training data and testing data

4.Building the model with training data

5.Predicting the test results

.
Data Visualizations
● The heat map tells us about the

correlation between columns


● Two attributes are highly

related if the color is dark


University rating vs chance of admit GRE Score vs chance of admit
TOEFL Score vs chance of admit CGPA Score vs chance of admit

s
Results

The following figure describes the comparison of accuracies obtained from different
classification algorithms.After comparing the accuracies of the algorithms, we came to
conclude that multiple linear regression gives the better results when compared to other
algorithms with accuracy of 81%.
Conclusion

Hence, from the comparison between four ML models, it is clear that the accuracy of
MLR(Multiple linear regression) has the highest accuracy.

The goal of research will be accomplished successfully, as the system allows students to
save a lot of time and money that they would spend on educational mentors and
application fees for colleges where they have less chances of getting admissions. Finally,
students can have an open-source machine Learning model which will help the students to
know their chance of admission into a particular university with high accuracy.
References
[1] S. Sujay, “Supervised Machine Learning Modelling & Analysis for Graduate Admission Prediction,” vol.
7, no. 4, pp. 5–7, 2020

[2] M. S. Acharya, A. Armaan, and A. S. Antony, “A comparison of regression models for prediction of
graduate admissions,” ICCIDS 2019 - 2nd Int. Conf. Comput. Intell. Data Sci. Proc., pp. 1–5, 2019

[3] Data Visualizaton, Machine Learning


https://www.analyticsvidhya.com/blog/2017/09/common-machin e-learning-algorithms/
[4] Jamison, J. (2017). Applying Machine Learning to Predict Davidson College’s Admissions Yield, pp.
765-766

[5] Mishra, S. and Sahoo, S. (2016). A Quality-Based Automated Admission System for Educational
Domain, pp. 221 - 223.

You might also like