Professional Documents
Culture Documents
ANALYTICS SUBMITTED BY
ALEN GEORGE
USING S3 MCA
MACHINE ROLL NO: 2
LEARNING
CONTENTS
⮚ INTRODUCTION
⮚ LITERATURE REVIEW
⮚ ALGORITHMS
⮚ DATASET
⮚ PROJECT PIPELINE
⮚ FEASIBILITY ANALYSIS
⮚ SYSTEM ENVIRONMENT
⮚ MODEL PLANNING
⮚ MODEL TRAINING
⮚ TESTING
⮚ RESULT AND DISCUSSION
⮚ MODEL DEPLOYMENT
⮚ GIT HISTORY
⮚ CONCLUSION
⮚ REFERENCES
INTRODUCTION
• In paper [1] the paper suggests that Student performance prediction is very important to understand a
student's progress rate. To predict the student’s performance, they begin by collecting data sets in order to
anticipate the students' performance. As a result, they attempted to gather students' class test, attendance,
presentation, assignment, midterm, and final examination marks. For the greatest accuracy rate, they
propose using K-Nearest Neighbors and Decision Tree Classifier. This proposed model outperforms
Student Performance across three semesters. The training and testing sets provide optimum results and
event accuracy. Finally, they are able to get the best results and accuracy with the K-Nearest Neighbors,
Decision Tree Classifier model with 89.74 percent & 94.44 percent accuracy.
2. Hina Gull;Madeeha Saqib;Sardar Zafar Iqbal;Saqib Saeed , Improving Learning Experience of Students
by Early Prediction of Student Performance using Machine Learning 2020 IEEE International Conference for
Innovation in Technology (INOCON)
• In paper[2] They developed a model to predict the grades of students taking the same course in the next
term using logistic regression, linear discriminant analysis, K-nearest neighbours, classification and regression
trees, gaussian Naive Bayes, and support vector machines on historical data of student grades in one of the
undergraduate courses Their investigations reveal that linear discrimination analysis is the most effective
method for accurately predicting students' final exam results. Out of a total of 54 records, the model correctly
predicted 49, resulting in an accuracy of 90.74 percent. They evaluated the accuracy of students' success using
historical course data.
• .
3. Chew Li Sa;Dayang Hanani bt. Abang Ibrahim;Emmy Dahliana Hossain;Mohammad bin Hossin ,
Student performance analysis system (SPAS), The 5th International Conference on Information and
Com munication Technology for The Muslim World (ICT4M)
• This paper[3] attempted to provide a prediction of students' examination performance. They used
WEKA to examine the feasibility of linear regression and multilayer perceptron in terms of accuracy,
performance, and error rate. The classification techniques, linear regression, multilayer perceptron and
support vector machine are used to forecast final examination results Then, using the value of mean
absolute error difference, they are compared . According to the findings,support vector machine has the
highest accuracy with 94.88%
• From the above three papers, we get to know that different approaches are used for students performance
analysis. First paper use KNN and Decision Tree Classifier. This proposed model outperforms Student
Performance across three semesters. The training and testing sets provide optimum results and event
accuracy. In the second paper they developed a model to predict the grades of students taking the same
course in the next term using logistic regression, linear discriminant analysis, K-nearest neighbours,
classification and regression trees, gaussian Naive Bayes, and support vector machines on historical data of
student grades in one of the undergraduate courses. In the third paper, They used WEKA to examine the
feasibility of linear regression and multilayer perceptron in terms of accuracy, performance, and error rate.
According to the findings,support vector machine has the highest accuracy with 94.88%.
ALGORITHMS
• THE MODEL IS TRAINED USING SVM. ONCE THE TRAINING IS COMPLETED, MAKE
PREDICTIONS USING SVM AND THE ACCURACY SCORE AND CLASSIFICATION
REPORT GENERATED BY THE ALGORITHM IS DISPLAYED IN TEXTUAL FORMAT.
• SVM IS A GOOD ALGORITHM TO EXTRACT THE BINARY CLASS BASED ON THE
DATA GIVEN TO THE MODEL. IN THE PROPOSED MODEL, THE WORK IS TO
CLASSIFY THE ARTICLE IN TWO CATEGORIES PAGE | 13 EITHER TRUE OR FALSE.
A SUPPORT VECTOR MACHINE (SVM) IS A SUPERVISED MACHINE LEARNING
ALGORITHM THAT CAN BE USED FOR BOTH REGRESSION AND
CLASSIFICATION PURPOSES. IT IS BASED ON THE IDEA OF FINDING THE
HYPERPLANE THAT BEST DIVIDES THE DATASET INTO TWO CLASSES. HYPER-
PLANES ARE DECISION BOUNDARIES THAT HELP THE MACHINE LEARNING
MODEL CLASSIFY THE DATA OR DATA POINTS.
DATASET
• Technical Feasibility: The application is technically feasible because all the technical resources
required for the development and working of the application is easily available and reliable.
The codes are written in Google Colab, therefore all the libraries will be available, no need to
install or import each of those
• Economic Feasibility: The code is working on Google Colab .So the colab consumes an
amount of internet. The development of the system will not need a huge amount of money. It
will be economically feasible.
• Operational Feasibility:Since the code is written on Google Colab, no need for worrying about
importing or installing the libraries required. There is no need of skill for a new user to open
this application and use it
System Environment
• Software Environment: Various software used for the development of this application are the
following :
• P a n d a s , P y t h o n , M a t p l o t l i b , N u m p y, Vi s u a l S t u d i o , H T M L & C S S , F l a s k ,
GitHub
• Hardware Environment:
• The Model used is SVM and trained using the training datas
Testing
Results and Discussion
• The calculated accuracy for SVM is 97.5. The saved model model.pkl was loaded by
importing pickle package.
Model Deployment
GIT HISTORY
CONCLUSION