Professional Documents
Culture Documents
Abstract:
2. EXISTING SYSTEM
In fact, recent studies show that only 50 of the more than 580 public four-year
institutions in the United States have on-time graduation rates at or above 50
percent for their full-time students . To make college more affordable, it is thus
crucial to ensure that many more students graduate on time through early
interventions on students whose performance will be unlikely to meet the
graduation criteria of the degree program on time. A critical step towards
effective intervention is to build a system that can continuously keep track of
students’ academic performance and accurately predict their future
performance, such as when they are likely to graduate and their estimated final
GPAs, given the current progress. Although predicting student performance has
been extensively studied in the literature, it was primarily studied in the
contexts of solving problems in Intelligent Tutoring Systems (ITSs).
SOFTWARE REQUIREMENTS
HARDWARE REQUIREMENTS
[5] H.-F. Yu, H.-Y. Lo, H.-P. Hsieh, J.-K. Lou, T. G. McKenzie, J.-W. Chou, P.-
H. Chung, C.-H. Ho, C.-F. Chang, Y.-H. Wei et al., “Feature engineering and
classifier ensemble for kdd cup 2010,” in Proceedings of the KDD Cup 2010
Workshop, 2010, pp. 1–16.
[6] Z. A. Pardos and N. T. Heffernan, “Using hmms and bagged decision trees to
leverage rich features of user and skill from an intelligent tutoring system dataset,”
Journal of Machine Learning Research W & CP, 2010.
[7] Y. Meier, J. Xu, O. Atan, and M. van der Schaar, “Personalized grade
prediction: A data mining approach,” in Data Mining (ICDM), 2015 IEEE
International Conference on. IEEE, 2015, pp. 907–912.
[8] C. G. Brinton and M. Chiang, “Mooc performance prediction via clickstream
data and social learning networks,” in 2015 IEEE Conference on Computer
Communications (INFOCOM). IEEE, 2015, pp. 2299– 2307.
[12] Y.-h. Wang and H.-C. Liao, “Data mining for adaptive learning in a tesl-
based e-learning system,” Expert Systems with Applications, vol. 38, no. 6, pp.
6480–6485, 2011.
Unauthorized user
Yes Check NO
Preprocess Dataset
Feature Extraction
Model Generation
Matrix Factorization
Predict Performance
End process
GOALS:
The Primary goals in the design of the UML are as follows:
1. Provide users a ready-to-use, expressive visual modeling Language so that
they can develop and exchange meaningful models.
2. Provide extendibility and specialization mechanisms to extend the core
concepts.
3. Be independent of particular programming languages and development
process.
4. Provide a formal basis for understanding the modeling language.
5. Encourage the growth of OO tools market.
6. Support higher level development concepts such as collaborations,
frameworks, patterns and components.
7. Integrate best practices.
The class diagram is used to refine the use case diagram and define a detailed design of
the system. The class diagram classifies the actors defined in the use case diagram into a set of
interrelated classes. The relationship or association between the classes can be either an "is-a" or
"has-a" relationship. Each class in the class diagram may be capable of providing certain
functionalities. These functionalities provided by the class are termed "methods" of the class.
Apart from this, each class may have certain "attributes" that uniquely identify the class.
Object diagram:
The object diagram is a special kind of class diagram. An object is an instance of a class.
This essentially means that an object represents the state of a class at a given point of time while
the system is running. The object diagram captures the state of different classes in the system and
their relationships or associations at a given point of time.
State diagram:
A state diagram, as the name suggests, represents the different states that objects in the
system undergo during their life cycle. Objects in the system change states in response to events.
In addition to this, a state diagram also captures the transition of the object's state from an initial
state to a final state in response to events affecting the system.
Activity diagram:
The process flows in the system are captured in the activity diagram. Similar to a state
diagram, an activity diagram also consists of activities, actions, transitions, initial and final
states, and guard conditions.
Sequence diagram:
A sequence diagram represents the interaction between different objects in the system. The
important aspect of a sequence diagram is that it is time-ordered. This means that the exact
sequence of the interactions between the objects is represented step by step. Different objects in
the sequence diagram interact with each other by passing "messages".
Collaboration diagram:
A collaboration diagram groups together the interactions between different objects. The
interactions are listed as numbered interactions that help to trace the sequence of the interactions.
The collaboration diagram helps to identify all the possible interactions that each object has with
other objects.
4.3 IMPLEMENTATION:
MODULES:
Upload UCLA Students Dataset
Preprocess Dataset
Feature Extraction
Model Generation
Matrix Factorization
Run SVM Algorithm
Run Random Forest Algorithm
Run Logistic Regression Algorithm
Propose Ensemble based Progressive Prediction (EPP)
Algorithm
Predict Performance
Mean Square Error Graph
To implements this paper author has face 3 problem and gave solution to all 3
problems
1) All students will be differ in selected courses as different students may take
different courses then how we can build machine learning model to predict
performance of particular student in particular course. To solve this issue
author is using Clustering concept using Matrix Factorization. In matrix
factorization we will form cluster of related courses and then in matrix we
will put value 1 (or marks obtained by students in term) if user selected
course otherwise we will put value 0. So by forming this cluster we will
have matrix feature vector which contains similar courses student’s data.
This matrix features will be passed to machine learning to train model and
to predict future performances. A single course consists of many subjects
and all those subjects marks will be used to calculate GPA in all terms. Past
course GPA and marks will be input to machine learning algorithms to
predict future course performance.
2) Courses are not equally informative formaking accurate predictions and by
forming feature vector we will get prediction for selected course only.
3) All existing algorithms were concentrating on past data to predict future
performance but in this paper we will used past data as well as on going
course performance data to predict future course GPA. Students’ evolving
progressneeds to be incorporated into the prediction.
To implement this paper author using base classifiers such as Random Forest,
SVM, Logistic Regression or KNN. The prediction results of base classifier will be
pass to ensemble classifier to predict better results for ongoing courses and
future courses.
To implement this project author using student performance dataset from UCLA
University and this dataset saved inside dataset folder. Below are some example
of dataset records.
2.LITERATURE SURVEY
2.2 Addressing the assessment challenge with an online system that tutors as it
assesses
ABSTRACT: Secondary teachers across the United States are being asked to
use formative assessment data (Black & Wiliam, 1998a, 1998b; Roediger &
Karpicke, 2006) to inform their classroom instruction. At the same time, critics
of US government's No Child Left Behind legislation are calling the bill "No
Child Left Untested". Among other things, critics point out that every hour
spent assessing students is an hour lost from instruction. But, does it have to be?
What if we better integrated assessment into classroom instruction and allowed
students to learn during the test? We developed an approach that provides
immediate tutoring on practice assessment items that students cannot solve on
their own. Our hypothesis is that we can achieve more accurate assessment by
not only using data on whether students get test items right or wrong, but by
also using data on the effort required for students to solve a test item with
instructional assistance. We have integrated assistance and assessment in the
ASSISTment system. The system helps teachers make better use of their time
by offering instruction to students while providing a more detailed evaluation of
student abilities to the teachers, which is impossible under current approaches.
Our approach for assessing student math proficiency is to use data that our
system collects through its interactions with students to estimate their
performance on an end-of-year high stakes state test. Our results show that we
can do a reliably better job predicting student end-of-year exam scores by
leveraging the interaction data, and the model based on only the interaction
information makes better predictions than the traditional assessment model that
uses only information about correctness on the test items.
2.4 Mooc performance prediction via clickstream data and social learning
networks
To run this project double click on ‘run.bat’ file to get below screen
In above screen click on ‘Upload UCLA Students Dataset’ button to upload dataset
In above screen we can see all records converted to feature vector and in above
screen in first 3 lines we can see from above matrix application using 61 records
to train machine learning model and 16 records to test accuracy or to calculate
Mean Square Error of classifier. If algorithm prediction result is high then accuracy
will be more and Mean Square Error (MSE) will be less. Now we got matrix and
data to train and test classifier. Now click on ‘Run SVM Algorithm’ to train SVM
classifier and to get it accuracy and MSE value
In above screen SVM MSE is 56%. Now click on ‘Run Random Forest Algorithm’ to
generate training model using Random Forest and to get it accuracy and MSE
In above screen random forest got 43% MSE and now click on ‘Run Logistic
Regression Algorithm’ button to get it accuracy and MSE
In above screen logistic regression got 43% MSE and now click on ‘Propose
Ensemble-based Progressive Prediction (EPP) Algorithm’ button to generate
model using propose EPP algorithm and to get it accuracy and MSE
In above screen EPP propose algorithm got 37% MSE and now click on ‘Predict
Performance’ button to upload student on going test marks and to predict GPA
for future course
In above screen uploading new student records as test file and below are the
prediction results
In above screen in square brackets are the student marks of ongoing subjects and
this marks are converted to matrix factorization and then applied on EPP train
model to predict GPA as LOW or HIGH. In above screen after each test record I am
displaying predicted result value. Now click on “Mean Square Error Graph’ button
to get below graph
In above graph x-axis represents algorithm name and y-axis represents MSE
(mean square error). From above graph we can see propose algorithm got less
MSE error and has high accuracy compare to other algorithms. From above graph
we can conclude that propose EPP is better in prediction compare to other
algorithms