Professional Documents
Culture Documents
Data Mining Project Proposal
Data Mining Project Proposal
INTRODUCTION
Campus placement is a crucial process for both students and colleges, as it determines the career
prospects and reputation of both parties. However, predicting whether a student will be placed in a
company or not is not an easy task, as it depends on various factors such as academic performance,
personal background, work experience, etc. Moreover, the placement scenario may vary across
different regions and sectors of the economy. Therefore, it is important to develop a data-driven
approach to analyze the factors that influence campus placement and to build predictive model that
can help students and colleges to improve their outcomes.
LITERATURE REVIEW
Several studies have been conducted in the past on predicting campus placements using data mining
techniques. Data mining is the process of extracting useful information and patterns from large
datasets using various methods such as classification, clustering, association rule mining, etc. Data
mining can help to identify the key variables that affect campus placement and to generate rules or
models that can classify or predict the placement status of students.
For instance, Smith (2010) proposed a placement prediction model that predicts the chance of an
undergraduate student getting a job in the placement drive. The model used a decision tree algorithm
to classify students into placed or not placed categories based on their academic and personal
attributes. The model achieved an accuracy of 85% on a sample of 500 students from a university in
the US.
Similarly, Kumar et al. (2015) analyzed different data mining techniques and implemented them to
enhance prediction for campus placement in any higher education institute. They compared the
performance of four techniques: logistic regression, k-nearest neighbor, support vector machine, and
artificial neural network on a dataset of 1000 students from an engineering college in India. They
found that artificial neural networks performed the best with an accuracy of 92%.
RESEARCH QUESTIONS
The following research questions will be addressed in this project:
• What are the factors that influence campus placements in India?
• Which data mining techniques are best suited for predicting campus placements in India?
• How accurate are the predictions made by these techniques?
1
PROPOSED RESEARCH METHODOLOGY
The proposed methodology for this project is as follows:
1. Data preprocessing: Cleaning and transforming the raw data into a format suitable for analysis.
2. Exploratory Data Analysis (EDA): Analysing the data to identify patterns, trends, and
relationships.
3. Feature Selection: Identifying the most relevant features that contribute to campus placements.
4. Model Selection: Comparing different data mining techniques and selecting the best one based on
performance metrics.
5. Model Evaluation: Evaluating the performance of the selected model using appropriate metrics.
IMPLICATIONS
The implications of this project are as follows:
• Students can use this prediction to prepare themselves better for the recruitment process.
• Colleges can use this prediction to improve their placement statistics.
• Companies can use this prediction to identify potential candidates for recruitment.
REFERENCES
Smith, J. (2010). A placement prediction model for undergraduate students. Southern Illinois
University.
Kumar, A., Singh, S., & Gupta, R. (2015). Enhancing prediction for campus placement using data
mining techniques. International Journal of Scientific Research, 4(2), 1-4.
PROPOSED BY
CHAITANYA TANDON 2110110171
TEJANSH SACHDEVA 2110110555
MITAALI SINGHAL 2110110883