Professional Documents
Culture Documents
Prof. D. M. Padulkar
(Assistant Professor)
Baramati-413133
1
2020-2021
Vidya Pratishthan’s
Kamalnayan Bajaj Institute of Engineering and Technology, Baramati
Certificate
This is to certify that following students
Ms. Priti Sonawane 2041066
Ms. Anjali Shingavi 2041065
Principal
(Dr. R.S. Bichkar)
2
Internal Examiner External Examiner
Acknowledgments
We are extremely grateful to Dr. R. S. Bichkar, Principal, VPKBIET and Dr. S. K. Shinde HOD, Head of
Department, Department of Computer Engineering, for providing all the required resources for the
successful completion of our mini project.
We express with Our heartfelt gratitude to our project guide Prof. Digambar Padulkar, Assistant
Professor, Computer Engineering, for his valuable suggestions and guidance in the preparation of the
project report.
We will be failing in duty if we do not acknowledge with grateful thanks to the authors of the
references and other literatures referred to in this project.
Last but not the least; we are very much thankful to our parents who guided us in every step which we
took.
Ms.Priti Sonawane
Ms. Anjali Shingavi
VPKBIET, Baramati
Abstract
The Indian Premier League (IPL) is a professional Twenty20 cricket league in India
contested during March or April and May of every year by eight teams representing eight different
cities in India. The league was founded by the Board of Control for Cricket in India (BCCI) in
2008. The IPL has an exclusive window in ICC Future Tours Programme.
3
Contents
Acknowledgments 3
Abstract 4
1 Introduction 6
1.1 Overview . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
1.2 Brief Description. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
1.3 Problem Definition . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
2 Literature Survey 8
3 Dataset Description 9
3.1 Introduction. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
3.1.1 Purpose . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
3.1.2 Project Scope. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
4 Data Pre-processing 11
5 Classification 13
5.1 Total number of matches till 2020 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
5.2 Total number of matches win by each team . . . . . . . . . . . . . . . . . . . . . . . . . .
.........
5.3 Toss or win ratio. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
5.4 success rate of winning matches…………………………………
7 Bibliography 22
4
Introduction
1.1 Overview
This section of this report provides overall description of the project, including Brief
description and Problem definition. Second section contains literature survey. Third section
contains dataset description including Purpose, Project Scope, Design and Implementation
Constraints. Section four contains Data prepossessing. Fifth section contains classification.
Sixth section contains confusion matrix including Analysis of Confusion Matrix and Comparison
of Classifiers. Section seven contains Result Analysis and last section contains Conclusion and
Future.
5
2
Literature Survey
A. Describes about significant challenges that we face for accurate prediction including the
various parameters which affect the outcome of the match. The ball movement gets changed
from every over, so it is considered being important to predicting the outcome of each match
on every ball. Here they had developed a model that predicts the match result of every ball
played.
B. Explains about the concept of identifying rising stars in cricket domain using some
techniques. Rising stars can be predicted by both bats as well as bowling teams. Distinct
features like concept of co-players, team and opposite teams are presented with their
mathematical formulation.
C. Explained the outcome of ODI match depends on various factors. The list of key features
is home-field advantages, winning the toss, game plan, venue and season. Logistic
Regression, SVM are the different types of algorithm used for model building. Logistic
Regression is applied for data that had been already obtained from previous matches.
SVM used for predictive analysis. It was found that SVM was proved to be a better model
based on both the parameters used to predict accuracy and model outcome .
6
Dataset Description
3.1 Introduction
3.1.1 Purpose
The purpose of this report is to provide a detailed overview of our project, its
parameters and goals. The main purpose of this project is basic exploratory data analysis and feature
analysis.
7
Data Pre-processing
Data Cleaning
8
9
10
11
12
13
Classification
Classification is the process of predicting the class of given data points. It belongs to the supervised
learning category where the targets provided with the input data. classes are sometimes called as
labels or categories.
14
2. Total number of matches win by Each Teams
15
3. Toss or Win ratio
16
4. Success rate of winning matches
17
18
5. Most titles win
19
6. Top 10 Players with Most Runs
20
8. Do you know who won the most player of the match?
21
9. Top 10 cities by number of matches
22
23
Conclusion and Future Work
This project has intended on analyzing the results of the IPL match during the year 2008-
2019 by applying the data mining algorithms for existing data, and predicted the new data
for the year 2020 and applied data mining algorithms for the proposed data. The
Implementation tools used are Rapid Miner Studio Version-9.3.This knowledge will be
used in future to predict the winning team. Hence using this prediction, the best team can
be formed.
24
7 Bibliography
[1] Abdullah, F.S., N.S.A. Manan, A. Ahmad, S.W. Wafa and M.R. Shahril et al., 2016.
techniques for classification of childhood obesity among year 6 school children. DOI:
10.1007/978-3-319-51281-5_47
[2] Adnan, M.H.B.M. and W. Husain, 2012. A hybrid approach using Naïve Bayes and
genetic algorithm for childhood obesity prediction. DOI:
10.1109/ICCISci.2012.6297254
[3] Zhang, S., C. Tjortjis, X. Zeng, H. Qiao and I. Buchan et al., 2009. Comparing data
mining methods with logistic regression in childhood obesity prediction.
Inform. Syst. Frontiers, 11: 449-460. DOI: 10.1007/s10796-009-9157-0
25
26
27