You are on page 1of 27

Indian Premier League Analysis (2020)

Mini Project Report


Submitted by
Ms. Priti Sonawane 2041066
Ms. Anjali Shingavi 2041065

In partial fulfilment for the requirement of Laboratory Practice-II

Bachelor of Computer Engineering

Under the guidance of

Prof. D. M. Padulkar
(Assistant Professor)

Department of Computer Engineering

Vidya Pratishthan’s Kamalnayan Bajaj Institute of

Engineering and Technology

Bhigwan Road, Vidyanagari

Baramati-413133

1
2020-2021

Vidya Pratishthan’s
Kamalnayan Bajaj Institute of Engineering and Technology, Baramati

Department of Computer Engineering

Certificate
This is to certify that following students
Ms. Priti Sonawane 2041066
Ms. Anjali Shingavi 2041065

have successfully completed their project work on

INDIAN PREMIER LEAGUE ANALYSIS (2020)


2020)

During the academic year 2020-2021


2021 in the partial fulfilment towards the completion of Laboratory
Practice-IIII in Computer Engineering.

Project Guide HOD


D Deptt. of Comp. Engg.

(Prof. D. M. Padulkar) (Dr. Santaji Shinde)

Principal
(Dr. R.S. Bichkar)

2
Internal Examiner External Examiner

Acknowledgments

We are extremely grateful to Dr. R. S. Bichkar, Principal, VPKBIET and Dr. S. K. Shinde HOD, Head of
Department, Department of Computer Engineering, for providing all the required resources for the
successful completion of our mini project.

We express with Our heartfelt gratitude to our project guide Prof. Digambar Padulkar, Assistant
Professor, Computer Engineering, for his valuable suggestions and guidance in the preparation of the
project report.

We will be failing in duty if we do not acknowledge with grateful thanks to the authors of the
references and other literatures referred to in this project.
Last but not the least; we are very much thankful to our parents who guided us in every step which we
took.

Ms.Priti Sonawane
Ms. Anjali Shingavi

VPKBIET, Baramati

Abstract
The Indian Premier League (IPL) is a professional Twenty20 cricket league in India
contested during March or April and May of every year by eight teams representing eight different
cities in India. The league was founded by the Board of Control for Cricket in India (BCCI) in
2008. The IPL has an exclusive window in ICC Future Tours Programme.

3
Contents
Acknowledgments 3

Abstract 4

1 Introduction 6
1.1 Overview . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
1.2 Brief Description. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
1.3 Problem Definition . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7

2 Literature Survey 8

3 Dataset Description 9
3.1 Introduction. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
3.1.1 Purpose . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
3.1.2 Project Scope. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9

3.1.3 Design and Implementation Constraints . . . . . . . . . . . 10

4 Data Pre-processing 11

5 Classification 13
5.1 Total number of matches till 2020 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
5.2 Total number of matches win by each team . . . . . . . . . . . . . . . . . . . . . . . . . .
.........
5.3 Toss or win ratio. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
5.4 success rate of winning matches…………………………………

6 Conclusion and Future Work 21

7 Bibliography 22

4
Introduction

1.1 Overview
This section of this report provides overall description of the project, including Brief
description and Problem definition. Second section contains literature survey. Third section
contains dataset description including Purpose, Project Scope, Design and Implementation
Constraints. Section four contains Data prepossessing. Fifth section contains classification.
Sixth section contains confusion matrix including Analysis of Confusion Matrix and Comparison
of Classifiers. Section seven contains Result Analysis and last section contains Conclusion and
Future.

1.2 Brief Description


The Indian Premier League (IPL) is a professional Twenty20 cricket league, contested by
eight teams based out of eight different Indian cities. The league was founded by the Board of
Control for Cricket in India (BCCI) in 2007. It is usually held between March and May of every year
and has an exclusive window in the ICC Future Tours Programme.
The IPL is the most-attended cricket league in the world and in 2014 was ranked sixth by average
attendance among all sports leagues. In 2010, the IPL became the first sporting event in the world
to be broadcast live on YouTube. The brand value of the IPL in 2019 was ₹475
billion (US$6.7 billion), according to Duff & Phelps. According to BCCI, the 2015 IPL season
contributed ₹11.5 billion (US$160 million) to the GDP of the Indian economy.

1.3 Problem Definition


Considering IPL T20 related dataset to analysis whether the winning team has got success or
not.

5
2

Literature Survey
A. Describes about significant challenges that we face for accurate prediction including the
various parameters which affect the outcome of the match. The ball movement gets changed
from every over, so it is considered being important to predicting the outcome of each match
on every ball. Here they had developed a model that predicts the match result of every ball
played.
B. Explains about the concept of identifying rising stars in cricket domain using some
techniques. Rising stars can be predicted by both bats as well as bowling teams. Distinct
features like concept of co-players, team and opposite teams are presented with their
mathematical formulation.
C. Explained the outcome of ODI match depends on various factors. The list of key features
is home-field advantages, winning the toss, game plan, venue and season. Logistic
Regression, SVM are the different types of algorithm used for model building. Logistic
Regression is applied for data that had been already obtained from previous matches.
SVM used for predictive analysis. It was found that SVM was proved to be a better model
based on both the parameters used to predict accuracy and model outcome .

6
Dataset Description
3.1 Introduction
3.1.1 Purpose
The purpose of this report is to provide a detailed overview of our project, its
parameters and goals. The main purpose of this project is basic exploratory data analysis and feature
analysis.

3.1.2 Project Scope


Based on users' physical condition and other parameters classification and prediction of data
is done which will give the final result as winning team is successful.

7
Data Pre-processing

Data Cleaning

8
9
10
11
12
13
Classification

Classification is the process of predicting the class of given data points. It belongs to the supervised
learning category where the targets provided with the input data. classes are sometimes called as
labels or categories.

1. Total number of matches till 2020

14
2. Total number of matches win by Each Teams

15
3. Toss or Win ratio

16
4. Success rate of winning matches

17
18
5. Most titles win

19
6. Top 10 Players with Most Runs

7.. Does toss winning affects the match winner?

20
8. Do you know who won the most player of the match?

21
9. Top 10 cities by number of matches

22
23
Conclusion and Future Work
This project has intended on analyzing the results of the IPL match during the year 2008-
2019 by applying the data mining algorithms for existing data, and predicted the new data
for the year 2020 and applied data mining algorithms for the proposed data. The
Implementation tools used are Rapid Miner Studio Version-9.3.This knowledge will be
used in future to predict the winning team. Hence using this prediction, the best team can
be formed.

24
7 Bibliography

[1] Abdullah, F.S., N.S.A. Manan, A. Ahmad, S.W. Wafa and M.R. Shahril et al., 2016.
techniques for classification of childhood obesity among year 6 school children. DOI:
10.1007/978-3-319-51281-5_47

[2] Adnan, M.H.B.M. and W. Husain, 2012. A hybrid approach using Naïve Bayes and
genetic algorithm for childhood obesity prediction. DOI:
10.1109/ICCISci.2012.6297254

[3] Zhang, S., C. Tjortjis, X. Zeng, H. Qiao and I. Buchan et al., 2009. Comparing data
mining methods with logistic regression in childhood obesity prediction.
Inform. Syst. Frontiers, 11: 449-460. DOI: 10.1007/s10796-009-9157-0

25
26
27

You might also like