Professional Documents
Culture Documents
Journal QR Code:
1
Software Engineer, Audience, Pakistan
Affiliation: 2
Game Developer, Finz Games, Pakistan
Article QR:
A publication of the
Dr Hasan Murad School of Management
University of Management and Technology, Lahore, Pakistan
PREDICTIVE ANALYSIS OF PSL MATCH WINNERS
USING MACHINE LEARNING TECHNIQUES
Khushal Das 1*, Muhammad Awais2
ABSTRACT: Cricket is one of the winner with the help of the Random
most celebrated open-air sports Forest algorithm.
generating a huge amount of KEYWORDS: K-Nearest
measurable information. As Pakistan Neighbours (KNN), machine
Super League (PSL) games grow in learning techniques, Naïve Bayes,
popularity, the potential predictors Random Forest, PSL prediction
which influence the outcomes of the
matches need to be investigated. PSL I. INTRODUCTION
data spanning the years 2016-2019 Recent advancements in technology
and containing the specifics of the have allowed the simultaneous
players, match venue details, squads, collection and processing of large
and ball-by-ball details was collected. amounts of information, collectively
Subsequently, it was analyzed to known as big data. Currently, data
draw various conclusions that offer science is being used in almost every
assistance to enhance a player’s walk of life to achieve better and
performance. Due to the expanding consequential results. The goal is to
number of matches day by day, it is use statistical and machine learning
difficult to oversee or extricate algorithms on the available data for
valuable data from the accessible creating computer programs that are
information of all the matches. This able to retrieve and use data for
paper presents the preprocessing of understanding and self-learning. This
the data, as well as its visualization process begins by data observation,
and prediction. It centers on for instance, events, direct knowledge
measuring the results of PSL matches or training, for the reorganization of
by applying the existing information certain statistical trends and to make
mining algorithms. It includes factors informed decisions based on the
such as team 1, team 2, toss winner, samples collected in the future. The
and toss decision to predict the match
1
Department of Information Systems, Dr Hassan Murad School of Management,
University of Management and Technology, Lahore, Pakistan
*Corresponding Author: f2020313013@umt.edu.pk
2
Department of Information Systems, Dr Hassan Murad School of Management,
University of Management and Technology, Lahore, Pakistan
*Corresponding Author: f2020313003@umt.edu.pk
Dr Hasan Murad School of Management
75
Volume 1 Issue 1, Spring 2021
PREDICTIVE ANALYSIS OF PSL MATCH WINNERS USING …
dataset used in our project contained focused. WASP (Winning and Score
information about the matches of PSL Predictor) [15] is a realistic method
held from the year 2016 through used in cricket. It forecasts the score
2019. Using this data set, we achieved and the possible outcome of a limited
the above mentioned primary over match of cricket, including a T20
objective of our project. or a one-day match. Sky Sports New
Zealand, for the first time, developed
II. LITERATURE REVIEW
such a tool in 2012.
One of the prime sports leagues, Hawk-Eye, which is a software, can
that is, Major Baseball League, has track the ball’s trajectory and visibly
progressed tremendously in the area shows the best statistically important
of sports technology in recent years track. It has also been formally used
by incorporating ball-by-ball in the Umpire Decision Review
statistics and basic awareness into the Method since the year 2009.
game. These are usually not Likewise, this computer-aided
observable otherwise and this is intelligent technology is also used in
where machines come into play. In other sports such as badminton, tennis
order to obtain and use the relevant and snooker. In this paper, we explore
data, experienced teams of technical briefly the relevance of machine
experts implement different machine learning for sports and the methods
learning skills to better train their suggested for its enhancement.
players with improved results.
Some of the categorizing or sorting III. MACHINE LEARNING
problems resolved with the aid of Modern autoencoders are
machine learning in the game of networks that attempt to encode and
baseball [9] include forecasting the With the application of machine
games’ outcomes, categorizing if a learning and Artif icial Intelligence
team will purposely permit a player to (AI), we can address the real-time
walk in to bat, and classifying non- problems of the society. In sports,
fastball pitches by field type. they have a thoughtful influence.
Moreover, sports analytics are also During the game, teams utilize data to
utilized in cricket to predict the result enhance on-the-field performance of
of a match while it is still in progress their players. Such predictive analysis
or even when the match is yet to begin helps them to make directed decisions
[10-14]. Concerns such as predicting and structural adjustments which
the number of wickets taken or the affect various aspects of their sports
runs scored during the match are also association, from recruiting players to
interesting problems which must be
There is a different file for each null values that affect the result.
match containing information about Data preprocessing yields a format
PSL season year, details of match that is easier to handle while using
winner, player of the match, match different algorithms.
number, location, name of the
1. Data Cleaning
stadium, umpires’ particulars,
participating teams, and margin of Data may contain noise and missing
winning. Since PSL is only five values for different columns due to
years old, the files of only 146 error(s) in recording or parsing. This
matches played during the five results in improper classification.
seasons are available. The data Hence, we replaced these values
contains null values. For prediction with dummy values such as mean
analysis, certain attributes may not and mode, depending on column
be needed. types. Further, we removed columns
that were completely null.
B. Data Pre-Processing
2. Choosing Required Attributes
Since the available data was in
YAML format and contained a In this step, we dropped certain
different file for each match, it columns of the data that did not help
needed strong preprocessing. First, in prediction or only resulted in
we read it into python and decreased accuracy. Feature
normalized it using python library importance was used to point out
known as json [999]. Then, we such columns.
converted it to dataframe [999] 3. Model Evaluation and
using another library called pandas Development
[999]. Afterwards, we concatenated We applied different classification
all of these files to construct a single algorithms including KNN,
dataframe [999] containing the data Random Forest and Naïve Bayes
of all matches. Subsequently, we did classifier. The basic procedure
exploratory data analysis to identify followed in applying all of these
any existing anomalies. Usually, algorithms is given below.
datasets contain flaws which need to 1. for for each sample iteration do
be addressed before applying the 2. Split data into train and test set.
algorithms. There could be several
Dr Hasan Murad School of Management
75
81
Volume 1 Issue 1, Spring 2021
PREDICTIVE ANALYSIS OF PSL MATCH WINNERS USING …