Professional Documents
Culture Documents
net/publication/346564787
CITATIONS READS
0 2,200
2 authors:
Some of the authors of this publication are also working on these related projects:
Multi-Class Sentiment Classification using Machine Learning and Deep Learning Techniques View project
All content following this page was uploaded by Saurav Singla on 02 December 2020.
decade, usage of Analytics and optimisation techniques set of high-performance teams that can provide a basis for
have picked up pace only in the last few years. team selection during IPL Player Auctions.
In the past, many ranking methods have been used for III. METHODOLOGY
optimal team selection. The ranking methods used specific
performance indices such as batting averages, strike rates, As discussed briefly in Section I, a user gets to build his
number of fours and sixes for evaluating a batsman. Some own team for an upcoming match. From a pool of 30
of these works are of Lemmer [1], Boorah and Mangan [2]. players (inclusive of both teams), a team of 11 players
Kimber and Hansford [3], Damodaran [4] proposed a need to be selected subject to budget and player
method to calculate batting averages when a player constraints. Every player has an associated cost. A good
remains not out. Ovens and Bukiet [5], Swartz et.al.[6] player has a high cost and vice versa. Selecting all the
have explored mathematical models to optimize the batting good players will exhaust the budget and the team may not
order. be complete. In order to have a balanced team, the
Dream11 also imposes restrictions on number of batsmen,
When it comes to quantifying bowling performances, bowlers, all-rounders and wicket-keepers.
Lemmer [7] has proposed a method to calculate bowling
averages and bowling strike rates. Lewis [8] have proposed The objective of any user is to pick a team that maximizes
metrics to evaluate performances of players using his expected team score. Since the objective of this project
Duckworth/Lewis percentage values. In the past, many is to formulate an optimization problem and solve it, we
Fantasy league players would simply choose a team, by remove the element of prediction from our model and base
using these ranking metrics. However, with the our player selection on the mean score and standard
introduction of budget and other constraints, a simple deviation of his previous matches. The problem resembles
ranking method would not have sufficed. Over the years, a classic Knapsack problem where the decision variable is
many researches have used different Machine Learning
and optimisation techniques to help fantasy league users in binary with
selecting an appropriate team.
The decision variable not just indicates the selection of a
In this section, we explore some of the works done by particular player but also indicates the role in which he is
researchers using Machine Learning approaches. Farhana selected.
et.al. [9] have proposed a Support Vector Machine (SVM)
based solution that generates a ranked list of players based III.I Decision Variables
on their abilities, from which a user can select a playing Any player j can only be selected as a normal player, a
11. C Deep Prakash [10] has proposed a Random Forests vice-captain, a captain or be not-selected at all. We denote
based Recursive Feature estimation algorithm to rank the the role of the player by adding another subscript k.
players. The features that were passed into the Random
Forest algorithm were a combination of customised if player j is selected as a normal player
metrices that accounted for both IPL and T20 International
matches. These metrices so constructed, removed bias if player j is selected as vice-captain
against the players who had been playing for a longer time. if player j is selected as captain
Once the performance evaluation metrices were selected
on the basis of feature importance, an MGA (Memetic for all k∈{1,2,3} if player j is not selected
Genetic Algorithm) was used to optimise the team
selection for IPL Teams. III.II Objective Function
We take the scores of each player Sj as information given
Sharp et.al. [11] have formulated an Integer programming to us. The optimization problem at hand involves finding a
method for T20 team selection as a maximisation problem, solution set that maximizes the total team score subject to
wherein the decision variable indicates whether a certain team, role and player type constraints.
player having a specific skill is selected or not. The
coefficients for different skills such as batting, bowling and The objective function can be written as:
fielding were normalised using a method that utilized
certain indices established indices such as averages and
strike rates for batsmen and bowlers using a combination
of different performance statistics.
If Pj denotes the price of the jth player, the budget from the table below. The same logic follows for the other
constraint can be written as: type of players as well.
We improve the optimization model by including a penalty Scenario 2. At times depending on weather conditions, the
for inconsistent performance. We calculate the standard team which is batting second, is at a disadvantage. The
deviation of scores for previous 10 matches as the measure Dew on the pitch ensures that the ball experiences
of inconsistency in performance and include in the unpredictable swings. In this case, the bowling team
objective function by coupling it with a risk aversion should include more bowlers in the team.
5
6
7
λ Players Selection
AR C JA UY DK MD JB JU KJ R
0
1
2
3
4
5
6
Scenario 3. The team management has decided to rest V 7
Kohli, who has been the choice for 'Captain' for both risk-
taking and risk-averse Dream 11 participants. Who shall be Legend Not Selected Selected Captain Vice-Captain
the new Captain?
Table 5. Expected and Actual Total Scores per Risk Aversion
Index
λ Scores
Expected Scores Actual %Difference
Scores
0 4127 4408 6.8%
1 3909 4095 4.8%
2 3809 3909 2.6%
3 3809 3909 2.6%
4 3701 3896 5.3%
5 3600 3732 3.7%
6 3600 3732 3.7%
7 3388 3525 4.0%
It’s important to note that, the team selection depends on
the risk-taking appetite of a Dream 11 User. A user, who is From a pool of 30 players from India and England, a team
willing to take less risk may try to go for highly consistent of 11 players is chosen based on the risk-aversion index (0
players, resulting in a low overall score. It’s important the being least risk averse to 7 being highly risk averse). The
test the robustness of our proposed method in terms of difference between the expected score, as predicted by the
risk-reward trade off. model and the observed score is then noted.
To test the robustness of our model, we create a random set We observed that highest possible value for our case
of scores to see how well our model performs on a set of actually comes with λ=0. The difference decreases first
randomized data. We check the players selected, expected with λ =3 and then starts to slowly increase again and the
score, observed scores and the percentage difference expected score starts to drop faster. This is because after a
between the expected and the observed for different values certain point of time, the punishment term for variance
of λ. starts to dominate. Thus, for our case a nice spread comes
around λ of 2 or 3 with a high enough score and a low
Table 4. Team Composition per Risk Aversion Index of a Dream percentage difference.
11 User
λ Players Selection Even though in our case the best possible outcome comes
VK SD JD RS JR KR JY JV AH E when λ =0 for later matches we may not turn out so lucky.
0
Here the observed value is greater than the expected, which
1
is why the higher variation is in our favour here. However,
2
3
in later matches if the expected is larger than the observed,
4 the high variation will give us larger loss. Thus, the risk
5 averse way is to minimize the difference between the
6 expected and the observed and thus making the expected
7 scoring a bit more accurate which gives us a clear idea of
λ Players Selection whether with the expected score, we should invest in the
BS HP TC M JB LP BK KY MS Y particular match or not.
0
1 V. CONCLUSION AND FUTURE SCOPE
2
3 The paper presents a novel method of using Integer
4 Optimisation using Gurobi to predict an optimal team for a