Professional Documents
Culture Documents
Big Data SPSS
Big Data SPSS
RATE :
SOLUTIONS -
Variables Entered/Removeda
Variables Variables
Model Entered Removed Method
1 SR -Bb . Enter
a. Dependent Variable: SOLD PRICE
b. All requested variables entered.
Model Summaryb
Adjusted R Std. Error of Durbin-
Model R R Square Square the Estimate Watson
1 .184a .034 .026 401399.957 2.132
a. Predictors: (Constant), SR -B
b. Dependent Variable: SOLD PRICE
ANOVAa
Sum of
Model Squares df Mean Square F Sig.
1 Regression 72489011718 72489011718
1 4.499 .036b
6.121 6.121
Residual 20623606413 16112192510
128
583.100 6.118
Total 21348496530
129
769.223
a. Dependent Variable: SOLD PRICE
b. Predictors: (Constant), SR -B
Coefficientsa
Unstandardized Standardized
Coefficients Coefficients
Model B Std. Error Beta t Sig.
1 (Constant) 289521.427 114770.006 2.523 .013
SR -B 2086.394 983.643 .184 2.121 .036
a. Dependent Variable: SOLD PRICE
Residuals Statisticsa
Std.
Minimum Maximum Mean Deviation N
Predicted Value 289521.44 780846.25 521223.08 74962.012 130
Residual -
1261739.625 .000 399841.116 130
567774.563
Std. Predicted
-3.091 3.463 .000 1.000 130
Value
Std. Residual -1.414 3.143 .000 .996 130
a. Dependent Variable: SOLD PRICE
INTERPRETATION 1:
Explanation of model – when the value of strike rate batting( SRB) is zero, the minimum sold
price of the player is 289521.427.
When the SRB changes by 1 unit, the change in the value of Sold price changes by 2086.394.
- P value ( 0.036 ) is less than 0.05, f value ( 4.499 ) means that batting strike rate is
statistically significant in determining the sold price.
- Value of R is .184 which is closer to zero, henceforth there is low level of correlation
between X and Y.
- Value of R square is 0.034, which suggests only 3.4% of variance in the Y values around
the mean are explained by the x values.
- Durbin Watson – 2.132 since the value lies between 2 AND 4, no auto-correlation.
To summarise: There is correlation between X and Y, But due values of R closer to zero, the
level of correlation is low– hence, the model is not a good fit.
QUESTION 2 : RELATIONSHIP BETWEEN SIXERS AND PLAYERS PRICE.
Variables Entered/Removeda
Variables Variables
Model Entered Removed Method
1 SIXERSb . Enter
a. Dependent Variable: BASE PRICE
b. All requested variables entered.
Model Summaryb
Adjusted R Std. Error of Durbin-
Model R R Square Square the Estimate Watson
1 .162a .026 .019 151666.788 2.037
a. Predictors: (Constant), SIXERS
b. Dependent Variable: BASE PRICE
ANOVAa
Sum of
Model Squares df Mean Square F Sig.
1 Regression 79242823989 79242823989
1 3.445 .066b
.771 .771
Residual 29443602529 23002814476
128
33.306 .041
Total 30236030769
129
23.077
a. Dependent Variable: BASE PRICE
b. Predictors: (Constant), SIXERS
Coefficientsa
Unstandardized Standardized
Coefficients Coefficients
Model B Std. Error Beta t Sig.
1 (Constant) 173828.163 16590.680 10.477 .000
SIXERS 1040.147 560.409 .162 1.856 .066
a. Dependent Variable: BASE PRICE
Residuals Statisticsa
Std.
Minimum Maximum Mean Deviation N
Predicted Value 173828.16 308007.16 192230.77 24784.783 130
Residual -
1144967.375 .000 151077.788 130
166309.938
Std. Predicted
-.742 4.671 .000 1.000 130
Value
Std. Residual -1.097 7.549 .000 .996 130
a. Dependent Variable: BASE PRICE
INTERPRETATION 2 :
Explanation of the model : When the value of sixers is zero, the players price is 173828.163.
Also, when the sixers ( X ) value changes by 1 unit, the change in the players price changes by
1040.147 .
- P value ( 0.066) > 0.05, and F value 3.445, means that sixers is statistically insignificant
in determining the players price.
- R value – 0.162 which is closer to zero, hence there is low level of correlation between X
and Y.
- R Square ( 0.026) means only 2.6 % of the variance in the Y values around the mean are
explained by the x values.
- Durbin Watson – 2.037 since the value lies between 2 AND 4, no auto-correlation.
Variables Entered/Removeda
Variables Variables
Model Entered Removed Method
1 SIXERS, SR
. Enter
-Bb
a. Dependent Variable: SOLD PRICE
b. All requested variables entered.
Model Summaryb
Adjusted R Std. Error of Durbin-
Model R R Square Square the Estimate Watson
a
1 .451 .203 .191 365998.658 1.994
a. Predictors: (Constant), SIXERS, SR -B
b. Dependent Variable: SOLD PRICE
ANOVAa
Sum of
Model Squares df Mean Square F Sig.
1 Regression 43362092602 21681046301
2 16.185 .000b
89.346 44.673
Residual 17012287270 13395501787
127
479.877 7.794
Total 21348496530
129
769.223
a. Dependent Variable: SOLD PRICE
b. Predictors: (Constant), SIXERS, SR -B
Coefficientsa
Unstandardized Standardized
Coefficients Coefficients
Model B Std. Error Beta t Sig.
1 (Constant) 395337.539 106613.879 3.708 .000
SR -B -102.524 991.030 -.009 -.103 .918
SIXERS 7758.805 1494.312 .454 5.192 .000
a. Dependent Variable: SOLD PRICE
Residuals Statisticsa
Std.
Minimum Maximum Mean Deviation N
Predicted Value 371194.25 1379635.63 521223.08 183341.281 130
Residual -
1230674.000 .000 363150.376 130
579635.625
Std. Predicted
-.818 4.682 .000 1.000 130
Value
Std. Residual -1.584 3.363 .000 .992 130
a. Dependent Variable: SOLD PRICE
INTERPRETATION 3 :
Explanations of the model – when the value of SRB ( x1) and SIXERS ( x2) is zero, the sold
price of the player is 395337.539.
When the value of x2 is kept constant, one unit change in SRB ( x1 ), the value of sold price
changes by – 102.524 .
When the value of x1 is kept constant, one unit change in sixers ( x2 ), the value of sold price
changes by 7758.805 .
- P value for SRB ( 0.918) > 0.05, hence insignificant in determining the sold price of the
players, whereas P value for sixers (0.000) < 0.05 ,f value (16.185) therefore it is
significant in determining the sold price of the Player.
- R value : 0.451, hence there is mid-level of correlation between X and Y.
- R Square ( 0.203 ) means only 20.3 % of the variance in the Y values around the mean are
explained by the x values.
- Durbin Watson – 1.994 since the value is less than 2, there is auto-correlation.
To summarise: In this model, in determining the sold price of the player – sixers is significant,
whereas strike rate batting is insignificant, which is explained by their P values.
However, overall the model is a good fit, which can be explained by F value (16.185) and p value
(.000).
QUESTION 4 : RELATIONSHIP BETWEEN PLAYERS PRICE AND THEIR AGE.
Variables Entered/Removeda
Variables Variables
Model Entered Removed Method
b
1 AGE . Enter
a. Dependent Variable: BASE PRICE
b. All requested variables entered.
Model Summaryb
Adjusted R Std. Error of Durbin-
Model R R Square Square the Estimate Watson
1 .209a .044 .036 150290.950 2.026
a. Predictors: (Constant), AGE
b. Dependent Variable: BASE PRICE
ANOVAa
Sum of
Model Squares df Mean Square F Sig.
1 Regression 13241975554 13241975554
1 5.863 .017b
5.745 5.745
Residual 28911833213 22587369698
128
77.332 .260
Total 30236030769
129
23.077
a. Dependent Variable: BASE PRICE
b. Predictors: (Constant), AGE
Coefficientsa
Unstandardized Standardized
Coefficients Coefficients
Model B Std. Error Beta t Sig.
1 (Constant) 75975.610 49790.577 1.526 .130
AGE 55563.128 22947.909 .209 2.421 .017
a. Dependent Variable: BASE PRICE
Residuals Statisticsa
Std.
Minimum Maximum Mean Deviation N
Predicted Value 131538.73 242665.00 192230.77 32039.191 130
Residual -
1162898.125 .000 149707.294 130
167101.859
Std. Predicted
-1.894 1.574 .000 1.000 130
Value
Std. Residual -1.112 7.738 .000 .996 130
a. Dependent Variable: BASE PRICE
INTERPRETATION 4 :
MODEL – Y (PLAYERS PRICE ) = 75975.610 + 55563.128 ( AGE )
Explanation of the model – when the player is a fresher, the minimum amount of the player is
75975.610.
When the age of the player increases by 1 unit, players price changes by 55563.128 .
- P value for age (0.017) < 0.05, F value (5.863) hence age is significant in determining the
players price.
- R value : 0.209, closer to zero. Therefore low level of correlation between X and Y.
- R Square ( 0.044) means only 4.4% of the variance in the Y values around the mean are
explained by the x values.
- Durbin Watson – 2.026 since the value is between 2 and 4, there is no auto-correlation.
To summarise : Age is a significant factor in determining the players price which can be
explained by P and F values. But due to low values of R and R square, the model is not a good fit.
QUESTION 5 : INDIAN ORIGIN PLAYERS PAID MORE THAN PLAYERS OF
OTHER ORIGIN.
Here we have taken 1 for players of Indian origin in country code, and 0 for all the other
countries.
Variables Entered/Removeda
Variables Variables
Model Entered Removed Method
Model Summaryb
ANOVAa
Sum of
Model Squares df Mean Square F Sig.
Coefficientsa
Standardized
Unstandardized Coefficients Coefficients
Country
221365.597 70214.643 .268 3.153 .002
code
Residuals Statisticsa
Explanation of the model – A player irrespective of the country origin has a sold price of
430974.026 rupees.
When a player belonged to Indian origin, the sold price of the player is changed by 221365.597.
- P value (0.002) < 0.05 and F value is 9.940, Hence forth country origin is a significant
factor in determining the sold price of the player.
- R value : ( 0.268) closer to zero. Therefore low level of correlation between X and Y.
- R Square ( 0.072) means only 7.2% of the variance in the Y values around the mean are
explained by the x values.
- Durbin Watson : 2.076 since the value is between 2 and 4, there is no auto-correlation.
To summarise: The players of Indian origin are paid more than the players of other origin, which
can be explained by the positive coefficient.
But the model is not a good fit, which can be explained by the values of R and R sq.
QUESTION 6 : DEVELOPING A MODEL WHICH CAN BE USED BY FRANCHISE TO
PREDICT SOLD PRICE :
FOR BATSMEN :
To determine the model for batsmen, we have used different factors to assess the sold price of
batsmen. Factors such as :
Sixers, age, base price, ODI SRB, T RUNS , AVE , ODI RUNS, RUNS, HSB
Variables Entered/Removeda
Variables Variables
Model Entered Removed Method
1 SIXERS,
AGE, BASE
PRICE, ODI-
SR-B, T- . Enter
RUNS, AVE,
ODI-RUNS-S,
RUNS-S, HSb
Model Summaryb
Sum of
Model Squares df Mean Square F Sig.
Total 213484965307
129
69.223
Coefficientsa
Unstandardized Standardized
Coefficients Coefficients
Residuals Statisticsa
Std. Predicted
-2.195 4.840 .000 1.000 130
Value
Here - age ( x1), base price (x2) , TRUNS ( x3) , ODI RUNS -S ( x4 ) and RUNS ( x5)
Explanation of the model – Irrespective of age, base price, T runs, ODI RUNS-S and RUNS,
the minimum amount paid to the player is 467881.414.
When the base price (x2) , T-runs ( x3 ) , ODI runs ( x4 ) and runs ( x5 ) are kept constant, the
one unit change in the age (x1) – the sold price (Y) of the player changes by -115065.604.
When x1, x3, x4, x5 are kept constant, one unit change in X2 – the sold price of the player
changes by 1.480.
When x1, x2, x4, x5 are kept constant, one unit change in x3 – the sold price of the player
changes by – 56.120.
When x1, x2, x3, x5 are kept constant, one unit change in x4 – the sold price of the player
changes by 50.570.
When x1 x2 x3 x4 are kept constant, the sold price of the player changes by 346.085.
MODEL FOR BALLERS :
The factors taken : age / bowling average / SRBL / WKTS / ECON/ ODI-SRBL / ODI
WKTS / TWKTS / Base price
Variables Entered/Removeda
Variables Variables
Model Entered Removed Method
1 BASE PRICE,
SR-BL, T-
WKTS, ODI-
SR-BL,
WKTS, . Enter
ECON, ODI-
WKTS,
RUNS-C,
AVE-BLb
Model Summaryb
Sum of
Model Squares df Mean Square F Sig.
Total 213484965307
129
69.223
Coefficientsa
Unstandardized Standardized
Coefficients Coefficients
Residuals Statisticsa
Std. Predicted
-1.330 6.709 .000 1.000 130
Value
- When the value of TWKTS, RUNS C and base price is zero, the sold price of the player is
204844.049.
- When the value of X2 and x3 are kept constant, one unit change in the value of x1, the
sold price of the player changes by -753.042.
- When the value of x1 and x3 are kept constant, one unit change in the value of x2, the
sold price of the player changes by 465.320.
- When the value of x1 and x2 are kept constant, one unit change in the value of x3, the
sold price of the player changes by 1.466.