The document describes a global trust score prediction model developed by Team Geekbytes. It pre-processes the data to check for outliers and missing values. Descriptive analysis shows trust increases with trust in independent variables and trust is highest in doctors/nurses. Factor analysis reduces independent variables to two principal components explaining 73% of variability. Regression modeling shows the first principal component affects trust more than the second. The conclusion lists the most important variables affecting overall trust score.
Original Description:
Original Title
Team Geekbytes_Global Trust Score Prediction Model
The document describes a global trust score prediction model developed by Team Geekbytes. It pre-processes the data to check for outliers and missing values. Descriptive analysis shows trust increases with trust in independent variables and trust is highest in doctors/nurses. Factor analysis reduces independent variables to two principal components explaining 73% of variability. Regression modeling shows the first principal component affects trust more than the second. The conclusion lists the most important variables affecting overall trust score.
The document describes a global trust score prediction model developed by Team Geekbytes. It pre-processes the data to check for outliers and missing values. Descriptive analysis shows trust increases with trust in independent variables and trust is highest in doctors/nurses. Factor analysis reduces independent variables to two principal components explaining 73% of variability. Regression modeling shows the first principal component affects trust more than the second. The conclusion lists the most important variables affecting overall trust score.
Prediction Model Team Geekbytes Kiran S Viswa M Danel Hilton W Pramod Raja Prasana Balaji Srinivasan Tiruniteesh Pre-processing the data
• There are no outliers in the data.
• This is checked by comparing the data points with Inter Quartile Range value. • There are no missing values in the data. Descriptive analysis • Overall trust score increases with increase in trust on IVs. • Mean trust on doctor and nurses is the highest followed by scientists. • The overall trust score lies mostly between 1700 and 2400. Multi- collinearity among IVs • Multi-collinearity caused because of correlation among Ivs. • It is important to eliminate multi-collinearity. • VIF of Ivs are greater than 10. Factor Analysis (Reduction of IVs to Factors)
• PCA done to eliminate multi-collinearity among independent variables
• No. of components from PCA: 2 • 73% variability captured in data • PC1: Neighbourhood, Scientist, Doctors & Nurses, Traditional Healers • PC2: Government, Journalist, Philanthropist • All the independent variables significantly affect overall trust score • It is advisable to rank these independent variables according to factor loadings value. Regression model after Factor Analysis
• PC1 affects overall trust score more than PC2. • So, PC1 is more important than PC2. • R square value: 89% - strength of model is “GOOD”. • Root Mean Square Error: 72.79 CONCLUSION
A country’s overall trust score depends on these variables
(starting with most important one), 1. Scientists 2. Neighbourhood 3. Doctors and Nurses 4. Traditional Healers 5. Government 6. Journalist 7. Philanthropists Thank you