Professional Documents
Culture Documents
ABSTRACT
Aim: The aim of research is to classify the cost prediction in health insurance using novel ranking with machine
learning algorithms. Materials and Methods: The categorizing is performed by adopting a sample size of n=10
in Linear Regression and sample size n = 10 in random forest algorithms was iterated 20 times for efficient and
accurate analysis on labeled images with G power in 80% and threshold 0.05%, CI 95% mean and standard
deviation. Results: The analysis of the results shows that the Linear Regression has a high accuracy of
(92.53%) in comparison with the random forest algorithm (91.68 %). There is a statistically significant
difference between the study groups with (p<0.05). Conclusion: Prediction in classifying the cost prediction in
health insurance shows that the Linear Regression appears to generate better accuracy than the cost prediction
random forest algorithm.
Keywords: Health Care, Machine Learning, Random Forest Regression, Novel Ranking, Linear Regression.
2030
Rama Parvathy L.et.al.,A Novel Ranking Approach to Improved Health Insurance Cost Prediction by
Comparing Linear Regression to Random Forest
2031
Journal of Survey in Fisheries Sciences 10(1S) 2030-2039 2023
2032
Rama Parvathy L.et.al.,A Novel Ranking Approach to Improved Health Insurance Cost Prediction by
Comparing Linear Regression to Random Forest
2033
Journal of Survey in Fisheries Sciences 10(1S) 2030-2039 2023
2034
Rama Parvathy L.et.al.,A Novel Ranking Approach to Improved Health Insurance Cost Prediction by
Comparing Linear Regression to Random Forest
2035
Journal of Survey in Fisheries Sciences 10(1S) 2030-2039 2023
https://doi.org/10.7903/ijecs.1400. https://doi.org/10.18178/ijmlc.2019.9.6
9. Karobari, Mohmed Isaqali, Syed Nahid .866.
Basheer, Fazlur Rahman Sayed, 13. Muremyi, Roger, Dominique
Sufiyan Shaikh, Muhammad Atif Haughton, Ignace Kabano, and
Saleem Agwan, Anand Marya, Pietro François Niragire. 2020. “Prediction of
Messina, and Giuseppe Alessandro out-of-Pocket Health Expenditures in
Scardina. 2021. “An In Vitro Rwanda Using Machine Learning
Stereomicroscopic Evaluation of Techniques.” The Pan African Medical
Bioactivity between Neo MTA Plus, Journal 37 (December): 357.
Pro Root MTA, BIODENTINE & 14. Muthukrishnan, Lakshmipathy. 2021.
Glass Ionomer Cement Using Dye “Nanotechnology for Cleaner Leather
Penetration Method.” Materials 14 Production: A Review.”
(12). Environmental Chemistry Letters 19
https://doi.org/10.3390/ma14123159. (3): 2527–49.
10. Karthigadevi, Guruviah, 15. Nakamura, Haruyo, Floriano Amimo,
Sivasubramanian Manikandan, Siyan Yi, Sovannary Tuot, Tomoya
Natchimuthu Karmegam, Ramasamy Yoshida, Makoto Tobe, Md Mizanur
Subbaiya, Sivasankaran Rahman, Daisuke Yoneoka, Aya
Chozhavendhan, Balasubramani Ishizuka, and Shuhei Nomura. 2021.
Ravindran, Soon Woong Chang, and “Developing and Validating
Mukesh Kumar Awasthi. 2021. Regression Models for Predicting
“Chemico-Nanotreatment Methods for Household Consumption to Introduce
the Removal of Persistent Organic an Equitable and Sustainable Health
Pollutants and Xenobiotics in Water - Insurance System in Cambodia.” The
A Review.” Bioresource Technology International Journal of Health
324 (March): 124678. Planning and Management, July.
11. Kharrazi, Hadi, Xiaomeng Ma, Hsien- https://doi.org/10.1002/hpm.3269.
Yen Chang, Thomas M. Richards, and 16. Pauly, Mark. 2015. “Cost-
Changmi Jung. 2021. “Comparing the Effectiveness Analysis and Insurance
Predictive Effects of Patient Coverage: Solving a Puzzle.” Health
Medication Adherence Indices in Economics.
Electronic Health Record and Claims- https://doi.org/10.1002/hec.3044.
Based Risk Stratification Models.” 17. Preethi, K. Auxzilia, K. Auxzilia
Population Health Management, Preethi, Ganesh Lakshmanan, and
February. Durairaj Sekar. 2021. “Antagomir
https://doi.org/10.1089/pop.2020.0306. Technology in the Treatment of
12. Maharani, Dian, The authors are with Different Types of Cancer.”
Universitas Indonesia, Indonesia, Epigenomics.
Hendri Murfi, and Yudi Satria. 2019. https://doi.org/10.2217/epi-2020-0439.
“Performance of Deep Neural Network 18. Quadagno, Jill. 2006. “Cost
for Tabular Data — A Case Study of Containment versus National Health
Loss Cost Prediction in Fire Insurance.” One Nation,
Insurance.” International Journal of UninsuredWhy the U.S. Has No
Machine Learning and Computing. National Health Insurance.
2036
Rama Parvathy L.et.al.,A Novel Ranking Approach to Improved Health Insurance Cost Prediction by
Comparing Linear Regression to Random Forest
2037
Journal of Survey in Fisheries Sciences 10(1S) 2030-2039 2023
Table 2. Group Statistics of Linear Regression with Random Forest by grouping the
iterations with Sample size 10, Mean = 95.02000 , Standard Deviation = 1.50259 , Standard
Error Mean = 0.47516. Descriptive Independent Sample Test of Accuracy and Precision is
applied for the dataset in SPSS. Here it specifies Equal variances with and without assuming
a T-Test Score of two groups with each sample size of 10.
Group N Mean Std.Deviatio Std.Error
n Mean
Table 3. Independent Sample Test of Accuracy and Precision ( Calculate P-value <0.001 and
Significant value= 0.107, Mean Difference= 1.9 and confidence interval = (0.92- 0.10).
Logistic Regression and Random Forest are significantly different from each other.
F Sig T Df Sig(2. Mean Std error lower upper
tailed differe difference
nce
Accu Equal 2.885 0.107 2.065 18 <.001 1.9 0.92 -0.03 3.83
racy Variance
Assume
Preci Equal 0.289 0.598 -11.798 18 <.001 -1.198 0.10 -1.41 -0.98
sion Variance
Assume
d
2038
Rama Parvathy L.et.al.,A Novel Ranking Approach to Improved Health Insurance Cost Prediction by
Comparing Linear Regression to Random Forest
Fig. 1. Comparison of Linear Regression over Random Forest in terms of mean accuracy.It
explores that the mean accuracy is slightly better than random forest and the standard
deviation is moderately improved compared to random forest.Graphical representation of the
bar graph is plotted using group id as X-axis Linear Regression vs Random Forest,Y-Axis
displaying the error bars with a mean accuracy of detection +/-1 SD.
2039