
Proceedings of the Fifth International Conference on Inventive Computation Technologies (ICICT-2020)

IEEE Xplore Part Number:CFP20F70-ART; ISBN:978-1-7281-4685-0

Impact Analysis of the Complete Blood Count Parameter using Naive Bayes

Bhavinkumar A. Patel
Student-Research Scholar (Dept. of Computer Science)
Gujarat Vidyapith, Ahmedabad
Gujarat, India
bhavin.patel@gujaratvidyapith.org

Dr. Ajay Parikh
Professor, Dept. of Computer Science
Gujarat Vidyapith, Ahmedabad
Gujarat, India
ajay@gujaratvidyapith.org

Abstract—The Complete Blood Count (CBC) test is considered a very important test for evaluating the overall health of a person. It plays a significant role in detecting infection, leukaemia, cancer, anaemia, abnormal blood counts, deficiencies of vitamins and minerals, etc. Gujarat Vidyapith is a deemed-to-be university in which newly admitted students undergo a medical check-up, so that their health condition, proper diagnosis and appropriate treatment are taken care of. The CBC test is considered one of the most important sources for health analysis. However, CBC data is highly variable and confidential, which makes it difficult to evaluate directly using a machine learning algorithm. Therefore, a model is developed in this paper using the Naive Bayes classifier, which can predict anemia by analyzing the probabilities of the effects of various parameters of the CBC test. It will help to detect anemia at an early stage so that the probability of serious complications can be reduced. Students can take precautions to improve their health standards. Ultimately, this will affect the level of concentration, regularity and mental health of students, and the level of education can therefore be improved.

Keywords—Anemia, CBC Parameters, Naive Bayes algorithm, Prediction, Machine Learning, Healthcare.

I. INTRODUCTION

The Complete Blood Count (CBC) test is regarded as an important test for whole-body health screening. For the diagnosis of any disease, the doctor will first prescribe the CBC test, which can help diagnose infection, cancer, anemia, and deficiencies of vitamins and minerals. In addition, it provides information on blood disorders and abnormal conditions. The blood count test is therefore considered a very important source for overall health analysis.

Gujarat Vidyapith is a deemed-to-be university in which newly admitted students are examined. If any disease or symptom is found, an attempt is made to treat it. CBC reports vary from one laboratory to another and are confidential, which makes them difficult to collect. Because of this diversity in the CBC reports, 16 parameters that play an important role in the classification of anemia have been selected.

Anemia means a low level of hemoglobin in the blood. It is not a disease in itself but a symptom that can lead to serious illness, and it is one of the most widespread health problems. According to the research of the World Health Organization, the number of people suffering from anemia in the world is 1.62 billion, with a proportion of 47.4% in children and 41.8% in pregnant women. Therefore, if it is diagnosed at an early stage, its adverse effect on the studies of students can be reduced. Symptoms such as feeling tired, asthma, breathing problems, nausea, restlessness, pale nails, etc. are found in anemic students.

Anemia indicates a reduction of hemoglobin or red cells in human blood. This paper classifies students as anemic or non-anemic using different parameters of the blood count test, including age, gender, blood group, hemoglobin, hematocrit, MCV, MCH, and other parameters. The obtained values are compared with the normal range of each test to derive the conclusion.

A parser has been created to read the CBC data-set files collected from different laboratories and convert the data to a readable format. Anemia can then be predicted by analyzing that data.

The Naive Bayes algorithm gives a conditional probability and is based on the Bayesian theorem. It is a supervised learning method as well as a statistical method for classification, with an underlying probability model. It allows uncertainty about the model to be captured in a principled way by determining the probabilities of the outcomes, and thereby helps to solve diagnostic and prediction problems.

II. METHODOLOGY

In this research study, the data-set represents the information for the prediction of anemia. Students in the age group from 16 to 33 years studying in Gujarat Vidyapith are selected as the sample of the study. CBC samples of 2151 students from the last five years (2014 to 2018), collected from various pathology laboratories, are taken as the data set for the research. The reports are not uniform because the laboratories use different testing tools. Moreover, not all dimensions of the CBC report are important for the prediction of anemia (research of the Department of Pathology at the University of Medicine, Utah). Therefore, only the 16 parameters that are most important for the prediction of anemia are considered and tested in the present research study.
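The Introduction notes that measured values are compared with normal ranges to derive the conclusion. As a minimal sketch of that comparison for the target variable, the Python below applies the hemoglobin cut-offs that Table I lists for Anemia_Status (13.2 for males, 11.6 for females). The function name, the record layout and the four example records are hypothetical and are not taken from the study data.

```python
# Hypothetical helper: names and record layout are illustrative only.
# Cut-offs are the HB values listed for Anemia_Status in Table I.
HB_CUTOFF = {"Male": 13.2, "Female": 11.6}


def anemia_status(sex: str, hb: float) -> str:
    """Return 'Anemic' when HB is below the sex-specific cut-off."""
    return "Anemic" if hb < HB_CUTOFF[sex] else "NonAnemic"


# Made-up example records (not from the study data set).
students = [
    {"sex": "Female", "hb": 10.9},
    {"sex": "Female", "hb": 12.4},
    {"sex": "Male", "hb": 12.8},
    {"sex": "Male", "hb": 14.1},
]

labels = [anemia_status(s["sex"], s["hb"]) for s in students]
print(labels)
```

The same pattern (a lookup of reference limits followed by a comparison) would extend to the other range-based attributes in Table I, with each attribute mapped to Normal or Abnormal instead of Anemic or NonAnemic.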

978-1-7281-4685-0/20/$31.00 ©2020 IEEE 7

Authorized licensed use limited to: University of Exeter. Downloaded on June 22,2020 at 05:07:10 UTC from IEEE Xplore. Restrictions apply.

TABLE I. ATTRIBUTES OF ANEMIA DATASET

Sr. no.  Attribute Name  Description
1.       Age             Age group between 16 and 33
2.       Sex             Male, Female
3.       HB              Male < 13.2, Female < 11.6
4.       RBC             Male < 4.35, Female < 3.92
5.       PCV (HCT)       Male < 38.3, Female < 35.5
6.       MCV             Microcytic < 80, Normocytic 80 to 96, Macrocytic > 96
7.       MCH             Normal 27 to 33; Abnormal: low (< 27) or high (> 33)
8.       MCHC            Normal 33.4 to 35.5; Abnormal: low (< 33.4) or high (> 35.5)
9.       RDW             Normal 11.8 to 15.6; Abnormal: low (< 11.8) or high (> 15.6)
10.      WBC             Normal 3400 to 9600; Abnormal: low (< 3400) or high (> 9600)
11.      Neutrophil      Normal 40 to 70; Abnormal: low (< 40) or high (> 70)
12.      Lymphocyte      Normal 20 to 40; Abnormal: low (< 20) or high (> 40)
13.      Eosinophil      Normal 1 to 4; Abnormal: low (< 1) or high (> 4)
14.      Monocyte        Normal 2 to 6; Abnormal: low (< 2) or high (> 6)
15.      Platelet        Male < 135000, Female < 157000
16.      Anemia_Status   Male, HB < 13.2 = Anemic; Female, HB < 11.6 = Anemic

Here, the prediction of anemia is done using the 16 parameters in the data set listed in the above table.

III. RESEARCH FLOW

The research flow is divided into two parts: the first is pre-processing of the data and the second is the implementation of the analysis algorithm. The two stages of the anemia prediction process are shown in the following figure:

Fig.1. Research Flow.

A. Data Pre-processing

Data pre-processing is a technique for eliminating the gaps in the data and converting it to a correct, usable form. It uses data cleaning, data integration, data transformation, and data reduction.

• Data Reduction: The original data file contains more than 20 parameters, not all of which are important for anemia classification, as established by medical science. The following parameters are therefore removed from the dataset: RBCPS, WBCPS, Platelet PS, Para-Count, MCVRBC, and Remark.

• Data Imputation: Imputation is the process of replacing a missing value with an appropriate value.

• Data Transformation: The Anemia status parameter is the target variable, a categorical variable which indicates whether the student is Anemic or Non-Anemic. The sex and blood group parameters are also treated as categorical variables, and the rest of the parameters are numerical.

B. Analysis Algorithm Implementation

• Data Analysis: Data analysis has been performed on various dimensions of the CBC report using the Naive Bayes classifier, and different classifications have been obtained from it. These cover sex, age, blood group and various other parameters, each related to anemia status on the basis of normal and abnormal conditions.

Fig.2. Missing Value Parameter

The above missing-value chart shows that the data-set does not have any missing values for any parameter, i.e. 0% missing. Thus, 100% of the values are available.

Fig.3. HB classification

The above chart reflects the HB classification, which plays a significant role in the prediction of anemia. Out of the total 2151 students, 1161 (54%) had low hemoglobin and 990 (46%) were normal.

1. Sex:


TABLE II. CLASSIFICATION OF SEX PARAMETER

Sex * Anemia Status
                                            Anemia_Status
Sex      Statistic                          Anemic   NonAnemic   Total
Female   Count                              730      399         1129
         Probability of total female Sex    0.647    0.353       1
         Probability of Anemia_Status       0.629    0.403       0.525
         % of Total                         33.9%    18.5%       52.5%
Male     Count                              431      591         1022
         Probability of total male Sex      0.422    0.578       1
         Probability of Anemia_Status       0.371    0.597       0.475
         % of Total                         20.0%    27.5%       47.5%
Total    Count                              1161     990         2151
         Probability within Sex             0.54     0.46        1
         Probability within Anemia_status   1        1           1
         % of Total                         54.0%    46.0%       100.0%

The classification of the sex parameter in the table above shows that 54% of the total 2151 students are anemic and the remaining 46% are non-anemic. Of all anemic students, 62.9% are female and 37.1% are male. Out of 1129 female students, 64.65% are anemic, while this proportion is 42.17% among male students. Hence, anemia is proportionally more common among the female respondents under study than among the males.

Fig.4. Sex with Anemia Status

The above chart shows the anemia status of respondents by gender. Among the anemic students under study, 730 (62.87%) are female and 431 (37.1%) are male.

2. Blood Group:

TABLE III. BLOOD GROUP ANEMIA STATUS

Blood Group * Anemia Status
                                       Anemia_Status
Bgr     Statistic                      Anemic   NonAnemic   Total
A-Ve    Count                          11       12          23
        % within bgr                   47.8%    52.2%       100.0%
        % within anemia_status         0.9%     1.2%        1.1%
        % of Total                     0.5%     0.6%        1.1%
A+Ve    Count                          336      255         591
        % within bgr                   56.9%    43.1%       100.0%
        % within anemia_status         28.9%    25.8%       27.5%
        % of Total                     15.6%    11.9%       27.5%
AB-Ve   Count                          1        3           4
        % within bgr                   25.0%    75.0%       100.0%
        % within anemia_status         0.1%     0.3%        0.2%
        % of Total                     0.0%     0.1%        0.2%
AB+Ve   Count                          90       85          175
        % within bgr                   51.4%    48.6%       100.0%
        % within anemia_status         7.8%     8.6%        8.1%
        % of Total                     4.2%     4.0%        8.1%
B-Ve    Count                          13       17          30
        Expected Count                 16.2     13.8        30.0
        % within bgr                   43.3%    56.7%       100.0%
        % within anemia_status         1.1%     1.7%        1.4%
        % of Total                     0.6%     0.8%        1.4%
B+Ve    Count                          336      307         643
        % within bgr                   52.3%    47.7%       100.0%
        % within anemia_status         28.9%    31.0%       29.9%
        % of Total                     15.6%    14.3%       29.9%
O-Ve    Count                          7        14          21
        % within bgr                   33.3%    66.7%       100.0%
        % within anemia_status         0.6%     1.4%        1.0%
        % of Total                     0.3%     0.7%        1.0%
O+Ve    Count                          367      297         664
        % within bgr                   55.3%    44.7%       100.0%
        % within anemia_status         31.6%    30.0%       30.9%
        % of Total                     17.1%    13.8%       30.9%
Total   Count                          1161     990         2151
        % within bgr                   54.0%    46.0%       100.0%
        % within anemia_status         100.0%   100.0%      100.0%
        % of Total                     54.0%    46.0%       100.0%

Among all the respondents with an O+ve blood group, 55.27% are found anemic. This proportion is 52.26%, 56.85% and 51.43% in the respondents with B+ve, A+ve and AB+ve groups respectively. In the other blood groups, 41% of respondents are anemic, which is statistically significant but medically not significant.

Fig.5. Blood Group with Anemia Status
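Every probability and percentage in Tables II and III is derived from the cell counts alone. As an illustrative sketch (the variable and function names are hypothetical, and only the four sex counts from Table II are used), the Python below recomputes the row and column probabilities and checks that P(Anemic | Female) read directly from the Female row agrees with the value obtained through Bayes' theorem.

```python
# Cell counts copied from Table II: (sex, anemia status) -> number of students.
counts = {
    ("Female", "Anemic"): 730,
    ("Female", "NonAnemic"): 399,
    ("Male", "Anemic"): 431,
    ("Male", "NonAnemic"): 591,
}
total = sum(counts.values())


def marginal(sex=None, status=None):
    """Sum the cells that match the given sex and/or status (None = any)."""
    return sum(
        n
        for (s, a), n in counts.items()
        if (sex is None or s == sex) and (status is None or a == status)
    )


p_female = marginal(sex="Female") / total
p_anemic = marginal(status="Anemic") / total
p_female_given_anemic = counts[("Female", "Anemic")] / marginal(status="Anemic")

# P(Anemic | Female) read directly from the Female row of the table ...
direct = counts[("Female", "Anemic")] / marginal(sex="Female")
# ... and via Bayes' theorem: P(a|b) = P(b|a) * P(a) / P(b).
via_bayes = p_female_given_anemic * p_anemic / p_female

print(f"P(Female)          = {p_female:.3f}")
print(f"P(Anemic)          = {p_anemic:.3f}")
print(f"P(Female | Anemic) = {p_female_given_anemic:.3f}")
print(f"P(Anemic | Female) = {direct:.3f} (direct), {via_bayes:.3f} (Bayes)")
```

The same `marginal` helper would work for the blood group table if its eight rows of counts were entered in place of the two sex rows.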


It is shown in the table that, among the anemic students, the O+ve blood group is the most common (31.6%), followed by B+ve (28.9%) and A+ve (28.9%). The negative (-ve) blood groups together account for only 3.63% of the respondents.

In the present research study, a prediction model is developed using the Naive Bayes classifier, which can predict anemia from the CBC data-set. The whole sample of 2151 students has been split into two parts: 80% as the training set and 20% as the testing set. Anemia in students is then predicted with the anemia status as the target variable, taking the 16 parameters into consideration.

Method: 10 fold Cross-Validation
'Positive' Class: Anemic

Model Evaluation

The model built from the training set is evaluated on the test data set. The resulting confusion matrix and accuracy statistics are as follows.

TABLE IV. CONFUSION MATRIX

Confusion Matrix               Predicted Value
                               Anemic      Non-Anemic
Actual Value    Anemic         193 (TP)    25 (FN)
                Non-Anemic     39 (FP)     173 (TN)

The above confusion matrix shows the values of TP (True Positive), TN (True Negative), FN (False Negative) and FP (False Positive). It relates the predicted values to the actual values, from which the accuracy of the model can be calculated as follows:

$$\text{Accuracy} = \frac{TP + TN}{TP + FP + TN + FN} = \frac{193 + 173}{193 + 39 + 173 + 25} = \frac{366}{430} \approx 0.85$$

TABLE V. MODEL EVALUATION STATISTICS

Model Evaluation with Statistics
Accuracy                  0.8512     Pos Pred Value         0.8853
P-Value [Acc > NIR]       < 2e-16    Neg Pred Value         0.8160
Kappa                     0.702      Prevalence             0.5395
Mcnemar's Test P-Value    0.1042     Detection Rate         0.4488
Sensitivity               0.8319     Detection Prevalence   0.5070
Specificity               0.8737     Balanced Accuracy      0.8528
'Positive' Class          Anemic

P-Value [Acc > NIR]: < 2e-16
Here, the p-value is less than 0.05. Hence the null hypothesis, that the accuracy is no better than the no-information rate (NIR), is rejected at a 5% level of significance. Thus, it can be said that the CBC report can predict anemia [25].

Kappa: 0.702
Kappa values are always less than or equal to 1. A value of 1 shows perfect agreement and values less than 1 show less than perfect agreement. Here the Kappa value is 0.702, which is reasonably close to 1 and indicates good, though not perfect, agreement between the predicted and actual classes [27].

Sensitivity: 0.8319
Sensitivity here means the test's ability to detect anemia in students who have low hemoglobin. The sensitivity of the test is the proportion of anemic students whose test is positive for anemia [24].

Specificity: 0.8737
Specificity is the ability of the test to correctly rule out anemia in students who do not have it, that is, the proportion of non-anemic students whose test is negative [24]. A test with high specificity rarely reports low hemoglobin in non-anemic students, so a positive result (a low level of hemoglobin) indicates a high probability that anemia is present.

This shows that the Naive Bayes classifier model can predict whether a student has anemia or not with an accuracy of about 85%.

Bayesian Theorem

The Bayesian theorem provides a way to calculate the posterior probability P(a|b) from P(a), P(b), and P(b|a). The Naive Bayes classifier assumes that the effect of a predictor (b) on a given class (a) is independent of the other predictor values.

$$P(a \mid b) = \frac{P(b \mid a)\,P(a)}{P(b)} \tag{1}$$

a = anemia_status (Anemic | NonAnemic)
b1 = hb, b2 = RBC, b3 = pcv, b4 = mcv, b5 = mch, b6 = mchc, b7 = rdw, b8 = wbc, b9 = platelet.

• P(a|b) is the posterior probability of the class (target) given the predictor (attribute).
• P(a) is the prior probability of the class.
• P(b|a) is the likelihood, which is the probability of the predictor given the class.
• P(b) is the prior probability of the predictor.

Classification of the effect of CBC parameters by Naive Bayes

Naive Bayes gives the conditional probability. It shows whether the student is anemic or non-anemic on the basis of 9 parameters in normal/abnormal condition. This is shown in Table VI.

Bayesian Theorem Formula

$$P(a \mid b_1, \dots, b_9) \propto P(b_1 \mid a)\,P(b_2 \mid a)\,P(b_3 \mid a) \cdots P(b_9 \mid a)\,P(a)$$

TABLE VI. FEMALE ANEMIC PREDICTION PROBABILITY TABLE

Prediction of Female Anemic on Normal Condition
Parameter   Condition    Sample       Probability
Sex         Female       1129/2151    0.525
HB          Normal       399/1129     0.353
RBC         Normal       964/1129     0.854
PCV         Normal       520/1129     0.460
MCV         Microcytic   623/1129     0.551
MCH         Normal       330/1129     0.292
MCHC        Normal       113/1129     0.100
RDW         Normal       869/1129     0.770
WBC         Normal       978/1129     0.866
Platelet    Normal       1113/1129    0.986
Anemia                                0.0007

Prediction of Female Anemic on Abnormal Condition
Parameter   Condition    Sample       Probability
Sex         Female       1129/2151    0.525
HB          Abnormal     730/1129     0.646
RBC         Abnormal     165/1129     0.146
PCV         Abnormal     609/1129     0.539
MCV         Microcytic   623/1129     0.551
MCH         Abnormal     799/1129     0.708
MCHC        Abnormal     1016/1129    0.900
RDW         Abnormal     260/1129     0.230
WBC         Abnormal     151/1129     0.133
Platelet    Abnormal     16/1129      0.014
Anemia                                0.000004

Case: 1
When sex is female and all other parameters are normal, the probability of anemia is explored in the following table.

TABLE VII. NORMAL PARAMETER FEMALE ANEMIA PREDICTION

Normal Parameter Female Anemic Prediction
Sex    F
HB     Normal
RBC    Normal
PCV    Normal
MCV    Microcytic
MCH    Normal
MCHC   Normal
RDW    Normal
WBC    Normal
Plt    Normal
Anemia ?

$$P(\text{anemia} \mid F_{\text{normal}}) = P(F_{\text{normal}}) \cdot P(\text{HB-N} \mid \text{anemia}) \cdot P(\text{RBC-N} \mid \text{anemia}) \cdot P(\text{PCV-N} \mid \text{anemia}) \cdot P(\text{MCV-Microcytic} \mid \text{anemia}) \cdots P(\text{Platelet-N} \mid \text{anemia})$$

$$= 0.525 \times 0.854 \times 0.460 \times 0.551 \times 0.292 \times 0.100 \times 0.770 \times 0.866 \times 0.986 \times 0.353$$

$$= 0.00077, \text{ reported as } P(\text{anemia} \mid F_{\text{normal}}) = 0.0007$$

Case: 2
When sex is female and all other parameters are abnormal, the probability of anemia is explored in the following table.

TABLE VIII. ABNORMAL PARAMETER FEMALE ANEMIA PREDICTION

Abnormal Parameter Female Anemic Prediction
Sex    F
HB     Abnormal
RBC    Abnormal
PCV    Abnormal
MCV    Microcytic
MCH    Abnormal
MCHC   Abnormal
RDW    Abnormal
WBC    Abnormal
Plt    Abnormal
Anemia ?

$$P(\text{anemia} \mid F_{\text{abnormal}}) = P(\text{RBC-Ab} \mid F) \cdot P(\text{PCV-Ab} \mid F) \cdot P(\text{MCV-Microcytic} \mid F) \cdots P(F)$$

$$= 0.000004 \quad (4.012992 \times 10^{-6})$$

TABLE IX. MALE ANEMIC PREDICTION PROBABILITY TABLE

Prediction of Male Anemic on Normal Condition
Parameter   Condition    Sample       Probability
Sex         Male         1022/2151    0.475
HB          Normal       591/1022     0.578
RBC         Normal       906/1022     0.886
PCV         Normal       837/1022     0.818
MCV         Microcytic   385/1022     0.376
MCH         Normal       483/1022     0.472
MCHC        Normal       246/1022     0.240
RDW         Normal       913/1022     0.893
WBC         Normal       939/1022     0.918
Platelet    Normal       1017/1022    0.995
Anemia                                0.007

Prediction of Male Anemic on Abnormal Condition
Parameter   Condition    Sample       Probability
Sex         Male         1022/2151    0.475
HB          Abnormal     431/1022     0.421
RBC         Abnormal     116/1022     0.113
PCV         Abnormal     185/1022     0.181
MCV         Microcytic   385/1022     0.376
MCH         Abnormal     539/1022     0.527
MCHC        Abnormal     776/1022     0.759
RDW         Abnormal     109/1022     0.106
WBC         Abnormal     83/1022      0.081
Platelet    Abnormal     5/1022       0.004
Anemia                                0.00000002

Case: 3
When sex is male and all other parameters are normal, the probability of anemia is explored in the following table.

TABLE X. NORMAL PARAMETER MALE ANEMIA PREDICTION

Normal Parameter Male Anemic Prediction
Sex    M
HB     Normal
RBC    Normal
PCV    Normal
MCV    Microcytic
MCH    Normal
MCHC   Normal
RDW    Normal
WBC    Normal
Plt    Normal
Anemia ?

$$P(\text{anemia} \mid M_{\text{normal}}) = P(\text{RBC-N} \mid \text{anemia}) \cdot P(\text{PCV-N} \mid \text{anemia}) \cdot P(\text{MCV-Microcytic} \mid \text{anemia}) \cdots P(M_{\text{normal}})$$

$$= 0.886 \times 0.818 \times 0.376 \times 0.472 \times 0.240 \times 0.893 \times 0.918 \times 0.995 \times 0.578 \times 0.475$$

$$= 0.0069 \approx 0.007 = P(\text{anemia} \mid M_{\text{normal}})$$

Case: 4
When sex is male and all other parameters are abnormal, the probability of anemia is explored in Table XI.
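The statistics in Table V can be recomputed from the four cells of the confusion matrix. The sketch below is an illustration in Python, not the authors' code; the layout of Table V resembles the output of the `confusionMatrix` function in R's caret package, but that attribution is an assumption. One point is worth noting: the published sensitivity (0.8319) and specificity (0.8737) are reproduced only when the 39 misclassified students are counted as false negatives and the 25 as false positives, which is the reverse of the FP and FN labels printed in Table IV. The code uses that assignment and marks it as an assumption.

```python
import math

# Diagonal cells as printed in Table IV.
tp, tn = 193, 173
# Assumption: 39 false negatives and 25 false positives. This is the
# assignment that reproduces Table V; Table IV labels them the other way.
fn, fp = 39, 25

n = tp + tn + fp + fn
accuracy = (tp + tn) / n
sensitivity = tp / (tp + fn)            # share of actual anemic detected
specificity = tn / (tn + fp)            # share of actual non-anemic cleared
pos_pred_value = tp / (tp + fp)
neg_pred_value = tn / (tn + fn)
prevalence = (tp + fn) / n
detection_rate = tp / n
detection_prevalence = (tp + fp) / n
balanced_accuracy = (sensitivity + specificity) / 2

# Cohen's kappa: observed agreement corrected for chance agreement.
p_expected = ((tp + fn) * (tp + fp) + (tn + fp) * (tn + fn)) / n**2
kappa = (accuracy - p_expected) / (1 - p_expected)

# McNemar's test with continuity correction; the p-value is the upper tail
# of a chi-square distribution with 1 degree of freedom.
chi2 = (abs(fn - fp) - 1) ** 2 / (fn + fp)
mcnemar_p = math.erfc(math.sqrt(chi2 / 2))

for name, value in [
    ("Accuracy", accuracy), ("Kappa", kappa), ("McNemar p", mcnemar_p),
    ("Sensitivity", sensitivity), ("Specificity", specificity),
    ("Pos Pred Value", pos_pred_value), ("Neg Pred Value", neg_pred_value),
    ("Prevalence", prevalence), ("Detection Rate", detection_rate),
    ("Detection Prevalence", detection_prevalence),
    ("Balanced Accuracy", balanced_accuracy),
]:
    print(f"{name:<22}{value:.4f}")
```

Accuracy is unaffected by how the two off-diagonal cells are labeled, which is why the 0.85 figure holds under either reading.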

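Each of the four cases multiplies the ten probabilities listed for one condition in Tables VI and IX. The following sketch reproduces that arithmetic in Python; the dictionary keys are hypothetical labels, and the factor lists are the Probability columns of those tables in row order (Sex, HB, RBC, PCV, MCV, MCH, MCHC, RDW, WBC, Platelet). It mirrors the paper's calculation, an unnormalized product of proportions, and is not a full Naive Bayes posterior, which would also divide by the evidence term P(b).

```python
import math

# Probability columns of Tables VI (female) and IX (male), in row order:
# Sex, HB, RBC, PCV, MCV, MCH, MCHC, RDW, WBC, Platelet.
factors = {
    "female_normal":   [0.525, 0.353, 0.854, 0.460, 0.551,
                        0.292, 0.100, 0.770, 0.866, 0.986],
    "female_abnormal": [0.525, 0.646, 0.146, 0.539, 0.551,
                        0.708, 0.900, 0.230, 0.133, 0.014],
    "male_normal":     [0.475, 0.578, 0.886, 0.818, 0.376,
                        0.472, 0.240, 0.893, 0.918, 0.995],
    "male_abnormal":   [0.475, 0.421, 0.113, 0.181, 0.376,
                        0.527, 0.759, 0.106, 0.081, 0.004],
}

# Product of the ten proportions for each case (requires Python 3.8+).
scores = {case: math.prod(values) for case, values in factors.items()}

for case, score in scores.items():
    print(f"{case:<16}{score:.6e}")
```

Because products of many small proportions shrink quickly, implementations often sum logarithms instead of multiplying raw probabilities; for four cases of ten factors each, the direct product is still well within floating-point range.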

TABLE XI. ABNORMAL PARAMETER MALE ANEMIA PREDICTION

Abnormal Parameter Male Anemic Prediction
Sex    M
HB     Abnormal
RBC    Abnormal
PCV    Abnormal
MCV    Microcytic
MCH    Abnormal
MCHC   Abnormal
RDW    Abnormal
WBC    Abnormal
Plt    Abnormal
Anemia ?

$$P(\text{anemia} \mid M_{\text{abnormal}}) = P(\text{RBC-Ab} \mid \text{anemia}) \cdot P(\text{PCV-Ab} \mid \text{anemia}) \cdot P(\text{MCV-Microcytic} \mid \text{anemia}) \cdots P(M_{\text{abnormal}})$$

$$= 0.113 \times 0.181 \times 0.376 \times 0.527 \times 0.759 \times 0.106 \times 0.081 \times 0.004 \times 0.421 \times 0.475$$

$$= 0.0000000211 \quad (2.112632 \times 10^{-8}) = P(\text{anemia} \mid M_{\text{abnormal}})$$

Fig.6. Importance of parameter

The above chart shows the importance of the different CBC parameters, among which the HB parameter is the most important. Therefore, it can be said that HB plays a significant role in the prediction of anemia. The PCV, MCV, RDW, WBC and MCH parameters are also important, in line with the research of the Department of Pathology at the Utah University of Medicine [26]. These parameters play an important role in predicting different types of anemia such as iron deficiency anemia, vitamin B12 deficiency anemia, sickle cell anemia, aplastic anemia and anemia of chronic disease. The remaining parameters, such as neutrophil, lymphocyte, eosinophil, monocyte and blood group (bgr), are statistically significant but medically not significant.

IV. CONCLUSION

In this research study, an analysis has been conducted on the CBC data set of the students of higher education in Gujarat Vidyapith, in which the proportions of various parameters have been tested. The analysis revealed that the proportion of anemic students was 54% (1161), while the proportion of non-anemic students was 46% (990). Therefore, anemia may be one of the reasons for the reduction in the number of students.

Thus, the findings of the study clearly indicate that the diagnosis of anemia should be done at an early stage so that serious illness can be prevented. The health of the students can be improved and the drop-out ratio can also be reduced.

V. REFERENCES

1. Jaiswal, Manish, Anima Srivastava, and Tanveer J. Siddiqui. "Machine Learning Algorithms for Anemia Disease Prediction." In Recent Trends in Communication, Computing, and Electronics, pp. 463-469. Springer, Singapore, 2019.
2. Induja, S. N., and C. G. Raji. "Computational Methods for Predicting Chronic Disease in Healthcare Communities." In 2019 International Conference on Data Science and Communication (IconDSC), pp. 1-6. IEEE, 2019.
3. Shetty, Badreesh. "Supervised Machine Learning: Classification." Towards Data Science, 2018.
4. Saxena, Shruti. "Precision vs Recall." Towards Data Science, 2018.
5. Jaitley, Urvashi. "Why Data Normalization is necessary for Machine Learning models." medium.com, 2018.
6. Abdullah, Manal, and Salma Al-Asmari. "Anemia type's prediction based on data mining classification algorithms." Taylor & Francis Group, London, 2017.
7. Stecanella, Bruno. "A practical explanation of a Naive Bayes classifier." https://monkeylearn.com/blog/practical-explanation-naive-bayes-classifier/, 2017.
8. C.H. Yu, M. Bhatnagar, R. Hogen, D. Mao, A. Farzindar, K. Dhanireddy. "Anemic Status Prediction using Multilayer Perceptron Neural Network Model." In EPiC Series in Computing, vol. 50, pp. 213-220, 2017.
9. Joshi, Renuka. "Accuracy, Precision, Recall & F1 Score: Interpretation of Performance Measures." Exsilio Blog, 2016.
10. Brownlee, Jason. "What is a Confusion Matrix in Machine Learning." 2016.
11. Medhekar, Dhanashree S., Mayur P. Bote, and Shruti D. Deshmukh. "Heart Disease Prediction System using Naive Bayes." In IJERSTE, 2013.
12. Data School. "Simple guide to confusion matrix terminology." 2014.
13. Tefferi, Ayalew, Curtis A. Hanson, and David J. Inwards. "How to Interpret and Pursue an Abnormal Complete Blood Cell Count in Adults." Mayo Clin Proc, 2005.
14. Ogedegbe, Henry O., Laszlo Csury, and Byron H. Simmons. "Anemias: A Clinical Laboratory Perspective." Laboratory Medicine, 2004.
15. Pattekari, Shadab Adam, and Asma Parveen. "Prediction System For Heart Disease Using Naive Bayes."
16. Mehta, Vinod Kumar. "Anemia In Urban and Rural School Girls Aged 12-16 Years Shimla - A Comparative Study." National Institute of Epidemiology, 2004.
17. Bessman, J. David, P. Ridgway Gilmer, Jr., and Frank H. Gardner. "Improved Classification of Anemias by MCV and RDW." American Society of Clinical Pathologists, 1983.
18. Bessman, J. David, and Randall K. Johnson. "Erythrocyte volume distribution in normal and abnormal subjects."
19. Kariyeva, G.K., A. Magtymova, and A. Sharman. "Anemia," chapter 12.
20. Naive Bayesian, https://www.saedsayad.com/naive_bayesian.htm.
21. Narkhede, Sarang. "Understanding Confusion Matrix." TD Science.
22. NHNES, Laboratory Procedure Manual.
23. Anand, Avati. Evaluation Metrics (Classifiers), CS229 Section.
24. Sensitivity and Specificity. From Wikipedia.
25. https://www.investopedia.com/terms/p/p-value.asp.
26. Agarwal, Archana M., M.D. Diagnostic approach to Anemia.
27. http://www.pmean.com/definitions/kappa.htm.

