Professional Documents
Culture Documents
DIABETES
by
K. Kanmani
Asst. professor of Computer Applications
SRM University.
Data Processing
Attribute Identification and selection
Handling Missing values
Numerical Discretization
Decision Tree Construction
separate data into fixed number of
partitions
Methodology - contd
Select the first fold for testing, remaining folds are
used for training
Perform classification and obtain performance metrics
Select the next partition as testing and use the rest as
training data
Repeat classification until each partition has been
used as the test set
Data set - Attributes
Pregnancies Number of times pregnant
Glucose Plasma glucose concentration a 2 hours in an
oral glucose tolerance test
Blood Pressure Diastolic blood pressure
Skin Thickness Triceps skin fold thickness
Insulin2-Hour serum insulin
BMI Body mass index (weight in kg/(height in m)^2)
Age (years)
Outcome Class variable (0 or 1)
conclusion
The aim of the research work is to investigate the risk
factors associated with the diabetes and to make
accurate predictions
Decision tree is one of the most powerful and widely
applied technique for classification and prediction. It
produces reasonably good models. It gives benefits to
society to make decision regarding the diagnosis of
diabetes.
References
[1] Jiawei and kamber, “Data Mining concepts and techniques”
[2] Maham Jahangir et.al, “An expert system for diabetes prediction using
Auto tuned Multi layer Perceptron”, IEEE, 2017, pp .722-728
[3] Asma A. Aljarullah, “Decision Tree Discovery for the diagnosis of Type
II Diabetes”, IEEE, 2011, pp. 303-307
[4] Nongayo Nai arun, Rungruttikarn Moungmai, “ Comparison of
classifiers for the risk of diabetes prediction”, elsevier, 2015
[5]Haroon kaur, Shalini Batra, “HPCC: An ensembled framework for the
prediction of the onset of diabetes”, IEEE International conference on
signal processing computing and control, 2017.