Professional Documents
Culture Documents
Abstract–Nephrolithiasis is a disease with a high and even Probability of Recurrence in each patient would be a
rising incidence. It has a high morbidity, generates high costs challenging factor for further treatments and analysis for
and has a high recurrence rate. Metabolic evaluation in renal medical society. This can be a challenging prediction task
stone formers allows the identification and quantification of risk due to high numbers of contributing attributes in formation of
factors and establishment of individual risk profiles. Based on
these individuals risk profiles, rational therapy for metaphylaxis
renal stones. This problem forces the medical community to
of renal stones lowers stone recurrence rate significantly. make use of data mining techniques to enhance the quality
The purpose of this article is metabolic investigation in and confidence of recurrence predictions.
patients with nephrolithiasis in Isfahan city- Iran. Different data In this article 3 steps including extracting association
mining algorithms such as Clustering and Classification were rules, clustering the data into appropriate numbers of clusters
employed for extracting knowledge in the form of decision rules. to find appropriate groups of patients and finally designating
These results evaluate the risk of morbidity and recurrence of a classifier based on the recurrence event attribute in the
the diseases. observed society and ranking the contributing features.
Some medical attributes gathered based on their medical Preprocessing the data and relevance analysis as the first
importance. The data mining tasks applied in this research have
been applied and tested over 406 observed samples collected at
step, would be an important phase to clear the data and skip
different clinics in the city of Isfahan. the impurities.
Association rule mining as a technique to extract hidden
Keywords: Renal Stone Recurrence, nephrolithiasis, rules which are readable and easily interpretable for medical
Association Rules, Clustering, Classification. expert, has been considered widely as an important technique
in data mining community. In our study, we have first
I. INTRODUCTION conducted the association rule mining which can be
considered as the most important way for a medical expert to
Metabolic evaluation in renal stone formers allows the find out the important relations among the different features
identification and quantification of risk factors and and properties of the data; Rules can reveal the associations
establishment of individual risk profiles. Based on these and correlations among various factors which are important
individuals risk profiles, rational therapy for metaphylaxis of on recurrence of the renal stone and effects, caused by other
renal stones lowers stone recurrence rate significantly. derangements for nephrolithiasis patients such as
This article will focus on the ways to extract useful hypocitraturia, hyperoxaluria, low urinary volume,
knowledge which can come handy in for division of different hyperuricosuria, hypercalciuria and cystinuria. This study
patients and predicting the future status of the new patients. would be conducted by providing the support and confidence
What have been proposed are based on different approaches measurements which are considered as two of the best
in data mining, which tries to extract high quality patterns. metrics to show the quality of each rule.
These patterns could be used as a prediction and analysis tool Second method which would be considered in our study is
in the studied region. clustering. After preprocessing the data, we have conducted
In this paper, Association mining, Clustering and hierarchical and partitional clustering techniques and
Classification techniques due to different usages and analysis compared them in terms of some popular quality
have be used to tackle the mining task from different aspects. measurements.
These data mining tasks and corresponded analysis could In hierarchical clustering study we will conduct the single
be so important due to different attributes which would be linkage, average linkage and complete linkage clustering
considered as effective ones for formation of renal stones. algorithms corresponding their cophenetic distance as the
22 key features based on their medical importance have measure of how the linkage algorithm would affect the
been derived by personal soliciting forms and medical Euclidean distance matrix of data. Dendrograms as the visual
examinations such as CT- scan. plots which could be easily considered by medical experts
would be provided for further analysis on the number of
64
Journal of Applied Computer Science & Mathematics, no. 11 (5) /2011, Suceava
65
Computer Science Section
66
Journal of Applied Computer Science & Mathematics, no. 11 (5) /2011, Suceava
TABLE 3: SILHOUETTE METRICS WITH DIFFERENT CLUSTERING NUMBERS TABLE 5: CLASSIFICATION ACCURACY
WITH K-MEANS ALGORITHM
Classification
SVM 1-NN 2-NN 3-NN
2 clusters 3 clusters 4 clusters 5 clusters 6 clusters method
67
Computer Science Section
[6]. K. R. Muller, S. Mika, G. Ratsch, K. Tsuda, and B.Scholkopf, [7]. U. Fayyad, G. Piatetsky-Shapiro, P. Smyth, and R.
“An introduction to kernel-based learning algorithms,” IEEE Uthurusamy. Advances in Knowledge Discovery and Data
Trans. Neural Networks, vol. 12, no. 2, pp.181-201, 2001. Mining. AAAI Press/MIT Press, 1996.
Taghi Adl received the B.Sc. degree in Computer Engineering-Hardware, (2009) from department of Electrical and computer
engineering, Shahid Beheshti University, M.Sc. in Computer Architecture (2011) from department of Electrical and computer
engineering, Isfahan University of Technology. In 2010, he joined Data mining lab in Isfahan University of technology. His
current research interest includes data mining.
Arash Givchi received the B.Sc. degree in Computer Engineering-Software (2009) from department of Electrical and
computer engineering, Isfahan University, M.Sc. in Artificial Intelligence and Robotic (2011) from department of Electrical
and computer engineering, Isfahan University of Technology. In 2010, he joined Data mining lab in Isfahan University of
technology. His current research interest includes data mining and robotic.
Mohamad Saraee received his PhD from University of Manchester in Computation,. His main areas of research are Intelligent
databases, Mining advanced and complex data including medical and Bio, Text Mining and E-Commerce. He has published
extensively in each of these areas and served on scientific and organizing committee on number of journals and conferences.
Amid Eshraghi received the Doctorate degree in general physician(2002) from Medical Science of University of Mashhad,
specialist in internal medicine (2009) from Medical Science of University of Isfahan. He is studying gastroenterologist
subspecialist in Medical Science of University of Tehran.
68