Professional Documents
Culture Documents
Abstract--Online Social networks are most popular among the II. LITERARURE REVIEW
people in some past years. People use social networking places to
connect through their relatives, precious ones, friends and The few application of medical data mining as compared to
colleagues for social contacts. As the data increase rapidly and this other fields described their skill in trying to repeatedly attain
create issue related to security and privacy in online social networks. medical data from clinical records. They did some
So, therefore, retrieval of information about the trends and problems experimentations on three clinical databases and the directions
in online social networks. Datasets for online social media networks brought are used to comparison against a set of strong clinical
to be analyzed and visualized by client or by user. For classification rules. Previous research in dealing with this problem can be
and visualization of data, WEKA tool is used. Holistic approach is defined with the next approaches:
considered to classify and analyze the diabetes dataset for data
processing. In this research paper for preprocessing prediction, we • Determine all rules initially and then permit the user to
used diabetes.arff dataset. WEKA tool is very useful for classification
enquiry and repossess those he/she is involved in. This
of data and for analyzing the dataset diabeties.arrf. Result of this
research helps us in prediction that may or may not be individually typical approach is that of patterns [3]. This approach
infected from the diabetes. In this research paper it is evaluated that permits the user to investigate what rules he/she is involved
peoples are infected from diabetes those have age greater or equal to as patterns. Then the system uses the patterns to repossess
40 and mass greater than 35. Beside this, the peoples those are no the rules that match the patterns from the set of discovered
suffered from diabetes have age and mass less than 35. To analyze rules.
any person should not allow to modify the data who does not have • Use restrictions to constrain the mining process to produce
authority. Similarly, Dhawan and Ekta discoursed numerous bogus only related rules. [2] offers an algorithm that can take item
profile of detection procedures in Social Networks which help out the
limitations specified by the user in the association rule
user’s data to keep the safe from damaged.
mining process so that only those rules that satisfy the user
Keywords—WEKA, analysis, classification, dataset, machine specified item constraints are produced. This also does not
learning work well for doctors frequently do not have any specific
rules to mine.
I. INTRODUCTION • Find unpredicted rules. This approach initially requests the
Many people share their ideas, media, feelings on social user to require his/her existing information about the field.
media networks by connecting with each other. When peoples The system finds those unpredicted rules [5].
connect on social networks the data is being generated in very
large amount and at very high rate. Data is generating at large A good amount of data mining research exists in the field of
scale of due to production and development at large scale in an medical diagnosis. It is worthwhile at the outset to take a stock
organization. Social network site necessity to mine the past data of recent development related to the proposed research.
to improve their products and services. Data cannot be modified
by any unauthorized user when he analyzes the data. Ekta and Researchers have applied Incremental Learning and
Dhawan discussed many profile of fake detection techniques in Decision Tree and for the observed symptoms of cardiac and
social media networks which protect user’s data from damage. diabetes at the early stage [1]. Dataset is collected from the
Objects of data sets are classified related to its similarities. The patient data set which were logged in the clinical record of the
most used method and best known is classification. The target hospital. Dataset is analysed by algorithm for classification like
class of object is accurately predicted by classification of which some models name d as decision trees and classification rules.
the class label is unknown. Classification algorithms are
provided in nine groups in WEKA implementation. Following Researchers have developed a decision supporting system
algorithms are selected named as Naïve Bayes, logistic, J48. for analysis of disease that makes use of mining techniques of
data [2]. There are almost three different classifier algorithms
based on artificial intelligence named as Naive Bayes,
Multilayer Perceptron and J.48 were applied on data set of • Comparative behaviour of different algorithms for several
Diabetes. These classifiers are usually implemented in the models are selected as based upon their efficiency.
fields of biomedical engineering, data-mining and medical Evaluation phase measures the degree to which one model
diagnosing the patients. meets the required objectives.
V. CONCLUSION
We can get information from dataset by data mining.
Obtaining information from the data mining help the
organization in improving their business and products. We can
perform data mining efficiently and precisely by through Weka
tool. This research paper evidences the WEK’s performance to
analyze the diabetes data. Three different algorithms are used to
ranking high and lower attributes which predict weather the
individual may infected or have no symptoms of diabetes.
13%
SIMILARITY INDEX
PRIMARY SOURCES
1 es.scribd.com
Internet 208 words — 10%
2 Marinov, M., A. S. M. Mosa, I. Yoo, and S. A. Boren.
"Data-Mining Technologies for Diabetes: A Systematic
30 words — 1%
Review", Journal of Diabetes Science and Technology, 2011.
Crossref
3 www.gezondheidsaward.nl
Internet 18 words — 1%
4 www.educba.com
Internet 13 words — 1%
5 www.science.gov
Internet 8 words — < 1%